1
0
Fork 0
Commit Graph

89 Commits

Author SHA1 Message Date
Sean Sube 4ddd69ba07
fix(api): watch for progress events from leaking workers 2023-03-26 18:22:16 -05:00
Sean Sube e1219cca90
fix(api): close queues after stopping workers 2023-03-26 18:21:53 -05:00
Sean Sube bb5d0635d1
sonar lint 2023-03-26 15:53:20 -05:00
Sean Sube 36bfccae09
fix(api): include worker totals in status endpoint 2023-03-26 15:49:43 -05:00
Sean Sube 2d2283e1eb
fix(api): attempt to read progress updates from recycled workers 2023-03-26 15:31:16 -05:00
Sean Sube 27500eccb5
fix(api): do not move jobs from pending to running until progress is reported 2023-03-26 14:22:59 -05:00
Sean Sube 0af406c47f
only enqueue jobs from progress worker 2023-03-26 14:02:57 -05:00
Sean Sube ccf8d51e08
feat(api): split up status endpoint by job status 2023-03-26 11:57:00 -05:00
Sean Sube ea36082e43
add job count to healthy worker logs 2023-03-26 11:53:06 -05:00
Sean Sube 8eab92a7df
define device on pending job 2023-03-26 11:49:58 -05:00
Sean Sube 83884bcafa
enqueue jobs on idle workers during progress check 2023-03-26 11:48:27 -05:00
Sean Sube 14ade83937
fix(api): enqueue next job when previous one finishes and after recycling worker 2023-03-26 11:41:45 -05:00
Sean Sube f3ab25f671
lint(api): add start method to worker pool 2023-03-26 11:30:07 -05:00
Sean Sube 2b179bebac
fix(api): always reset job counter when creating new device worker 2023-03-26 11:22:03 -05:00
Sean Sube 55e44e8ac9
fix(api): increment job counter for worker when it starts a new job (#283) 2023-03-26 11:18:27 -05:00
Sean Sube e552a5560f
feat(api): check device worker pool and recycle on a regular interval (#284) 2023-03-26 11:09:22 -05:00
Sean Sube aeb71ad50a
lint lock name 2023-03-26 08:30:34 -05:00
Sean Sube 95a61f3573
fix(api): restart worker threads when their respective queues are full 2023-03-25 13:46:12 -05:00
Sean Sube 88f4713e23
fix(api): use lock when restarting workers 2023-03-25 09:47:51 -05:00
Sean Sube 6b4c046867
pass pool to threads 2023-03-22 22:58:46 -05:00
Sean Sube 86c1b29c31
lint(api): extract worker thread main functions (#279) 2023-03-22 22:55:34 -05:00
Sean Sube 4dd68ea6b6
fix(api): restart worker threads if they crash 2023-03-22 19:58:46 -05:00
Sean Sube 0732058aa8
feat(api): detect Textual Inversion type from keys (#262) 2023-03-19 20:16:52 -05:00
Sean Sube aefa5b4613
fix(api): clear job cancelled flag when worker starts a new job (#269) 2023-03-19 17:57:14 -05:00
Sean Sube 2e89fd43d3
fix(api): only remove running jobs from running state 2023-03-18 19:21:40 -05:00
Sean Sube e5862d178c
fix(api): assume inversion tokens are embeddings for now 2023-03-18 18:35:11 -05:00
Sean Sube 1d52dc684d
init last progress on worker context 2023-03-18 17:27:41 -05:00
Sean Sube e08a9aa2ab
add pending job list to pool 2023-03-18 17:26:28 -05:00
Sean Sube 8cbdad3a71
feat(api): add pending field to image ready response 2023-03-18 17:25:13 -05:00
Sean Sube 15b6e036e1
fix(api): maintain list of pending jobs 2023-03-18 17:15:18 -05:00
Sean Sube 588c8c7fdb
fix(api): track last progress within worker 2023-03-18 15:32:49 -05:00
Sean Sube 5106dd48a9
remove another ref to finished queue 2023-03-18 15:27:07 -05:00
Sean Sube b026566ccb
remove remaining references to finished queue and worker 2023-03-18 15:26:19 -05:00
Sean Sube d1565b056e
apply lint, make missing images an error 2023-03-18 15:16:41 -05:00
Sean Sube 7cf5554bef
feat(api): add error flag to image ready response 2023-03-18 15:13:42 -05:00
Sean Sube aec540a524
feat(api): add server setting for CUDA memory limit (#211) 2023-03-18 13:40:37 -05:00
Sean Sube 226710a015
fix(api): use exception level logs 2023-03-16 22:29:07 -05:00
Sean Sube 4b832f3d8d
more lint, more trace 2023-03-16 20:22:20 -05:00
Sean Sube c8c5e9f42e
fix(api): handle more out-of-memory errors in the workers 2023-03-16 18:34:28 -05:00
Sean Sube b2eb406197
fix(api): handle CUDA memory errors in workers 2023-03-15 08:51:29 -05:00
Sean Sube 9555a7a3ea
lint(api): only log new worker message if some workers need to be restarted 2023-03-11 13:30:54 -06:00
Sean Sube 66c42485cb
feat(api): add support for extremely long prompts 2023-03-07 19:00:25 -06:00
Sean Sube 9d9bd1a639
apply lint 2023-03-07 08:02:53 -06:00
Sean Sube 35dc8a0bc4
improve exit logging 2023-03-05 21:37:39 -06:00
Sean Sube c0a01efef4
fix(api): track currently active worker for each device 2023-03-05 21:28:21 -06:00
Sean Sube 57fed94337
fix(api): exit worker on memory allocation errors 2023-03-05 21:11:33 -06:00
Sean Sube cb460a0c59
fix(api): add worker PID to log messages 2023-03-05 20:25:02 -06:00
Sean Sube 4ae4ce176c
fix(api): attempt to recycle leaking workers when a job finishes 2023-03-05 20:13:28 -06:00
Sean Sube 3a4928e59b
fix(api): prevent workers from blocking on their progress queues 2023-03-05 20:07:06 -06:00
Sean Sube edc55ae8b4
fix(api): finished job notification should not block worker 2023-03-05 19:53:44 -06:00