Sean Sube | af1c3c7839 | fix(api): track items removed from cache | 2023-03-06 07:34:09 -06:00 |
Sean Sube | 339868de4d | fix(api): use pipeline class name in cache key | 2023-03-05 21:54:09 -06:00 |
Sean Sube | 55576ae87c | fix(api): include cache keys in logs | 2023-03-05 21:50:01 -06:00 |
Sean Sube | 35dc8a0bc4 | improve exit logging | 2023-03-05 21:37:39 -06:00 |
Sean Sube | c0a01efef4 | fix(api): track currently active worker for each device | 2023-03-05 21:28:21 -06:00 |
Sean Sube | 57fed94337 | fix(api): exit worker on memory allocation errors | 2023-03-05 21:11:33 -06:00 |
Sean Sube | cb460a0c59 | fix(api): add worker PID to log messages | 2023-03-05 20:25:02 -06:00 |
Sean Sube | 4ae4ce176c | fix(api): attempt to recycle leaking workers when a job finishes | 2023-03-05 20:13:28 -06:00 |
Sean Sube | 3a4928e59b | fix(api): prevent workers from blocking on their progress queues | 2023-03-05 20:07:06 -06:00 |
Sean Sube | edc55ae8b4 | fix(api): finished job notification should not block worker | 2023-03-05 19:53:44 -06:00 |
Sean Sube | cfc20d3133 | fix(api): improve cache logging | 2023-03-05 19:30:52 -06:00 |
Sean Sube | 39b9741b24 | fix(api): show VRAM percent in logs | 2023-03-05 19:23:23 -06:00 |
Sean Sube | 7a3a81a4ef | fix(api): track and repeatedly attempt to recycle leaking workers (#219) | 2023-03-05 18:58:13 -06:00 |
Sean Sube | 468870e2e7 | apply lint | 2023-03-05 08:14:47 -06:00 |
Sean Sube | 00fa9c5807 | feat(api): collect labels from textual inversions (#144) | 2023-03-05 08:00:25 -06:00 |
Sean Sube | 6d2dd0a043 | apply lint | 2023-03-05 07:20:43 -06:00 |
Sean Sube | 539d0febce | load extras during server init | 2023-03-05 07:20:37 -06:00 |
Sean Sube | 537c67619e | fix strings endpoint | 2023-03-05 07:20:34 -06:00 |
Sean Sube | 5d459ab17c | feat: allow users to add their own labels for models (#144) | 2023-03-05 07:20:31 -06:00 |
Sean Sube | 628812fb0b | fix(api): correct conversion imports | 2023-03-04 23:03:15 -06:00 |
Sean Sube | 43d4d4b86d | chore(api): dedupe model listing logic | 2023-03-04 22:26:27 -06:00 |
Sean Sube | 10fbafaff0 | fix(api): correct imports | 2023-03-04 22:25:49 -06:00 |
Sean Sube | 7b88371832 | lint(api): use load pattern in server submodule | 2023-03-04 22:13:09 -06:00 |
Sean Sube | bb1d945761 | lint(api): submodules for major libs | 2023-03-04 22:12:35 -06:00 |
Sean Sube | 39d36618e6 | feat(api): add support for custom tokens for textual inversions (#179) | 2023-03-02 23:32:20 -06:00 |
Sean Sube | 22e9d54408 | apply lint | 2023-03-02 22:32:37 -06:00 |
Sean Sube | be1c8cd79f | feat(api): support ckpt inversions by splitting multivector embeddings into multiple tokens | 2023-03-02 22:11:27 -06:00 |
Sean Sube | 9a0d2051fb | fix(api): better logging when converting textual inversions | 2023-03-02 07:57:59 -06:00 |
Sean Sube | 46aac263d5 | feat(api): initial support for textual inversion embeddings from civitai and others (#179) | 2023-03-01 19:09:51 -06:00 |
Sean Sube | 1f3a5f6f3c | fix(api): track completed jobs for each device worker (#170) | 2023-03-01 19:09:18 -06:00 |
Sean Sube | 4b77a00ca7 | fix(api): load tokenizer with textual inversions | 2023-03-01 19:08:31 -06:00 |
Sean Sube | 21fc7c5968 | fix(api): mark all convert methods as no_grad | 2023-03-01 08:26:40 -06:00 |
Sean Sube | 3f9f94fcb5 | apply lint, remove unused | 2023-02-28 23:05:17 -06:00 |
Sean Sube | 2f4ab20f61 | use filename for tensors | 2023-02-28 22:49:53 -06:00 |
Sean Sube | 74aae1b027 | fix(api): write external weights into same directory as optimized model | 2023-02-28 22:47:02 -06:00 |
Sean Sube | dbf9eaf1a4 | fix(api): run shape inference before converting models to fp16 per discussion in https://github.com/microsoft/onnxruntime/issues/14827 | 2023-02-28 22:36:45 -06:00 |
Sean Sube | 86984be5d4 | better error logging during conversion | 2023-02-28 22:36:36 -06:00 |
Sean Sube | 9ef89db8b0 | extract tensors after conversion | 2023-02-28 22:36:33 -06:00 |
Sean Sube | 7e65e21410 | reload model from proto file before converting | 2023-02-28 22:36:26 -06:00 |
Sean Sube | 2210ee849b | only convert inner nodes with ORT conversion helpers | 2023-02-28 22:26:04 -06:00 |
Sean Sube | b532570f54 | fix(api): reuse ORT session params for each device | 2023-02-28 22:24:38 -06:00 |
Sean Sube | 565873b3ae | test half precision on all diffusion models | 2023-02-28 22:23:55 -06:00 |
Sean Sube | 1f9efb433a | apply lint | 2023-02-28 22:04:55 -06:00 |
Sean Sube | 12fb7f52bb | fix(api): sanitize filenames in user input | 2023-02-28 21:56:12 -06:00 |
Sean Sube | c99aa67220 | name threads, max queues, type/lint fixes | 2023-02-28 21:44:52 -06:00 |
Sean Sube | c95ac1fbdd | avoid terminating workers because it breaks their queues | 2023-02-28 08:53:17 -06:00 |
Sean Sube | 0011f079d4 | daemonize queue collectors | 2023-02-28 06:55:15 -06:00 |
Sean Sube | cad0d37604 | some pending queue logging | 2023-02-27 23:43:38 -06:00 |
Sean Sube | 4ae3d9caa2 | remove task done | 2023-02-27 23:18:37 -06:00 |
Sean Sube | 7e0ccdb1af | remove pending queues after joining | 2023-02-27 23:14:20 -06:00 |