Replies: 67 comments · 147 replies
Only XL?
Please tell me where to place the antelopev2 model in the Forge file path.
E:\webui_forge\webui\models\insightface\models\antelopev2
Your help is a godsend, you're the kindest soul around, thank you so much; blessings will come your way for sure.
The results I'm getting are very similar to the above. For some reason, they look quite "washed out" of color. Not sure why.
You mean the contrast? I use a "painting by Vallejo and Frazetta" prompt. Maybe try raising the CFG scale to get more contrast; I personally don't like it and use levels later to bring the contrast back. All of those are CFG 3 and 20 steps.
I also have an RTX 3090 (i9-12900K, 32 GB RAM) and I generate (with InstantID) a 1024x1024 image in 21 s; with highres fix x2 it takes 1 m 15 s. It's long, but far from 2 minutes.
Works great, thank you for your efforts. Things to watch out for at the moment:
Haha, no, InstantID only detects face keypoints, and the body pose is random.
Well, my results have something obscuring the face (fingers, a wardrobe element, or hair) far too often compared to not using InstantID, especially near the lips/jaw, which is super strange; I don't think it's a coincidence.
Thanks for sharing @hben35096. They may have meant up to 1024^2; these settings look good for me at (1024, 1024) 🍻
Could you please guide me on how to fix this error? Thank you.
I'm getting very strange behaviour: if I set the size below 1024x1024, ControlNet doesn't even work!?
Awesome, this time I didn't have to make a Gradio demo and it's now implemented :D If the face input hasn't changed, you really should cache it; I did this for IP-Adapter FaceID and it gave a huge speedup, caching the calculated face input vectors.
We do have a preprocessor result cache here: sd-webui-controlnet/scripts/global_state.py (lines 22 to 49 in a67d040). If you launch with --controlnet-loglevel DEBUG, you will see whether the cached preprocessor result is used.
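The caching idea discussed here can be sketched in plain Python. This is an illustrative sketch with hypothetical names, not the extension's actual code: the preprocessor result is reused whenever the same module sees the same input image again.

```python
import hashlib

# Hypothetical preprocessor-result cache: skip recomputing the face
# embedding when an unchanged image is fed to the same module again.
_preprocessor_cache = {}

def _cache_key(module_name, image_bytes):
    # Key on the module name plus a hash of the raw image bytes.
    return (module_name, hashlib.sha256(image_bytes).hexdigest())

def cached_preprocess(module_name, image_bytes, run_preprocessor):
    """Return the cached result, or compute and store it on a miss."""
    key = _cache_key(module_name, image_bytes)
    if key not in _preprocessor_cache:
        _preprocessor_cache[key] = run_preprocessor(image_bytes)
    return _preprocessor_cache[key]
```

On a cache hit nothing is recomputed, which is why an unchanged face input can cut per-image time so dramatically, as reported elsewhere in this thread.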
So if models are cached, why does it take so long, and why are models reloaded each time it generates? Nothing should be reloaded if nothing has changed.
DAMN! This is it. Can you make it the default ControlNet setting? Now I can generate non-stop at about 8 seconds per image; it's great. I don't think people know this, and it changes a lot: from 2 minutes to 8 seconds.
This info is super important, thanks.
100%|██████████████████████████████████████████████████████████████████████████████████| 20/20 [00:43<00:00, 2.19s/it]
Please use an SDXL base model. Currently there is no InstantID for SD1.5.
Are there plans for SD1.5 support?
You can file a request in the InstantID repo here: https://github.com/InstantID/InstantID. I don't have the resources to train a model.
Seems that the preprocessor preview works when clicking the 💥. Tested that the preview appears to be working with the following pairs:
*** Error running process: D:\Webui1111\Webui\stable-diffusion-portable-main\extensions\sd-webui-controlnet\scripts\controlnet.py
I get an assertion error using all the same settings as above, with an SDXL model of course.
Were you able to fix this? I've got the same error.
The same error. Did you find a solution for that?
Try another XL model.
I've got this error with the Pony Diffusion V6 XL checkpoint.
Working fine. Thank you.
Can you link to the implementation code of InstantID's Replicate demo?
https://replicate.com/zsxkib/instant-id, though I'm not sure I understand what he means. Uploading the same image for the control would already do the trick.
I don't think the demo site supports generating a random head position when you don't specify a pose.
Currently it seems the second image defines not only the position but also the size of the head in the image. It would be good if the very well-working face reproduction could be used freely like in FaceID, or if an edit function could be added :)
Does it also work in img2img? More or less like "roop"?
Thanks for your work! However, after playing around with this for a bit, something seems off compared to using InstantID through the new Diffusers pipeline. Let me demonstrate with an example (note, however, that the same happens with every image combination I've tried). I'm using a picture of (Daniel Radcliffe as) Harry Potter in ControlUnit 0 (ip-adapter) and a picture of Obama in ControlUnit 1.

Here's the result with the "Control Weight" of Unit 0 set equal to 1.0: obviously, the quality is rather poor. The CFG scale is set to 4.0 and I use the euler-a sampler. Now, if I turn the "Control Weight" of Unit 0 down to 0.0, I get this: it looks just slightly less like Radcliffe's Potter than the first image, but the quality is obviously vastly better. If I try values of Unit 0's "Control Weight" between 0.0 and 1.0, the quality is always degraded (although not as much as at 1.0): even at low values like 0.2, the output image is noticeably blurry. If I keep the "Control Weight" at 1.0 but decrease the CFG scale to 3.0 or 2.0, the quality improves somewhat but the blurriness is still there (and the contrast gets worse).

Across several different input image combinations I've found that I get the best compromise between quality and identity fidelity when ControlUnit 0's "Control Weight" is 0.0 or a very low value like 0.05 or 0.1, and the CFG scale is as high as I can push it without getting too much contrast (usually around 4.0). That's somewhat odd, and I don't experience the same when using InstantID directly; the best combination of weights there is something like 0.8 and 0.8. Any thoughts?
20-30 seconds on a g5 EC2 instance and on an RTX 4090 rig, both with 24 GB VRAM. Generation times are quite high with InstantID. The only problem was forgetting to put the models in the proper folder; other than that, there are 2 different Comfy node sets. ZHO's was a little more flexible, but both are far more restricted than the A1111 version since they are just Diffusers wrappers.
Excuse me, what is the cause of this clarity problem in the WebUI, and is there a solution?
Hi! Can you show me the prompt (or model) for this amazing retro pink style? Many thanks!!
Prompt: Image Width: 960 px
Man, it's been a while lol. But the images should contain metadata; try dropping them into Comfy or A1111.
How do I deal with multiple faces in one picture? I tested it on the WebUI, and InstantID only handled one face.
Seems to download the onnx files and then complains about the onnxruntime. I tried deleting and forcing it to reinstall the onnx files and the onnxruntime from the venv, but no luck. Thoughts?
Downloading: "https://huggingface.co/DIAMONIK7777/antelopev2/resolve/main/1k3d68.onnx" to D:\stable-diffusion-webui\extensions\sd-webui-controlnet\annotator\downloads\insightface\models\antelopev2\1k3d68.onnx
100%|███████████████████████████████████████████████████████████████████████████████| 137M/137M [00:10<00:00, 14.0MB/s]
100%|█████████████████████████████████████████████████████████████████████████████| 4.80M/4.80M [00:00<00:00, 12.8MB/s]
100%|█████████████████████████████████████████████████████████████████████████████| 1.26M/1.26M [00:00<00:00, 8.26MB/s]
100%|███████████████████████████████████████████████████████████████████████████████| 249M/249M [00:19<00:00, 13.6MB/s]
100%|█████████████████████████████████████████████████████████████████████████████| 16.1M/16.1M [00:01<00:00, 13.1MB/s]
Applied providers: ['CPUExecutionProvider'], with options: {'CPUExecutionProvider': {}}
It couldn't detect a face in your input image.
Your face must have the neck visible, otherwise it won't be detected.
Great feature, thanks a lot!!
Hello, I have this error in Automatic1111 for Mac M2:
*** Error running process: /Users/cinesaibai/pinokio/api/sd-webui.pinokio.git/automatic1111/extensions/sd-webui-controlnet/scripts/controlnet.py
ModuleNotFoundError: No module named 'insightface'
I understand, but I have installed everything, and I find automatic1111/extensions/sd-webui-controlnet/annotator/downloads/insightface/models with all the files.
Please read the sd-webui-reactor README about installing insightface on Windows: https://github.com/Gourieff/sd-webui-reactor
It works absolutely best if your input face has half shadow on it, so the volume of the face is read better by the IP-Adapter; with flat light, Stallone was unrecognizable.
Amazing results! Just how did you get that output? Which checkpoint? Did you add a LoRA or just a prompt, or was it because of the control_instantID?
I uninstalled ControlNet on my computer, removed it, re-downloaded the ControlNet update from https://github.com/huchenlei/sd-webui-controlnet/tree/fix_insight, and also re-downloaded both preprocessors and models, but the problem still exists. May I ask if my steps are incorrect?
Is there still no way to edit the keypoints, or even create keypoints from scratch without using a reference image? There is an 'edit' button which opens Photopea, but I couldn't figure out what it is supposed to do, since Photopea doesn't seem to treat the keypoints as editable vectors; and even if it did, how would I save the edit so that it is then used by A1111 again? Am I missing a major thing here?
They said it in the main post:
Yes, I read this, but it is a bit confusing that there is already an 'edit' button which in fact does nothing useful, without even a hint or a tooltip or, gosh, some kind of documentation. But what do you expect from people who think that the 'Gradio' GUI is 'fun' :-) The whole open-source AI developer community has got the focus completely wrong, on 'features' and 'extensions' that are a dime a dozen, instead of taking the most obvious usability aspects into account.
Oh, that's just the ControlNet mask. The only thing you can edit about that is by drawing on it like a canvas, I think, but that would ruin it. We're pretty lucky to have Gradio & A1111 & StableForgeUI & ComfyUI & StableSwarmUI; otherwise, we'd have to do this all through the CLI or CMD.
Guess it is debatable whether a bad GUI is better than a bunch of intelligent scripts with good documentation and adequate examples. Having been a software developer for decades myself, I would very much prefer using image generation AI the way a 'normal' script with function calls works (heck, there is only a very minor 'visualization' part in the AI GUIs; the complete rest is a mere parameter-and-settings orgy and nested function calls, isn't it?). This would be WAY more transparent than what the GUIs now do, and don't get me started about the chaotic spaghetti nonsense of (Un-)ComfyUI. Look at how you normally build even complex web apps with (script) frameworks: include jQuery, and the stage is already set for most operations.

It tells a lot that A1111 to date isn't even capable of saving and reloading a project file which includes ALL the settings, parameters, and referenced file pathnames needed to recreate the very same image you created before. What are the people who develop this thinking? I have never in decades come across any reasonable GUI which couldn't do this. Or do you know of any system where you have to keep manual annotations on top of a project file simply to continue your work where you left off last week?
I found
Has anyone been able to get this working with the Oobabooga text generation web UI via the API? Seems there would be some amazing possibilities...
After the 1.1.447 update, InstantID stopped working; it gives an error.
I am facing the same issue as well! Need to wait for a fix.
The new update should fix this. It is really weird that the fully qualified name reaches here. Can I confirm the Gradio version you folks used?
Thanks, the new update fixed the problem. Gradio version 3.41.2.
There is a quick fix here: #2849. A more proper fix & large refactoring is on the way: #2847
Hi there! Any tips to reduce the blurriness / improve the quality of the image? I'm using:
The only thing I imagine can improve the quality is Pixel Perfect on both units, but my laptop is not powerful... Is it worth it if I rent a cloud GPU?
Hey, can we use this InstantID in Stable Diffusion Forge?
Same here.
Will this model be updated for the FLUX model?
None of this works now; I tried it on different builds of SD. Automatic1111 version v1.10.0, model juggernautXL_version6Rundiffusion.safetensors [1fe6c7ec54], although I tried another XL model too, and I tried different resolutions. In the first tab, ControlNet Unit 0 [Instant-ID]: Preprocessor instant_id_face_embedding, Model ip-adapter_instant_id_sdxl [eb2d3ec0]; in the second tab, ControlNet Unit 1 [Instant-ID]: Preprocessor instant_id_face_keypoints, Model control_instant_id_sdxl [c5c25a50]. Nothing happens; the face does not change. I reinstalled ControlNet. Does it even work for anyone now???
Forge is now just a preview browser for Flux. They don't give a shit about actually using it in a professional or creative setting.
Why do the characters I generate always appear in the middle of the image without any action, even though I specifically include actions in my prompt? I tried many, many times with different prompts, control weights, and so on, but no luck. Did I miss something?

mid shot, bust of a man is playing golf and is about to hit the ball, from front
Steps: 28, Sampler: DPM++ 2M, Schedule type: Karras, CFG scale: 3, Seed: 2070426131, Size: 1280x768, Model: DreamShaperXL_Turbo_v2_1, VAE: sdxl_vae.safetensors, ControlNet 0: "Module: instant_id_face_embedding, Model: ip-adapter_instant_id_sdxl [eb2d3ec0], Weight: 1.0, Resize Mode: Resize and Fill, Processor Res: 768, Threshold A: 0.5, Threshold B: 0.5, Guidance Start: 0.0, Guidance End: 1.0, Pixel Perfect: True, Control Mode: My prompt is more important", ControlNet 1: "Module: instant_id_face_keypoints, Model: control_instant_id_sdxl [c5c25a50], Weight: 1.0, Resize Mode: Resize and Fill, Processor Res: 512, Threshold A: 0.5, Threshold B: 0.5, Guidance Start: 0.0, Guidance End: 1.0, Pixel Perfect: True, Control Mode: My prompt is more important", Version: 1.10.1
InstantID finds the face and generates what's around it with your prompt. If your face always ends up in the middle, that's because the head in your image is... always in the middle. Move the head or the whole image, it doesn't matter.
Instant ID project
https://github.com/InstantID/InstantID
Instant ID uses a combination of ControlNet and IP-Adapter to control the facial features in the diffusion process. One unique design of Instant ID is that it passes the facial embedding from the IP-Adapter projection as the crossattn input to the ControlNet unet. Normally the crossattn input to the ControlNet unet is the prompt's text embedding.

Download models
You need to download the following models and put them under the {A1111_root}/models/ControlNet directory. You also need to rename the models to ip-adapter_instant_id_sdxl and control_instant_id_sdxl so that they can be correctly recognized by the extension.
How to use
InstantID takes 2 models on the UI. You should always set the IP-Adapter model as the first model, as the ControlNet model takes the output from the IP-Adapter model (the IP-Adapter model should be hooked first).
Unit 0 Setting
You must set the IP-Adapter unit right before the ControlNet unit. The projected face embedding output of the IP-Adapter unit will be used as part of the input to the next ControlNet unit.

Unit 1 Setting
The ControlNet unit accepts a keypoint map of 5 facial keypoints. You are not restricted to using the facial keypoints of the same person you used in Unit 0; here I use a different person's facial keypoints.

CFG
It is recommended to set CFG to 4~5 to get the best result. Depending on the sampling method and base model this number may vary, but generally you should use a CFG scale a little lower than your normal CFG.
Output
Follow-up work
Note
As insightface's GitHub releases currently do not include the antelopev2 model, we are downloading it from a huggingface mirror: https://huggingface.co/DIAMONIK7777/antelopev2. If you are in mainland China and don't have a good internet connection to huggingface, you can manually download the models from somewhere else and place them under
extensions/sd-webui-controlnet/annotators/downloads/insightface/models/antelopev2
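If you install manually, a quick sanity check like this sketch can confirm nothing is missing. The filenames below are the standard antelopev2 set from insightface (the same sizes appear in the download log earlier in this thread); the directory is the one given above.

```python
import os

# The standard antelopev2 model package contents from insightface.
ANTELOPEV2_FILES = [
    "1k3d68.onnx", "2d106det.onnx", "genderage.onnx",
    "glintr100.onnx", "scrfd_10g_bnkps.onnx",
]

def missing_antelopev2_files(model_dir):
    """Return the expected .onnx files that are absent from model_dir."""
    return [name for name in ANTELOPEV2_FILES
            if not os.path.isfile(os.path.join(model_dir, name))]
```

An empty return value means the face analysis models should load; any listed name points at a file that still needs to be downloaded into the antelopev2 folder.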
Known issues