Is Worker's Execute/Schedule method not actually non-blocking?

Hello!

I’m trying to wrap my head around running language models in inference mode at runtime without losing too many frames.

The problem is that a simple Execute or Schedule call (in Sentis 1.6 or 2.0 respectively), which should be non-blocking, freezes the main thread :sweat_smile: and can’t be called from another thread.

Is this normal, and did I just misinterpret the term “non-blocking”?

I’m attaching a sample script if you want to try it for yourself: Phi3Sentis2.cs (1.5 KB) (don’t mind the initial loading time; once loading finishes, press Return to trigger the worker’s Schedule call)

I’m running this on Unity 2023.2.6f1 with Sentis 2.0, and the model I’m running can be found here: Sentis Phi 3.5 Uint8

Will I run into the same kind of unavoidable processing stall with calls like ReadbackRequest that are marked as async?

Thank you in advance for your help :pray: