Could codegen make this job type possible?

Consider this situation:

  • MyJob lives in a package
  • MyJob takes a function pointer or generic struct type to extend its functionality in its update loop logic
  • I am in my project using the MyJob package, and the function pointer I want to pass to MyJob would need component data access that MyJob couldn’t have planned for in advance. For example it could want to check if a certain entity has a certain component specific to my project

I’d like this use case to be as simple, straightforward, and not-weird as possible for users that aren’t very DOTS-savvy and wish to use such functionality. (the kind of users that would probably just stick to a simple Entities.ForEach for everything). I do know about ways around this problem currently, but they all feel a bit too complex and scary for beginners

So I’m interested in the possibility of a codegen’d job type that would allow this: I want to be able to create a job that doesn’t necessarily have to know in advance all the component data access it will require. It could just be generated based on whatever component types/accesses it ends up needing in all the places in the codebase where we ask for specific component accesses from this job. This would allow us great flexibility in creating several “variants” of a generic job. Kind of a replacement for what you’d do with virtual function overrides in OOP.

In this code example, the “DoSomethingInJobIteration()” function reads/writes to rotation even though the “MyJob” calling this function was never made aware in its own code that it might need rotation access in its own update loop. MyJob would know which component sets to operate on based only on an EntityQuery:

Basic code example

public static class MyUtilities
{
    public static void DoSomethingInJobIteration(in MyJob job)
    {
        // make the entity rotate
        Rotation rotation = job.GetComponent<Rotation>();
        RotationSpeed speed = job.GetComponent<RotationSpeed>();

        rotation.Value = math.mul(math.normalize(rotation.Value), quaternion.AxisAngle(math.up(), speed.RadiansPerSecond * job.DeltaTime));

        job.SetComponent(rotation);
    }
}

[BurstCompile]
struct MyJob : IJobEntityGeneric
{
    public float DeltaTime;

    public void OnUpdateEntity()
    {
        MyUtilities.DoSomethingInJobIteration(in this);
    }
}

With this sort of job API, we could imagine that creating a custom “variant” of a system/job would be as simple as this:

Extending job functionality code example

public class MyCustomSystemVariant : MyBaseSystem<MyCustomLogicHandler>
{ }

public struct MyCustomLogicHandler : IJobCustomLogicHandler
{
    // This gets called by the MyBaseJob<T> job in MyBaseSystem<T>
    // MyBaseJob also does logic of its own before/after this gets called
    public void DoSomethingInJobIteration(ref MyBaseJob<IJobCustomLogicHandler> job)
    {
        // This function uses Translation access even though no Translation access was declared in MyBaseJob in any way
        MoveToTarget moveToTarget = job.GetComponent<MoveToTarget>();
        Translation selfTranslation = job.GetComponent<Translation>();
        Translation targetTranslation = job.GetComponentOnEntity<Translation>(moveToTarget.TargetEntity);
        selfTranslation += math.normalizesafe(targetTranslation - selfTranslation) * moveToTarget.speed;
        job.SetComponent(selfTranslation)
    }
}

Note: The generated data arrays in this job would either be ComponentTypeHandles, or ComponentDataFromEntity, or [ReadOnly], etc… depending on all of the combined needs of every function in the codebase that uses MyJob.Get/SetComponent(…). If the Base job had write access to Rotation on the current entity in iteration, we would just generate a ComponentTypeHandle and write in chunks. BUT, if one of the extension functions requires read access to RotationFromEntity as well, it would end up generating a [NativeDisableParallelForRestriction] ComponentDataFromEntity instead. Etc…

It would be a very “accessible” option for making jobs I think, despite the drawbacks of manual writeback into component arrays, and not having clear visibility on what types those jobs get access to. We’d basically not have to care anymore about how to obtain access to the data we need in a job, and the end result would be as highly-optimized as doing things manually in an IJobChunk. We’d also have to figure out how to inform users of parallel writing from entity risks and make them manually acknowledge that they understand what they’re doing. Maybe with an attribute over their custom function

tagging @joepl

Maybe I’m misunderstanding, but can’t you achieve this with function pointers already?

the issue is that if you pass a function pointer to a job that only has rotation access, and your function pointer would like to have translation access too, you can’t make it work unless you manually go add translation access in the job and pass it along

The idea I’m presenting is a way for jobs to not have to already know every possible component access that function pointers passed to it might possibly need. So with my proposed approach, the codegen would scan the codebase, see that there is a function somewhere that expects to have translation access from MyJob, and then add the required translation arrays to MyJob. If there’s another function somewhere else that needs write access to PhysicsVelocity from MyJob, it would also add PhysicsVelocity arrays to MyJob, etc…

The generated data arrays would either be ComponentTypeHandles, or ComponentDataFromEntity, or [ReadOnly], etc… depending on all of the combined needs of every function in the codebase that uses MyJob.Get/SetComponent(…)

(if that is technically possible, that is)

Ah, not sure if this will help you but I had to implement something like this in my (utility) AI system, however I decided to avoid using any code gen (my original implementation required it though but I figured out how to go without.)

Basically the AI tree is generated from a node graph but the AI system has no idea what this graph is. Users can implement custom nodes with data requirements etc.

However the entire thing runs off a very small, single job. It turned out very elegant. I wasn’t intending to show this off till I had finished with the node editor.

public class UtilityAISystem : SystemBase
{
    private static readonly ProfilerMarker PopulateContextMarker = new ProfilerMarker("PopulateContext");

    private readonly IAISettings settings;

    private NativeHashMap<int, Reference<Selector>> aiUtilityTrees;
    private NativeArray<Reference<Selector>> treeRoots;

    /// <inheritdoc/>
    protected override void OnUpdate()
    {
        using (PopulateContextMarker.Auto())
        {
            // Populate all trees with current frame data
            foreach (var tree in this.treeRoots)
            {
                tree.Value.PopulateContext(this, this.settings);
            }
        }

        var utilityTrees = this.aiUtilityTrees;

        this.Entities
            .ForEach((Entity entity, in AI ai) =>
            {
                var selector = utilityTrees[ai.Current];
                selector.Value.Select(entity).Value.Execute(entity);
            })
            .WithReadOnly(utilityTrees)
            .ScheduleParallel();
    }

There we have it. I have just publicly posted my entire AI system enjoy!
Reference is pretty much just a pointer.
Populate context iterates the entire graph and assigns all the required GetComponentFromEntity or user assigned data etc.

The main trick for my implementation though is it requires c#8 (so 2020.2a) to get access to Unmanaged constructed types

-edit-

my point is, you can pass a pointer in to a struct that has the data it requires.
blobassetreference works fine

The downside I see in this approach (and correct me if I’m wrong) is that it forces all of your data to be “FromEntity” because you have to plan for the worst case scenario of data access, so there is a performance loss

I’m guessing it’s not a big deal in your case because AI is surely a very non-linear problem though

You’re right, it’s definitely the issue but I feel you could maybe use the same concept with IJobChunk and ComponentTypeHandle; though I haven’t tried or thought about the problem much.

(side note, I only just realized they renamed ArchetypeChunkComponentType to ComponentTypeHandle)

The problem I’ve encountered regarding this:

  • My BaseJob has chunk iteration for Translation, Rotation, PhysicsVelcity, and 4 other component types. I want this for performance
  • My custom function that gets passed to the job might want to read the translation on another entity, for example. I can’t know in advance
  • Just because of that, my 7 chunk-iterated arrays in the base job have to be ComponentDataFromEntity instead. Just in case the custom function passed to the job needs them
  • I cannot add an extra ComponentDataFromEntity to the job via a generic struct, because that would do aliasing errors with the existing translation chunk arrays

All in all, replacing my chunk iteration with FromEntity iteration gives me a ~10-15% performance loss

It feels like an unfortunate sacrifice, especially since the use cases for FromEntity acces to one of those 7 types are possible but rare. And this is pretty much the point in my reasoning where I came up with the proposal of the codegen job of the first post. So that it optimizes the job’s data arrays to fit the requirements of anything that might use it. And if it turns out that nothing needs a translationFromEntity from this specific job in a given project, the codegen would make sure that translation remains chunk-iterated rather than FromEntity

Is this an attempt to try and offer an integration layer to your end users? Few thoughts…

The custom function approach seems problematic. So your job does what it does, produces the data it needs for that. Another concern should handle getting component references on it’s own. Like you provide a callback with your data, what the end user does with it that’s up to them. They sort out the dependencies, you might provide some input/output dependencies to help them. But keep separate concerns separate. What component data some end user might want for reasons you have no idea of are their concern.

The graph idea above follow this paradigm also, the graph doesn’t try to solve for fetching data individual nodes need that’s up to them.

I can’t see any good reason why your jobs have to fetch data for concerns you don’t even know about. That just seems like a bad idea on it’s face.

It’s pretty much just something that would generate an exact equivalent of an IJobChunk that we would setup manually to be optimized for the exact requirements of our project, but it does all the manual work for us.

When I make a system that lives in a package, I cannot know the exact requirements of the projects it will be used in in advance. Therefore; my only option would be to plan for the worst case scenario of how much data access those projects will need, and make everything 100% accessible (by making everything work with ComponentDataFromEntity, just in case). That means I get a performance loss that is probably not justified for most projects. See description of problem here .

This sort of codegen is the only way I can think of to not have to make that sacrifice if the project doesn’t need it. Actually I can see an alternative where 100% of the job setup is handled by users, but I’ve tested out that approach in practice to give it a try, and it’s no good. It would require users to write hundreds of lines of code, with many ways things could go wrong if they don’t do the right thing in the right order, just to create a simple moving character. It also requires tons of confusing interface callbacks that for example asks you to write back Translation into arrays, but it gives you either the entity or the chunk index to write to, depending on whether you ended up with chunk iteration or FromEntity iteration for your Translation. That’s just bound to confuse people. And that’s even if I assume that they’ll have access to the eventual IJobEntity which hides a lot of the IJobChunk boilerplate

I definitely see some weirdness in the fact that any code in the project that asks for a component type for a certain job will end up contributing to that job’s data access, regardless of it it will actually be used or not and regardless of any notion of game context…

But perhaps there are other ways to make this work without resorting to that

I also think about how easy and accessible this would make dots programming, with few disadvantages other than low visibility on a job’s data access (unless we have some editor tool to debug it), or someone mistakingly making a job function that is never used and giving the job more data than it needs. I think it’s worth considering at least

Isn’t something similar already achievable by chaining jobs? If your intent is to create some API to deal with your package and the user will need to insert code at some point (be it before or after some API call) seems like that just creating smaller jobs - one that preprocess the data that the user will need to modify and another one to receive the data postprocessed by the user - will achieve the exact same goal without any codegen or complication involved.

Another option is to just create the methods in some static utility class and let the user deal with all the boilerplate needed to either use ComponentTypeHandle or ComponentDataFromEntity, but if it is the boilerplate that you want to avoid then the first option would be the way to go.

I think I might just have a really tough & specific case in my hands. I’ve calculated the number of jobs required to schedule if I go with the job chaining approach, and it would be something like 40 jobs per “variant” of my system. In practice that could mean hundreds of jobs scheduled just for that one gameplay feature, because of the many variants

The link at the end of the “additional comments” spoiler of my first post explains why I need so many jobs (different use case but similar to mine)

Oh, didn’t saw the second spoiler tag Haha

In that case, I would say that preparing for the worst (FromEntity everywhere) would be ok. Not sure how codegen could optimize all that work for you without falling into the issue where we will need to check the decompile window all the time to ensure that the things are running as planned. In the end I think it is the usual tradeoff between something generic/reusable and something specific, with the later being more performant almost always.

But it would be cool to know that Unity already have something in mind to improve code re-use.

Didn’t read the thread carefully, sorry if missed something, but if you sure in safety you can disable aliasing error and use handle and CDFE at the same time (Not checked with generic, but with non generic jobs it will work fine)

[ReadOnly\WriteOnly\Without attribute]
public ComponentTypeHandle<Translation> translationHandle;
[NativeDisableContainerSafetyRestriction]
public ComponentDataFromEntity<Translation> translationData;
2 Likes

Oh I didn’t know that was possible! That could be a decent solution

However I don’t fully grasp the concept of aliasing and I imagine the error is there for a reason. What should I be aware of, when disabling the restrictions? Is it just that I need to realize that something else might be wanting to write to that array before/after I write to it myself?

If you disable this restriction you should be sure that no one will read and write \ write and write at the same element from CDFE and ComponentTypeHandle at the same time as it will be a possible hidden race condition with very weird and unexpected results otherwise.

1 Like

I see. In this case my CDFE would always be readonly so this is looking like a good plan, thanks