PInvoke, mono_add_internal_call, C and C#

Hi there,

I’m posting this since it may be useful to someone else and also as a way of verifying my conclusions. This is in no way meant as a C is better or C# is better, it’s just data that I used to make a decision.

Based on the findings below, I decided to write a wrapper to the Chipmunk physics library, rather than port it to C# (the summary is that the calling overhead to C/Objective C is negligible, particularly when compared to the performance of C#, which is this case is two orders of magnitude below C).

I also found out that Unity isn’t using P/Invoke, or if it is, then its performance when compared to mono’s internal call is very similar.

So, here’s the code I used.

C#:

using UnityEngine;
using System.Collections;
using System.Runtime.CompilerServices;
using System.Runtime.InteropServices;
using System;


public class Hello
{	
	static int val=0;
		
	[MethodImplAttribute(MethodImplOptions.InternalCall)]
	public extern static int RetInt();	
	
	[DllImport ("__Internal")]
	public extern static int RetIntSlow();	
	
	[DllImport ("__Internal")]
	public extern static void TestIntC(int millions);	
	
	public static int RetSqrtCSharp()
	{
		val++;
		return (int)Mathf.Sqrt(val);	
	}	
	
}


public class InteropCallDemo : MonoBehaviour 
{	
	delegate int IntegerRet();
	
	//Quick method to call an int return method some million times
	void CallMillionTimes(int numMillions,IntegerRet method)
	{
		DateTime time=System.DateTime.Now;
		int total=0;
		for (int n=0;n<numMillions*1000000;n++)
		{
			total+=method();	
		}
		print("Total="+total); //Using total, so the compiler doesn't optimize and gets rid of the code above.
		DateTime now=System.DateTime.Now;
		
		System.TimeSpan diff=now.Subtract(time);
		print("Spent "+diff.TotalMilliseconds+" ms");
	}
	
	
	// Use this for initialization
	void Start () 
	{		
		//Number of calls made to benchmark functions
		int numMillion=100;
		
		//Test mono_internal_add_call method		
		print("intFast performance for "+numMillion+" million calls");
		CallMillionTimes(numMillion,Hello.RetInt);
		
		print("intSlow performance for "+numMillion+" million calls");		
		CallMillionTimes(numMillion,Hello.RetIntSlow);
		
		print("Performance in C for a method that sqroots "+numMillion*100+" million times.");
		Hello.TestIntC(1);
			
		print("\nPerformance in C# for a method that sqroots "+numMillion+" million times.");
		CallMillionTimes(numMillion,Hello.RetSqrtCSharp);			
		
	}
	
}

C:

extern "C" {
    
    int val=0;
    
    static int RetInt()
    {
        val++;
        return val;
    }
    
    int RetIntSlow()
    {
        val++;
        return val;
    }
    
    int RetSqrtC()
    {
        val++;
        return sqrt(val);
    }
    
    int diff_ms(timeval t1, timeval t2)
    {
        return (((t1.tv_sec - t2.tv_sec) * 1000000) +
                (t1.tv_usec - t2.tv_usec))/1000;
    }
    
    void TestIntC(int numMillions)
    {
        
        timeval start, finish;
        gettimeofday(&start,NULL);
        
        int total=0;        
        for (long count = 0; count <numMillions*1000000; count++)
        {
            for (long count2 = 0; count2 <100; count2++)
            {
                total+=RetSqrtC();
            }
        }
        
        gettimeofday(&finish,NULL);
        printf("\nTotal %d",total); //Using total, so the compiler doesn't optimize and gets rid of the code above.
        printf("\nSpent %d ms",diff_ms(finish, start));        
        
    }
    
    
};


void mono_internal_initialize()
{    
    mono_add_internal_call("Hello::RetInt", (void*)RetInt);
}

The result:

intFast performance for 100 million calls
Spent 18765.762 ms

intSlow performance for 100 million calls
Spent 19340.145 ms

Performance in C for a method that sqroots 10000 million times.
Spent 15210 ms

Performance in C# for a method that sqroots 100 million times.
Spent 77821.272 ms

The times above are for an iPhone 4, with a release version.

So, 100 million calls to a C function took 19 seconds, which makes the calls negligible in cost, since the cost per call is around 0.00019 ms. This means that even if I called some C method 1000 times per frame, it would only take 0.19 ms.
To make things even better, I’m using delegates here, which are most likely introducing a relevant overhead (haven’t checked the CIL code or timed it).

It was interesting to find that mono_add_internal_call is basically the same as the DllImport method, which I was assuming used P/Invoke.

The last two times are a bit more indicative on the why I’m choosing to write a wrapper for Chipmunk. C# is two orders of magnitude slower in a simple sqroot method. I was actually surprised by this. So surprised that I had to tweak the C code to do 100 times more calculations (when it was the same as C# it took near 0 seconds).

Now I’m going to find out about the performance of UnitySendMessage versus unmanaged to managed thunks. Has anyone done any testing here?
Chipmunk has a lot of callbacks and I need to make sure there isn’t a bottleneck in UnitySendMessage.

Feel free to chime in, particularly if you disagree with any of my conclusions.

Cheers,
Alex

Interesting!

I’m very interested in using an external physics engine. I’m a bit unclear as to how to go about it though. I have one in c# that works in unity. I have no problem porting it to another language if need be. Do you need a pro licence to do it? Are there any resources you can point me towards on what wrapping is or how to do it?

Thanks

The problem with a C# physics engine is the massive overhead when calling Mathf methods. It’s much more efficient to write a wrapper to a native physics engine (much like Unity did). From what I can remember, when calling a Mathf method, Mono (and most likely .Net) don’t implement it natively, but they wrap it using P/Invoke. Now imagine the cost of calling any Mathf method in a tight loop…

To see how Unity did it, you merely have to decompile the relevant Unity dlls. Use reflector or resharper to do that and you’ll see that Unity extensively uses P/Invoke. Additionally you can use unsafe in C# to avoid data marshalling (see here: http://www.snowbolt.com/index.php/blog/28-tech/91-pinvoke ).

A Pro license is not needed for iOS and Android, but it’s needed for Windows and Mac.

Finally, I’m glad I never started writing a Chipmunk wrapper, because the Chipmunks themselves are doing it :smile:

Please pledge here to see what seems to be a very nice Chipmunk integration into Unity: http://www.kickstarter.com/projects/howlingmoonsoftware/chipmunk2d-for-unity

Cheers,
Alex

Pro license required to use P/Invoke? I don’t think so. I’ve built game with following code (built because I’ve heard Unity sometimes execute pro-only things in editor):

public enum MessageBoxResult : uint
{
   Ok = 1,
   Cancel,
   Abort,
   Retry,
   Ignore,
   Yes,
   No,
   Close,
   Help,
   TryAgain,
   Continue,
   Timeout = 32000
}
public enum MessageBoxOptions : uint
{
   OkOnly  = 0x000000,
   OkCancel  = 0x000001,
   AbortRetryIgnore  = 0x000002,
   YesNoCancel  = 0x000003,
   YesNo  = 0x000004,
   RetryCancel  = 0x000005,
   CancelTryContinue  = 0x000006,
   IconHand  = 0x000010,
   IconQuestion  = 0x000020,
   IconExclamation  = 0x000030,
   IconAsterisk  = 0x000040,
   UserIcon  = 0x000080,
   IconWarning  = IconExclamation,
   IconError  = IconHand,
   IconInformation  = IconAsterisk,
   IconStop  = IconHand,
   DefButton1  = 0x000000,
   DefButton2  = 0x000100,
   DefButton3  = 0x000200,
   DefButton4  = 0x000300,
   ApplicationModal  = 0x000000,
   SystemModal  = 0x001000,
   TaskModal  = 0x002000,
   Help  = 0x004000,
   NoFocus  = 0x008000,
   SetForeground  = 0x010000,
   DefaultDesktopOnly = 0x020000,
   Topmost  = 0x040000,
   Right  = 0x080000,
   RTLReading  = 0x100000
}

public class Test : MonoBehaviour {

   [DllImport("user32.dll", CharSet=CharSet.Auto)]
   public static extern MessageBoxResult MessageBox(IntPtr hWnd, String text, String caption, int options);
   
   // Use this for initialization
   void Start () {
     MessageBox(IntPtr.Zero, "Text", "Caption", 0);
   }
}

Then put it on empty game object and it works both in editor and in build.

P/Invoke is free. What’s Pro is Unity plugins, completely different thing.

How do you get the mono_add_internal_call function?

2 Likes