Clever way to shuffle a List<T> in one line of C# code

casimps1 · April 16, 2014, 9:21pm

I had been planning on writing a fairly traditional shuffle algorithm for my generic Lists when I ran across this clever little trick on another site. It uses System.Linq and lambda expressions to do the magic in one nice clean line:

shuffledList = myList.OrderBy( x => Random.value ).ToList( );

Just thought I’d share this little shortcut.

Also feel free to tell me why doing it this way is a horrible idea.

I know it’s somewhat inefficient (it is after all actually doing a true sort rather than a shuffle, so technically we’re talking about going from O(n) to O(n log n) in complexity) so YMMV… but as long as you’re not shuffling lists of massive size every frame it shouldn’t matter at all.

Smooth_P · April 16, 2014, 9:41pm

Actually, this is likely going to be O(n²) time complexity and O(n) space complexity with two new lists created as well as other temporary objects.

If your collection implements IList (eg: T[ ], List) it would be much better to do an in-place shuffle with an extension method:

(This code is from Smooth.Foundations, which I will be submitting to the asset store as a free-to-use, free-to-distribute package as soon as I finish cleaning some things up and complete the autodocs.)

public static class IListExtensions {
	/// <summary>
	/// Shuffles the element order of the specified list.
	/// </summary>
	public static void Shuffle<T>(this IList<T> ts) {
		var count = ts.Count;
		var last = count - 1;
		for (var i = 0; i < last; ++i) {
			var r = UnityEngine.Random.Range(i, count);
			var tmp = ts[i];
			ts[i] = ts[r];
			ts[r] = tmp;
		}
	}
}

JamesLeeNZ · April 17, 2014, 12:52am

LinQ throws up some extra garbage for the GC as well, so avoid unless you’re doing it at a low frequency.

Rockaso · April 21, 2020, 3:11pm

Smooth_P:

Actually, this is likely going to be O(n²) time complexity and O(n) space complexity with two new lists created as well as other temporary objects.

If your collection implements IList (eg: T[ ], List) it would be much better to do an in-place shuffle with an extension method:

(This code is from Smooth.Foundations, which I will be submitting to the asset store as a free-to-use, free-to-distribute package as soon as I finish cleaning some things up and complete the autodocs.)
public static class IListExtensions {
    /// <summary>
    /// Shuffles the element order of the specified list.
    /// </summary>
    public static void Shuffle<T>(this IList<T> ts) {
        var count = ts.Count;
        var last = count - 1;
        for (var i = 0; i < last; ++i) {
            var r = UnityEngine.Random.Range(i, count);
            var tmp = ts[i];
            ts[i] = ts[r];
            ts[r] = tmp;
        }
    }
}

thanks

Eiznek · April 21, 2020, 4:23pm

Simplest way to “Shuffle” is just to order by random guid, I actually did this for a job code interview.

// Say we have listOfThings filled with things.
var listOfThings = new List<string>()
{
   "1", "2", "3", "4"
};
// Randomly Order it by Guid..
listOfThings = listOfThings.OrderBy(i => Guid.NewGuid()).ToList();
// It's now shuffled.

I read all about fisher yates before just implementing this, I don’t completely remember, but I think there isn’t much difference between the two? (I did get that job)

lordofduct · April 21, 2020, 5:16pm

I love when people reduce something to “a line of code” forgetting that their line of code is actually calling multiple functions that do a lot of the heavy lifting for you and result in more work being done then if you just wrote out what you’re doing (and that could be encapsulated into its own method itself to make a “single line of code”).

I also fail to see how sorting on a guid is the “simplest” relative to sorting on Random.value. They’re both random values. How is one simpler than the other? In what aspect? I mean… one could argue Guid.NewGuid is more complex since the algorithm for generating a guid is more complex than the algorithm for getting the next random value in a sequence. Of course one could argue that because Random.value must maintain a state it’s more complex in the form of memory foot print… even though we’re talking in the couple of bytes.

I don’t know… call me weird. But I do the same approach posted 6 years ago by Smooth-P. Seems pretty simple to me, as well as being the most efficient of the lot (I believe it’s called the Fisher-Yates shuffle).

I should point out also that said algorithm creates what is known as an “unbiased permutation”… it means that all permutations of the shuffle have equal odds. As opposed to the OrderBy(random) which is biased due to the fact it exploits the underlying QuickSort algorithm and done repeatedly results in a pattern of permutations that are favored over others. So not only is it more time complex, but it results in less statistical variation… which in most games doesn’t matter. But if say you were creating a game for gambling (which Unity doesn’t allow without a special license), the algorithm actually wouldn’t meet the regulatory requirements of most state gambling boards.

PraetorBlue · April 21, 2020, 5:30pm

If you write the fisher-yates shuffle extension method and just call that it’s also “one line of code” ¯_(ツ)_/¯

lordofduct · April 21, 2020, 5:36pm

Like Smooth-P’s extensions method.

Kurt-Dekker · April 21, 2020, 5:56pm

Yes, I’m with you duct (and by extension I’m with Smooth-P as well)… there is no value in packing everything into one line, especially chained Linq statements.

Putting all your code on one line only says:

“I would like to hide all possible future bugs and malfunctions inside code that makes it extremely irritating to actually find the bug, code which will cause me to have to post non-line-wrapped 256-character lines of parenthesis and dot-infested nightmare code to the Unity forums saying that I have a null reference exception somewhere on that line and I can’t find it.”

Just don’t do it. You’re NOT fooling anyone (certainly not the compiler) and you’re wasting your own time first and foremost.

Eiznek · April 21, 2020, 6:07pm

lordofduct:

I love when people reduce something to “a line of code” forgetting that their line of code is actually calling multiple functions that do a lot of the heavy lifting for you and result in more work being done then if you just wrote out what you’re doing (and that could be encapsulated into its own method itself to make a “single line of code”).

I also fail to see how sorting on a guid is the “simplest” relative to sorting on Random.value. They’re both random values. How is one simpler than the other? In what aspect? I mean… one could argue Guid.NewGuid is more complex since the algorithm for generating a guid is more complex than the algorithm for getting the next random value in a sequence. Of course one could argue that because Random.value must maintain a state it’s more complex in the form of memory foot print… even though we’re talking in the couple of bytes.

I don’t know… call me weird. But I do the same approach posted 6 years ago by Smooth-P. Seems pretty simple to me, as well as being the most efficient of the lot (I believe it’s called the Fisher-Yates shuffle).

I should point out also that said algorithm creates what is known as an “unbiased permutation”… it means that all permutations of the shuffle have equal odds. As opposed to the OrderBy(random) which is biased due to the fact it exploits the underlying QuickSort algorithm and done repeatedly results in a pattern of permutations that are favored over others. So not only is it more time complex, but it results in less statistical variation… which in most games doesn’t matter. But if say you were creating a game for gambling (which Unity doesn’t allow without a special license), the algorithm actually wouldn’t meet the regulatory requirements of most state gambling boards.

It’s high level coding, As a developer constantly create more and more tools until my code is increasingly simplified. It’s just like using c# over using c++, it’s a higher level language. No longer dealing with pointers, and addresses.

As you become more fluent in your language you’d want these assists. It helps you get to your overall goal that much faster. No need to constantly recreate the wheel.

lordofduct · April 21, 2020, 6:38pm

Nothing in my post said reusable code was bad. Reusable code is fantastic.

I was criticizing reducing code to a single line while not understanding what’s going on underneath. That single line of code exploiting QuickSort algorithm which results in a less efficient O(NlogN) to as much as O(N^2) operation that has a biased set of results. Where as Smooth-P’s approach utilizes a known good O(N) algorithm (fisher-yates) done up as an extension method (so it’s reusable) and can be written in a single line of code on use:

lst.Shuffle();

OR

What @Kurt-Dekker said

Owen-Reynolds · April 21, 2020, 9:29pm

It may not be fair: if you shuffle {1,2,3,4,…10} fairly, any number should have a 10% chance of being in any slot. Depending on the exact sort method (quick sort, but bubble for <20?) numbers may not move very far from their starting position, or may tend to fall into some other pattern.

Boz0r · April 22, 2020, 8:53am

Isn’t that more related to how “random” your random function is? If the distribution is completely uniform, the sorting method shouldn’t matter.

Cannist · April 22, 2020, 9:40am

That is only true if the assignment of random values induces a total order, i.e. each of the elements gets a different random number assigned.As soon as you have duplicates it’s up to the sorting algorithm to break the ties.

Owen-Reynolds · April 22, 2020, 3:07pm

The first example, 6 years ago , used Random.value, which is a coin flip – each time you compare items they roll 0-1 for themselves, highest wins. Compare again, and they each roll again. Since Bubble sort swap side-by-side items, the fist item has a 50% chance to stay in place, 25% to move up by only 1, 12.5% to move up 2 spaces… . C#'s sort says it uses Insertion sort for 16 or fewer items (but C# docs are often wrong). That gives us the same problem – in an insertion sort, item 10 compares itself with items 9,8,7… until it finds a smaller one, so it still uses a coin flip to move 1 space, and won’t move very far.

Sure, the guid idea will sort of work (I’m assuming there’s a hash involved, and hashes are purposely designed to look random). Assigning a constant “random” value to each item will give a random-seeming shuffle. But only once. You’re probably going to want to shuffle again.

lordofduct · April 22, 2020, 4:17pm

Not so true, as @Owen-Reynolds pointed out in his last post.

This is what I was talking about in my prior posts when I said that it will create a ‘biased’ result.

You can see a description here on the Fisher-Yates wikipedia article about using sort on random value here:
https://en.wikipedia.org/wiki/Fisher–Yates_shuffle#Sorting

A variant of the above method that has seen some use in languages that support sorting with user-specified comparison functions is to shuffle a list by sorting it with a comparison function that returns random values. However, this is an extremely bad method: it is very likely to produce highly non-uniform distributions, which in addition depends heavily on the sorting algorithm used.[10][11] For instance suppose quicksort is used as sorting algorithm, with a fixed element selected as first pivot element. The algorithm starts comparing the pivot with all other elements to separate them into those less and those greater than it, and the relative sizes of those groups will determine the final place of the pivot element. For a uniformly distributed random permutation, each possible final position should be equally likely for the pivot element, but if each of the initial comparisons returns “less” or “greater” with equal probability, then that position will have a binomial distribution for p = 1/2, which gives positions near the middle of the sequence with a much higher probability for than positions near the ends. Randomized comparison functions applied to other sorting methods like merge sort may produce results that appear more uniform, but are not quite so either, since merging two sequences by repeatedly choosing one of them with equal probability (until the choice is forced by the exhaustion of one sequence) does not produce results with a uniform distribution; instead the probability to choose a sequence should be proportional to the number of elements left in it[citation needed]. In fact no method that uses only two-way random events with equal probability (“coin flipping”), repeated a bounded number of times, can produce permutations of a sequence (of more than two elements) with a uniform distribution, because every execution path will have as probability a rational number with as denominator a power of 2, while the required probability 1/n! for each possible permutation is not of that form.

Because because the Random.value is uniform, it basically causes the sorting algorithm to express its algorithm in your results.

Note that the “variant of the above” is referring to this paragraph:

An alternative method assigns a random number to each element of the set to be shuffled and then sorts the set according to the assigned numbers. The sorting method has the same asymptotic time complexity as Fisher–Yates: although general sorting is O(n log n), numbers are efficiently sorted using Radix sort in O(n) time. Like the Fisher–Yates shuffle, the sorting method produces unbiased results. However, care must be taken to ensure that the assigned random numbers are never duplicated, since sorting algorithms typically don’t order elements randomly in case of a tie.[9] Additionally, this method requires asymptotically larger space: O(n) additional storage space for the random numbers, versus O(1) space for the Fisher–Yates shuffle. Finally, we note that the sorting method has a simple parallel implementation, unlike the Fisher–Yates shuffle, which is sequential.

Basically if you were to assign unique random values to each element, and then sort, this would be completely suitable. But that’s not what passing an RNG into a sort method does.

Cannist · April 22, 2020, 5:42pm

Indeed I had missed that in the first post a random value was assigned each time the element would be inspected by the sorting algorithm. Thanks for clarifying.

If I wanted to nitpick I could still uphold my claim that if a total order is induced the assignment of random numbers the sorting algorithm would not introduce any bias. In all the counter examples we do not have a total order. But really, I just missed that we assign multiple different random values to the same element.

lordofduct · April 22, 2020, 10:03pm

It’s not that multiple values to the same element every time it’s inspected. But also the values aren’t unique across the set which means that ties won’t result in an actual “random” shift, it’ll always shift in the direction the algorithm does. Which also would end up statistically expressing itself.

A constant random value needs to be assigned to each element, and that value has to be unique across the entire set. (which is what the wiki article I quoted is saying). Then, yes, sorting on a random value will work.

So if you had:
A, B, C, D

And you assigned:
44, 243, 112, 6 (random byte)

That’d be fine.

But if you assigned
3,1,2,1 (random range 1-4)

Not fine, because 1 and 1 would tie, and the sort algorithm would shift B and D based on its rules, and not on an actual randomness (may it be it takes first, takes, second, whatever… depends on the algorithm). This would introduce a bias as well.

Thing is… how do you get a unique random set? Most simple algorithms out there suggest, get this, to fill a list with unique values and then shuffle them. lol.

This is why technically the Guid.NewGuid work can technically be ever so slightly better than Random.value in regards to uniqueness, because technically the values are unique across the entire set. But you’d still be pulling non-unique values per element. Which honestly… I don’t know personally how much bias that would introduce, but as I stated in my original post in regards to that one… it’s more expensive than the solution Smooth-P posted both in time complexity, and just because generating a Guid is more expensive than generating a random value. But with that all said, where the guid falls short in this discussion of bias is that guids aren’t uniform across the set, they’re only unique.

Cannist · April 23, 2020, 7:38am

I’m with you there, That was my original point. (If values aren’t unique we only have a partial order)

the_real_ijed · September 16, 2021, 8:11pm

This will return a shuffled version of any list you point it at;

List<GameObject> tempList = shuffleGOList(yourList);

This is the method;

private List<GameObject> shuffleGOList(List<GameObject> inputList)
    {    //take any list of GameObjects and return it with Fischer-Yates shuffle
        int i = 0;
        int t = inputList.Count;
        int r = 0;
        GameObject p = null;
        List<GameObject> tempList = new List<GameObject>();
        tempList.AddRange(inputList);
     
        while (i < t)
        {
            r = Random.Range(i,tempList.Count);
            p = tempList[i];
            tempList[i] = tempList[r];
            tempList[r] = p;
            i++;
        }
     
        return tempList;
    }

So you’d add that first line in a method where you need a shuffled tempList and then access that as you need to.

I believe this is efficient, after reading the thread. Anything I missed please let me know

EDIT: Fixed the issue pointed out by Antistone in the method.

Topic		Replies	Views
Shuffling 2 separate list in the same order. Unity Engine Scripting	17	1998	December 8, 2018
Shuffle Elements in a List Unity Engine Scripting	3	5686	February 8, 2018
Random game object reshuffling. Unity Engine Scripting	60	7741	January 23, 2022
Random Number without repeat Unity Engine Scripting	46	52741	June 13, 2024
Making sure every object from a list has been used Unity Engine Scripting , Question	8	630	April 23, 2023

Clever way to shuffle a List<T> in one line of C# code

Related topics