The following thread is strictly for entertainment and giggles, so please do not take anything here too serious or too literal. There have been applied some very rough cuts in the corners in a mathematical sense, and a few things have been altered slightly to give a somewhat more concise representation. There are a few important things to keep in mind when evaluating the graphs. The data is a sampled set from about the last thousand threads in the collaboration forum only, and when referring to the users, it refers to the thread starters in that set only. These threads includes things like closed, edited and moved threads as well. All data have been normalized and you will see percentages of the sample set, rather than any absolute numbers. Note that since there is no direct metric for determining the success of a collaboration thread, all of the following information is made just out of curiosity and to determine trends and averages. Given this lack of a predicate they will not provide any direct conclusions, although indirect conclusions will be formed occasionally.
If we start by looking at threads that actually uses images to advertise their projects, against those that do not, we get that on average each thread contains a stunning 0.46 images. Now this is not a bad number at all, when you consider that some threads might just be people looking to join a project and in that regard has nothing to post. It also doesnât include any images buried under external links. We choose to exclude these, as getting members for your project is about instant attention catching, and posting a direct image will instantly portrait your concept and ideas â while a linked image does not. It should be noted that the maximum amount of images you are allowed to post is twenty. Now let us take a look at how many images the different threads uses and the distribution between threads with none, those with a few and those with many, we get these:
But hold on â what is happening here. We clearly had near half an image per post on average and yet, there is a vast majority of threads that contains no images at all. Clearly the average is getting pulled quite a lot up by threads with a lot of images. Now we could bore you with some statistical numbers like the variance, but the meaning of the number would be lost on the majority of the readers, so instead letâs look at a binary distribution on threads that has at least one against the threads that doesnât have any. This surprisingly shows that nearly 80% of threads have no images at all.
Now before we take a closer look at the actual content of these threads, let us take a moment and just look at who the people posting the threads actually are. If we look at how active the users are in terms of their posting count (excluding posts on Questions-&-Answers as that is counted separately to the forum post count), we get that on average each user has 68.4 posts measured at the time of the data collection. The high rollers in this regards in this reaches over 3000 other posts. As the first graph shows, an overwhelming amount of users that have very little other posts, and it has a very quick falloff. Compared to a normal forum post distribution this gets a much higher falloff, nearing a falloff rate of question related forums. This was not totally unexpected, and it is quite hard to read anything into the the data, even when focusing only on the first month on its own. However from the data, it shows that about 20% of thread starters has no other posts than their initial collaboration entry. Now we ask the question, how many users are actually active posters, and do a binary distribution on users with five or less posts against those that has above five. This gives a depressing view of nearly 45% of the thread starters are inactive forum users.
Now seemingly a user might be someone who registered a long time ago, and have been using the forum as a source of information or just never had a reason to post anything in the forum. It is a little unfair to label these as âpeople who just register to post a single thread then leave againâ so let us take a look at the time measured in months, since a user initially registered on the forum to when they made their collaboration thread. This tells us about how long a user have potentially been a member of the community, before trying to create a project. The light shines through, as we see people who have been members for years upon years before posting, as seen on the first graph here. Although our assumption from the post count before, that people are likely to just register and post, then leave, looks justified as nearly 20% of users creates their collaboration thread within a week after joining the forum. An interesting thing that we did not look at, was how many of these âinsta-postersâ were also single-posters, as there might be an fair overlap. However we choose to believe that the fact that 80% of users have at least a week on the forum also means they donât just register to post, although the first graph shows that most posts is still within a month of join.
Now with no better place to put this, letâs just go through it here. How active is the collaboration forum, is it increasing or decreasing and when does it peek. Now notable the data set generated were scraped in a backwards fashion from today and then going back in time, we might miss early trends. But if we look at the amount of posts per month, measured in months back in time, we can see that (we did intentionally not include any regression analysis here) the activity is pretty much stable over the last 18 months with averaging about between 100 and 200 new threads per month, with what could lightly be called an increasing amount.
Now this segment is pretty vague and non descriptive, as again there is no way to link any correlation between the values and averages to what makes a good post. Nonetheless, it was a metric that was really easy to include, even if it doesnât give any notable information. The following are graphs over thread title lengths and the word count of the thread content, measured to scale. Some of the very low counts are threads that have been altered to say âcloseâ or âfilledâ by the users, and have not been deleted by moderators.
The following is a series of different groupings of words from the thread contents, and their relative frequencies of use. Thereâs a lot to be said about this form of data analytic. People use different spelling, abbreviations, synonyms and misspelling, so a major step to give a somewhat accurate overview was to group up a lot of these into the same groups. This means that words as âC#â and âCsharpâ have been compiled into same group, and âmmorpgâ have been added to âmmoâ, as two examples. There are endless groupings you could do, and we choose to select a few that seemed reasonable and funny to show. The data from these are not normalized, so the same word being used multiple times in a thread will count multiple times. Considering the large amount of data set that we ran this on, there were lots of similar words that would fit into the group categories, but simply did not have enough counts to warrant adding them (someone actually did use the words lisp, fortran and cobol). Remember that word frequencies is a thing that is very hard (impossible) to get perfect, so read into them easily. The size of the graphs are the relative frequencies between the words shown. If anyone is interested in the full word list with counts, just contact me.
Now for some closing words, we lied, they wonât be about unicorns. If you have any comments or feedback, or metrics you want us to add to this, just throw me a message and we will look into adding them. Depending on how well this is received, we might do the same for the questions and answers forum, where we can actually measure the success as well. if you liked this thread, if you hated it or if you just thought it was cool that someone wanted to use their time to contribute to an amazing community, then give the post a like or send us your thoughts and most of all, enjoy using Unity!