Isadora movie player performances with respect to VLC
Hi,What's your codec ?Would you give a try with threaded movie playback disabled ?Would you give a try playing back all the movies on only one single stage ( using zoom & position inputs of the projector to but them side by side ) ?Thank you.Mehdi
Dear ikse,Just to be clear -- these are WMV or AVI movies, yes?Best Wishes,Mark
Dear Keftaparty, dear Mark,
These were WMV movies with basic Windows VC1 codec. Please find attached a more detailed description of the video and audio formats.
The codec is not really an issue for me as it is a native Windows codec and as VLC reads it smoothly. Some time ago, I did run rather extensive tests on the best codec to be played with Isadora - I should have posted it but I was still struggling with CPU issues. In summary, as codecs are concerned, I installed the ffmpeg library and found out that VC1 competes fairly well with other available codecs when the latter were performed natively by windows (through ffmpeg). Of course, checking the Plays Windows Movies Formats natively was essential. Otherwise decoding was not carried as a multithreaded process and Isadora freezes. I did try mp4 but I could not have them decoded by ffmpeg. Quicktime take care of them - not really efficiently on Windows. So I had to have h264 codec embededded in avi containers, which worked fine. Last week-end, to make things plain, I fully reinstalled Windows then I only installed Quicktime, Isadora, and VLC to run the late tests - a very clean bench test, I believe.
I have run the tests, Keftaparty, you suggested. Please find attached the corresponding screenshots. In summary, the "threaded movie playback disabled" and the "all movies on one stage" did not change anything I could notice. I did try while hidding the stages: It really frees the core Isadora loads. I guess it does represent the work Isadora has to handle to send the video data to the GPU. Any suggestion to reduce the load would be welcome.
What still remains is the cpu load difference between Isadora and VLC while handling video fluxes. I would be happy to understand where the issue is and if it can be solved. I would be happy to hear that I am doing things the wrong way.
Thanks again for your help,
Hi Ikse,I personnaly never had very nice experiences with WMV ( I maybe never found the good "tunings" ).I use AVI files, and the blackmagic codecs ( you can get them by installing their desktop video software). I manage to handle 8 layers of 1080p in isadora ( with SSDs of course ).It would be interesting to compare isadora to windows media player. Because I think (hope) that it uses the native video playback of windows.Comparing Isadora to VLC is a bit complicated, as VLC has it's own engine to decode videos ( witch is quite powerfull ). And isadora is not using this engine.Also one curiosity question: with such powerfull workstations, why do you want to avoid using your CPUs ? They have to get warm !One more 2 cents advice: I found sometimes isadora performing better when setting the target fps to 60 ( like my monitors ).Also, it's really great that you provide such good and well documented tests ! Thanks.BestMehdi
Thanks for your feedback. I agree it is definitely possible to run that many HD video fluxes with Isadora with blackmagic codecs, or even WVC1, or h264. I ran some initial tests before reinstalling the OS with various codecs. The issue is that, as soon as one of the cores saturates, Isadora freezes. The project is not only dealing with video streaming to six monitors but it has a lot of expected real time interactions with the passage of seven tracked people. Hence I need to spare as much CPU as I can to allow for real time to happen - mostly CPU on the core that Isadora uses.
I know the comparison with VLC is not totally fair but I believe only one instance of Windows media player is possible. I also ran tests with the Classic media player and the comparison was not favorable to Isadora. I can rerun then. Sorry, I was just reluctant to reinstall too many things on the machine so far.
Best from Paris,
P.S.: Despite every things one can have against microsoft stuffs, VC1 is pretty light both in terms of filesize and cpu decoding. I guess I will reinstall every codecs, rerun and record the tests to put things down.
P.P.S.: The nice thing with ffmpeg is that you can make sure that it is effectively ffmpeg which does the decoding job as you can allow for icons to pop out when it is working! It works well. Multi-threading is just great then.
I have no experience in tracking.Is it so process heavy ?What video sources and capture material do you use ? This can make a big difference.What about running it on a non rendering machine and send results to your rendering machine with osc ?Would it be possible to run multiple instances of isadora ?Windows should distribute the processes on the cores ( am I wrong ? ).For tracking etc... two instances can communicate with osc.Mehdi
You are right again. The tracking in itself is not that heavy. It is actually confined in one of the cores of the computer and performed by a home-programmed script, which sends via osc the position (x,y,z) and velocity (Vx,Vy,Vz) of seven tracked heads using 2 kinects. The heavy part comes when the video streams must be processed according to these parameters independently and according to different scenarii: 7 local displace given by local 3D projector, which position is given by (x,y,z) , generation of associated 3D objects. So the machine we are talking is the rendering one. The first attempts with a low resolution video looked like that www.la-lumiere-ne-s-arrete-pas-la.org
It is possible to run multiple instances of isadora but the last attempt was not very successful when dealing with high definition video streams - it crashed too quickly. I did not see any distribution over the cores. I should certainly give it another try.
Dear All,I'm in the midst of the workshop in Munich. I'll need some time to answer these questions with any level of intelligence. I've bookmarked this discussion so I can come back to it next week. Sorry for the delays.Best Wishes,Mark
Thanks for the forthcoming reflections on the subject - hot and burning for us!
Enjoy the workshop in Munich!
P.S.: If you happened to pass by Paris on your way back, I would be happy to welcome you at the neticLab www.neticLab.org
Hi Ikse,Are you manipulating the images only with the projector actors ( 3d quad distort etc... ) ?Are you using any other actors to manipulate the images ?Are you using 3d renderers ? If yes at witch resolution ?Can you post a screenshot of the stage tab of your isadora preferences ?Can you post a screenshot of your nvidia's driver screens configuration ( The "configurere Surround et PhysX" tab would be perfect for what I want to see... )?If you use only projector actors, you should not have any performance problems ( they are gpu based ).On windows, Isadora was a bit capricious when using multiple stages on multiple monitors.I see you have 3 x quadro cards, are they SLI or are they independant ?Mehdi
I have two main superactors:
1. The first one displaces a large resolution background movie with seven 3D rect Projectors sent to a single channel to be rendered by a 3D renderer. Ideally the resolution of the seven movies and the 3D renderers should match the resolution of the background movie to keep the latter resolved (minimal targeted matrix : 2000x1600). The vertical size of 3D renderer is anyway limited to 768. It obviously freezes when approaching a decent resolution (if I remember well it saturates for something like 512x512).
2. The second one generates seven 3D objects (sphere for now) on the same large resolution background movie with seven 3D players sent to a single stage to be rendered by a 3D renderer. It works fine until I try to play either the background movie or project movies onto the 3D objects. The target obviously is to play both at the same time.
I would agree with you on the expected performances with projectors. However, the projectors might be gpu-based but I guess the associated movie players take over the cpu load. So it freezes even in the second case described above.
The three Quadro cards are independant.
I will post the screenshots and come back to you as soon as I go back to the neticLab, hopefully later today.
Have a nice day!
Xavier,I think one problem here is the 3d renderer. It's performance eating on heavy resolutions.Also, when working with 3d objects, you need to take care to some facts:-It's "complicated" to share the same 3d object to multiple graphic cards, even on multiple "heads" of a same graphic card.I've had some issues with that kind of setups in the past.If you want we can skype ( in french ). Send me a private message with your skype name if you want.Mehdi
I have arrived later than expected. Please find below the screenshots. I have not looked into the setting of the graphic cards. Any improvement is certainly welcome.
However, I don't think it is an issueso far for the 3D objects as with the seven of them, it runs smoothly. It is true that there were shared between graphics cards and within a card as they travel through three videoprojectors (2x1280800+1x12801024).
A Skype would be great to sort this out when you have some time.
Meanwhile I will post a late series of tests on cpu loading.
For a fairer comparison, I have installed K-Lite_Codec_Pack_930 for 32 and 64 bits. It comes with the Media player classic (MPC). It is then possible to run both Isadora with six movie players and six projectors and to compare it to six MPC while making sure that they both use the same decoding process (CPU or GPU hardware accelerated).
I tried several codex configurations (still for the same WVMC1 encoded HD movie). Please find the outcomes in the series of screenshots below (the whole series is compressed in the last file).
The file names are self-explicit. However, in summary, it appears that:
- There is a substantial gain to make use of the ffdshow decoder (the total CPU load falls from 39% down to 18%)
- Isadora does not stand Nvidia Cuvid (GPU acceleration) but runs with DXVA2copyback but without any gain
- Showing the stages cost quite a bit in CPU load (the total CPU load falls by half again from 18-19% down to 10-12%) which is pretty surprising for me. At the same time, the GPU load is substantially reduced (figure again)
- Confining Isadora to 8 cores does not change the overall CPU load - as one can expect
- Native codex 38-39%
- FFv codex 18-19%
- Nvidia Cuvid -> Crashes
- DXVA2copyback 18-19%
- FFv codex + Hidden stages 10-12%
The overall CPU load is still much less than Isadora
- Native codex 8%
- FFv codex 7%
- Nvidia Cuvid 6%
- FFv codex + Nvidia Cuvid 6%
As it limits our developments, I would be happy to understand why there is such a need of CPU power for playing videos with Isadora and if there are ways to go around. The critical states are obviously exemplified in the screenshots with DXVA2copyback where there is one core (the one that Isadora uses I guess) which is overloaded. It is close to saturate and Isadora, to freeze.
Again, I am hopefully doing something wrong here. The system is as simple as can be with only the required softwares and codex to run the tests (+ the monitoring gadgets).
Thanks for your feedback,
Dear All,Thank you for this continued, very detailed research. I forgot to say that, after Munich, I was going to the USA, where I am now. But, my travels are almost over I return to Berlin on Monday. Then, finally, I can properly digest and look carefully at all you have to say.Best Wishes,Mark
Did you happen to take note as to wheather or not any of these codec option allowed for looping. I have been doing a number of windows native + Qt HD test as well (not as well documented) and a big problem for me with most windows options is that the video can not be looped.
Looping is not an issue with the VC1 codec. It loops indefinitely under Isadora. I did face some looping issues with lossless avi-contained codec - namely there was no end trigger, so no looping possibility. The player could not figure out that the video had reached its end, whereas, surprisingly enough, it was possible to get its time duration with Media time to percent so I did generate the trigger with a time count down... not very clean.
As it appears more and more clearly to me now, the codec is not that an issue within Isadora. It can be circumvented by the Plays Windows movie formats natively so Isadora does perform as well as another player. At least the movie player actor does conform to this. The problem seems to lie in the projector actor.
I'll come with additional tests on this.
Dear Mehdi, Mark, DusX,
Following some Mehdi's advices, please find attached some last results that answer two questions:
- Projecting on a full monitor screen costs more cpu load (mostly onto the core Isadora uses) than projecting on a preview screen. The gpu load is barely affected.
- Projecting several movies on a single preview screen or on separate preview screens do not change the cpu load nor the gpu load
One could infer from these tests that Isadora runs some additional cpu calculations on the initial image matrix whatever the targeted screen may be. The purpose of such calculations are rather unclear to me as they occur only when the movie player actor is connected to the projector actor whereas the process should be handled by the gpu only.
I'll hopefully come with some more later today.
Best regards from Paris,
Perhaps this issue is related?
Have any of you tested with the movie player direct?
Dear DusX, Mehdi, Mark,
I did try with the movie player direct and it did not improve the performances. I am convinced now that the limitations come from the projector itself.
I ran another series of comparisons within Isadora this time:
1. Again, the preview seems to be lighter on the cpu than a full screen projection:
- it is clear and expected between a 1920x1080 monitor and a 320x240 preview window
- it is clear but much less expected between an 1280x800 monitor and a 1280x800 preview window (the Isadora core is at 44% and 27% for a total cpu usage of 3 and 4% respectively)
2. Blackmagic codec works fairwell as long as the hardrive bandwidth can keep up with the load of data to be transfered (it is certainly a reason for the difficulty of looping mentioned above... the time is over but there are still some jetlagged frames coming in). It actually works very very well with SSD and ffmpeg together. Decoding is very light and well spread on the cores. However the core Isadora uses reaches 100% with four HD movies.
3. Projector, 3D Quad distort, 3D Rect project actors behave more or less the same way in terms of cpu load: 86%, 77%, and 89% on the Isadora core, 9%, 10%, and 12% overall respectively.
4. Projector actors take nearly half of the cpu load as, when removed, the overall usage falls down to 7% and the Isadora core is lowered to 36%.
The last point is striking as one may expect the projector actors to be relatively free from the cpu and handled by the gpu. In summary, I should say that the movie decoding is properly handled and it is only limited by the local software (codecs) and hardware (hardrive access and cpu power). In contrary, the display (projector actors) does not seem to work the way it should and weight too much onto the cpu (essentially onto the core Isadora uses) for large resolution movies (it is rather transparent for standard resolution movies) whereas it is expected to be gpu-handled.
I hope there is reason and a solution behind. I might still be doing things the wrong way and I will be happy to correct that.
Thanks for your help.
P.S.: Mark, is there any reason for limiting the vertical size of the 3D renderer today?