Author Topic: Regarding SSE Builds  (Read 30321 times)

0 Members and 1 Guest are viewing this topic.

Offline Zacam

  • Magnificent Bastard
  • Administrator
  • 211
  • I go Sledge-O-Matic on Spammers
    • Minecraft
    • Steam
    • Twitter
    • ModDB Feature
This topic is for the discussion related to released SSE/SSE2 Builds. (Release and Debug)

This post is a placeholder.
Report MediaVP issues, now on the MediaVP Mantis! Read all about it Here!
Talk with the FSU on #SCP-FSU Talk with the SCP on #SCP
"If you can keep a level head in all this confusion, you just don't understand the situation"

¤[D+¬>

[08/01 16:53:11] <sigtau> EveningTea: I have decided that I am a 32-bit registerkin.  Pronouns are eax, ebx, ecx, edx.
[08/01 16:53:31] <EveningTea> dhauidahh
[08/01 16:53:32] <EveningTea> sak
[08/01 16:53:40] * EveningTea froths at the mouth
[08/01 16:53:40] <sigtau> i broke him, boys

 

Offline Aardwolf

  • 211
  • Posts: 16,384
    • Minecraft
K, I'll start things off. What are they?

 

Offline chief1983

  • Still lacks a custom title
  • Moderator
  • 212
  • ⬇️⬆️⬅️⬅️À➡️⬇️
    • Minecraft
    • Skype
    • Steam
    • Twitter
    • Fate of the Galaxy
r u srs?

They are builds, with the SSE or SSE2 extensions enabled for fasting processing of certain types of arithmetic, optimized at the compiler level.  Beyond that, google it.
Fate of the Galaxy - Now Hiring!  Apply within | Diaspora | SCP Home | Collada Importer for PCS2
Karajorma's 'How to report bugs' | Mantis
#freespace | #scp-swc | #diaspora | #SCP | #hard-light on EsperNet

"You may not sell or otherwise commercially exploit the source or things you created based on the source." -- Excerpt from FSO license, for reference

Nuclear1:  Jesus Christ zack you're a little too hamyurger for HLP right now...
iamzack:  i dont have hamynerge i just want ptatoc hips D:
redsniper:  Platonic hips?!
iamzack:  lays

 

Offline jr2

  • The Mail Man
  • 212
  • It's prounounced jayartoo 0x6A7232
    • Steam

 

Offline Sushi

  • Art Critic
  • 211
Yikes, Wall o' Wiki!

Practical upshot is that SSE/SSE2 builds theoretically can improve your FPS.

 

Offline Topgun

  • 210
then why doesn't everyone use them?

 

Offline The E

  • He's Ebeneezer Goode
  • Global Moderator
  • 213
  • Nothing personal, just tech support.
    • Skype
    • Steam
    • Twitter
Because, until very recently, they have not been made regularly.
And they have a noticeable effect only in situations where the CPU is bottlenecking your performance.
**** every cause that ends in murder and children crying. ― Iain Banks
Join the fun at the HLP IRC channel. Get the latest spam and gossip as long as it's fresh!

 

Offline Herra Tohtori

  • The Academic
  • 211
  • Bad command or file name
Because, until very recently, they have not been made regularly.
And they have a noticeable effect only in situations where the CPU is bottlenecking your performance.

...Which, for most users with demi-recent GPU (GeForce 8 series or newer - even high-end 7 series models tend to do pretty well) is the limiting factor in most cases of slowdowns.

Particles and collision detection are done on CPU at least, hence you should have better performance with explosions, beams, weapons with particle spawn, complex models, very numerous models, very numerous weapon blobs flying around and probably some other things that I have no idea about should work faster with the newer instruction set support.

Basically it allows the game to use more advanced features of the central processor unit, which should allow faster data execution.


What sort of benchmarks have been done already? Massive Battle with FRAPS benchmarking the FPS? What sort of results the SSE2 builds actually deliver (and how do other optimizations affect things) in numbers game?
There are three things that last forever: Abort, Retry, Fail - and the greatest of these is Fail.

 

Offline Zacam

  • Magnificent Bastard
  • Administrator
  • 211
  • I go Sledge-O-Matic on Spammers
    • Minecraft
    • Steam
    • Twitter
    • ModDB Feature
Another thing to note about the potential benefit of SSE vs Regular builds is that, while FPS raises may not be greatly enhanced (Mura, for example, might see an average gain of 3-4 FPS), the over all _feel_ should be slightly smoother, and it's recovery from events that would drop FPS should be faster.

This should also hopefully result in smoother, more acceptable gameplay even if the over all FPS increase is minimal, as the FPS will also be handled by the rating of the GPU. For myself, because the CPU was able to do more (and I already had a beefy GPU) my FPS increase was fairly significant.
Report MediaVP issues, now on the MediaVP Mantis! Read all about it Here!
Talk with the FSU on #SCP-FSU Talk with the SCP on #SCP
"If you can keep a level head in all this confusion, you just don't understand the situation"

¤[D+¬>

[08/01 16:53:11] <sigtau> EveningTea: I have decided that I am a 32-bit registerkin.  Pronouns are eax, ebx, ecx, edx.
[08/01 16:53:31] <EveningTea> dhauidahh
[08/01 16:53:32] <EveningTea> sak
[08/01 16:53:40] * EveningTea froths at the mouth
[08/01 16:53:40] <sigtau> i broke him, boys

 

Offline Sushi

  • Art Critic
  • 211
So when are these optimizations to SSE builds I've been hearing about going to make it into trunk?

 

Offline chief1983

  • Still lacks a custom title
  • Moderator
  • 212
  • ⬇️⬆️⬅️⬅️À➡️⬇️
    • Minecraft
    • Skype
    • Steam
    • Twitter
    • Fate of the Galaxy
I wasn't aware of any optimizations to the SSE builds themselves, just that there have been some SSE-enabled (and therefore optimized) builds made available recently.
Fate of the Galaxy - Now Hiring!  Apply within | Diaspora | SCP Home | Collada Importer for PCS2
Karajorma's 'How to report bugs' | Mantis
#freespace | #scp-swc | #diaspora | #SCP | #hard-light on EsperNet

"You may not sell or otherwise commercially exploit the source or things you created based on the source." -- Excerpt from FSO license, for reference

Nuclear1:  Jesus Christ zack you're a little too hamyurger for HLP right now...
iamzack:  i dont have hamynerge i just want ptatoc hips D:
redsniper:  Platonic hips?!
iamzack:  lays

 

Offline Zacam

  • Magnificent Bastard
  • Administrator
  • 211
  • I go Sledge-O-Matic on Spammers
    • Minecraft
    • Steam
    • Twitter
    • ModDB Feature
Octually, SSE as a build type has been available in the MSVC_2008 project for awhile. But these builds are based of a private selection of flag and options changes to those projects that do increase SSE/SSE2 performance over what is currently put out when left in the current public settings.

And they will make it in to trunk as soon as I am satisfied that no other possible combinations exist to enhance performance and once frequent enough testing has taken place that proves the stability of these builds.

I'll the submit the .patch and the additional files I have place into my working dir for consideration, and should nobody find any issue with there being any conflictive settings that the compiler is too stupid to catch that I'm not aware of, it'll likely make it in to a trunk commit hopefully before too long.
Report MediaVP issues, now on the MediaVP Mantis! Read all about it Here!
Talk with the FSU on #SCP-FSU Talk with the SCP on #SCP
"If you can keep a level head in all this confusion, you just don't understand the situation"

¤[D+¬>

[08/01 16:53:11] <sigtau> EveningTea: I have decided that I am a 32-bit registerkin.  Pronouns are eax, ebx, ecx, edx.
[08/01 16:53:31] <EveningTea> dhauidahh
[08/01 16:53:32] <EveningTea> sak
[08/01 16:53:40] * EveningTea froths at the mouth
[08/01 16:53:40] <sigtau> i broke him, boys

 

Offline Aardwolf

  • 211
  • Posts: 16,384
    • Minecraft
Re: Regardin' SSE Builds<p/><font color=\
I shall try this and see if it even works on me computer.   :p

Edit (stupid pirate script is making me de-piratify my original post):
Code: [Select]
Assert: !resize
File: gropengltexture.cpp
Line: 622

<no module>! KiFastSystemCallRet
<no module>! WaitForSingleObject + 18 bytes
<no module>! SCP_DumpStack + 260 bytes
<no module>! WinAssert + 208 bytes
<no module>! opengl_create_texture_sub + 2711 bytes
<no module>! opengl_create_texture + 998 bytes
<no module>! gr_opengl_tcache_set_internal + 217 bytes
<no module>! gr_opengl_tcache_set + 137 bytes
<no module>! opengl_render_pipeline_fixed + 1895 bytes
<no module>! gr_opengl_render_buffer + 232 bytes
<no module>! gr_render_buffer + 58 bytes
<no module>! model_render_buffers + 1947 bytes
<no module>! model_really_render + 2816 bytes
<no module>! model_try_cache_render + 55 bytes
<no module>! model_render + 676 bytes
<no module>! labviewer_render_model + 2253 bytes
<no module>! labviewer_do_render + 147 bytes
<no module>! lab_do_frame + 149 bytes
<no module>! game_do_state + 1453 bytes
<no module>! gameseq_process_events + 237 bytes
<no module>! game_main + 728 bytes
<no module>! WinMain + 330 bytes
<no module>! __tmainCRTStartup + 358 bytes
<no module>! WinMainCRTStartup + 15 bytes
<no module>! RegisterWaitForInputIdle + 73 bytes

Dunno if it's related to the SSE(2) tho. I'll try with a normal build and compare.
« Last Edit: September 19, 2009, 04:23:44 pm by Aardwolf »

 

Offline The E

  • He's Ebeneezer Goode
  • Global Moderator
  • 213
  • Nothing personal, just tech support.
    • Skype
    • Steam
    • Twitter
Is there a full debug log?
**** every cause that ends in murder and children crying. ― Iain Banks
Join the fun at the HLP IRC channel. Get the latest spam and gossip as long as it's fresh!

 

Offline Aardwolf

  • 211
  • Posts: 16,384
    • Minecraft
I just mantis'd this, it's issue #1994. I'll upload a debug log there in a minute.

 

Offline Zacam

  • Magnificent Bastard
  • Administrator
  • 211
  • I go Sledge-O-Matic on Spammers
    • Minecraft
    • Steam
    • Twitter
    • ModDB Feature
The hell?

That is, uh, rather interesting. Does that happen to both SSE builds (SSE and SSE2)? Is it Debug or Release?
Report MediaVP issues, now on the MediaVP Mantis! Read all about it Here!
Talk with the FSU on #SCP-FSU Talk with the SCP on #SCP
"If you can keep a level head in all this confusion, you just don't understand the situation"

¤[D+¬>

[08/01 16:53:11] <sigtau> EveningTea: I have decided that I am a 32-bit registerkin.  Pronouns are eax, ebx, ecx, edx.
[08/01 16:53:31] <EveningTea> dhauidahh
[08/01 16:53:32] <EveningTea> sak
[08/01 16:53:40] * EveningTea froths at the mouth
[08/01 16:53:40] <sigtau> i broke him, boys

 

Offline Aardwolf

  • 211
  • Posts: 16,384
    • Minecraft
I don't think it's actually SSE-related. It seems to just be debug builds in general.

Here's the link to the issue on Mantis: clicky

 

Offline Zacam

  • Magnificent Bastard
  • Administrator
  • 211
  • I go Sledge-O-Matic on Spammers
    • Minecraft
    • Steam
    • Twitter
    • ModDB Feature
Seems to be the CubeMap.

Previous versions of the cubemap where mip-mapped. WoolieWools cubemap is not.

I resaved the cubemap to have mip-mapping, since I think the error is in !resize function of trying to create the mip maps dynamically through the -mipmap flag. Which is strange because it should be barfing on a lot of the other non-mip mapped effects files if that were the case.

Try this cubemap: http://www.mediafire.com/file/mqo01mgjlyn/CubeMap.dds

Place in MediaVPs\data\effects. If that fixes the problem, I'll commit it to the MediaVPs SVN.
Report MediaVP issues, now on the MediaVP Mantis! Read all about it Here!
Talk with the FSU on #SCP-FSU Talk with the SCP on #SCP
"If you can keep a level head in all this confusion, you just don't understand the situation"

¤[D+¬>

[08/01 16:53:11] <sigtau> EveningTea: I have decided that I am a 32-bit registerkin.  Pronouns are eax, ebx, ecx, edx.
[08/01 16:53:31] <EveningTea> dhauidahh
[08/01 16:53:32] <EveningTea> sak
[08/01 16:53:40] * EveningTea froths at the mouth
[08/01 16:53:40] <sigtau> i broke him, boys