Performance descriptions or wiki elaboration #58

cadop · 2020-08-05T23:58:19Z

I looked at the readme and wiki, but I don't think performance is really covered beyond some mentions of "efficient ray intersection finding". Would it be possible to elaborate on the performance characteristics of nanort? I found nanort through the Embree issue about it not supporting double precision. One of the reasons I started with Embree was their paper on its high performance. For scientific computing, however, accuracy is also important.

Are there any benchmarks or even rough expectations for the difference between a single ray intersection with triangles in the BVH from nanort compared to Embree or the other raytracers?

syoyo · 2020-08-06T03:38:02Z

See #57 for rough estimates of NanoRT's performance compared to Embree, but I recommend measuring performance on your side (sharing the results would be appreciated).

As far as we know, no OSS ray tracing library other than NanoRT supports double precision, so it is difficult to say how performant double-precision NanoRT is relative to alternatives (in most cases double-precision NanoRT is fast enough, though).

cadop · 2020-08-06T10:33:25Z

Thanks, that was what I was looking for. Is the 3-4x slower referring to the double precision calculations? If so it sure seems to be a reasonable expectation compared to embree for double precision.

syoyo · 2020-08-06T11:23:08Z

@cadop 3-4x is for single precision.

cadop · 2020-08-10T23:25:40Z

I am still running more tests and checking whether I can improve my nanoRT integration, but here are my results so far for my own use case. I am posting them mostly as a reference for others, and to ask whether the numbers make sense to you; they should not be taken as a decisive metric. nanoRT is using doubles, and Embree is of course using floats. Times cover only the raycast loop (the timer is started after the BVH is created).
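For anyone who wants to reproduce this kind of measurement, here is a minimal sketch of timing only the raycast loop with `std::chrono`. The helper name `TimeRaycastLoopMs` and the `cast_ray` callback are hypothetical and are not part of NanoRT or Embree; the callback is assumed to wrap whatever traversal call your integration uses and to return whether ray `i` hit something. The BVH is assumed to be built before the helper is called, so that build time is excluded.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>

// Hypothetical helper (not a NanoRT or Embree API): times `cast_ray(i)` for
// i in [0, num_rays) and returns the elapsed wall-clock time in milliseconds.
// Build the BVH before calling this so the build is not part of the timing.
template <typename CastRay>
double TimeRaycastLoopMs(std::size_t num_rays, CastRay cast_ray,
                         std::size_t* hit_count) {
  assert(hit_count != nullptr);
  typedef std::chrono::steady_clock Clock;

  std::size_t hits = 0;
  const Clock::time_point start = Clock::now();
  for (std::size_t i = 0; i < num_rays; ++i) {
    // Counting hits keeps the compiler from optimizing the loop away.
    if (cast_ray(i)) ++hits;
  }
  const Clock::time_point end = Clock::now();

  *hit_count = hits;
  return std::chrono::duration<double, std::milli>(end - start).count();
}
```

Using a monotonic clock (`steady_clock`) and returning the hit count are deliberate choices here: the first avoids jumps from system clock adjustments, and the second gives a cheap sanity check that two libraries are tracing the same rays against the same geometry.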

Using a model with ~1000 vertices, 40,000 rays cast in a loop (single core):

Embree: 3ms (13,333 rays/ms)
nanoRT: 13ms (3,077 rays/ms)

Using a model with 320,068 vertices, 40,000 rays cast in a loop (single core):

Embree: 8ms (5,000 rays/ms)
nanoRT: 69ms (580 rays/ms)

Same model as above, but with 360,000 rays. (I expanded the grid used to cast rays, so the proportion of rays hitting or missing may differ from the previous case.)

Embree: 73ms (4,931 rays/ms)
nanoRT: 684ms (526 rays/ms)

Using a model with ~1,000,000 vertices, 40,000 rays cast in a loop (single core):

Embree: 16ms (2,500 rays/ms)
nanoRT: 261ms (153 rays/ms)

So, assuming I haven't messed up my integration, it seems like the model size has a much bigger impact on performance than increasing the number of rays does. Would this suggest the performance difference is really about BVH efficiency more than double precision?
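To make the trend explicit, the slowdown implied by the timings above is roughly 13/3 ≈ 4.3x for the ~1,000-vertex model, 69/8 ≈ 8.6x and 684/73 ≈ 9.4x for the 320,068-vertex model, and 261/16 ≈ 16.3x for the ~1,000,000-vertex model. A small sketch of that arithmetic follows; the function names `Slowdown` and `RaysPerMs` are made up for illustration.

```cpp
#include <cassert>

// Ratio of the nanoRT loop time to the Embree loop time for the same rays.
inline double Slowdown(double embree_ms, double nanort_ms) {
  assert(embree_ms > 0.0);
  return nanort_ms / embree_ms;
}

// Throughput in rays per millisecond for one timed loop.
inline double RaysPerMs(double num_rays, double elapsed_ms) {
  assert(elapsed_ms > 0.0);
  return num_rays / elapsed_ms;
}
```

For example, `Slowdown(8.0, 69.0)` and `Slowdown(73.0, 684.0)` compare the same model at 40,000 and 360,000 rays, while `Slowdown(16.0, 261.0)` covers the largest model. The timer resolution is 1 ms, so the ratios for the smallest timings are only approximate.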

Settings:

Windows 10, Visual Studio 2019 MSVC x64 release
Intel Xeon CPU E5-2630 v4
Enabled C++11 features in define for nanoRT
Only using obj, not storing/using .mtl

syoyo · 2020-08-11T07:46:04Z

One factor in the performance difference would be memory bandwidth. Embree also uses quantized bounding boxes for its BVH (at least for curve primitives) to reduce the memory footprint.

NanoRT always uses double precision for the BVH when double is the template parameter. This may be overkill for normal use cases, but may be beneficial for HPC applications (e.g. CAD, astronomy).

Also, there is room for a more efficient BVH build in NanoRT, especially by implementing a spatial split BVH: #15
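As a rough illustration of the memory bandwidth point above, the sketch below compares the size of a BVH node whose bounds are stored as float versus double. The `BvhNode` layout is a hypothetical one chosen for illustration and is not NanoRT's actual node struct; only the general point (six bounds values per node, so the bounds double in size with double) is taken from the discussion.

```cpp
#include <cstddef>
#include <cstdint>

// Hypothetical node layout for illustration only (NOT NanoRT's real struct).
template <typename T>
struct BvhNode {
  T bmin[3];                    // axis-aligned box minimum
  T bmax[3];                    // axis-aligned box maximum
  std::uint32_t left_or_first;  // child index, or first primitive for a leaf
  std::uint32_t count;          // primitive count (0 for inner nodes)
};

// Bytes used by the bounding box alone.
template <typename T>
constexpr std::size_t BoundsBytes() {
  return 6 * sizeof(T);
}

// Total bytes per node, including any padding the compiler adds.
template <typename T>
constexpr std::size_t NodeBytes() {
  return sizeof(BvhNode<T>);
}
```

On a typical x86-64 build this hypothetical layout comes to 32 bytes for `BvhNode<float>` and 56 bytes for `BvhNode<double>`, so fewer double-precision nodes fit in each cache line. That would be consistent with the gap widening on larger models, though it is only one possible contributor.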

jbikker · 2024-11-07T09:28:40Z

I did some initial performance comparisons between tiny_bvh and nanort; see the speedtest application in the tiny_bvh repo. The findings are strange: for a 150k-triangle model, nanort is about three orders of magnitude slower. Am I doing something wrong in my nanort integration? I am closely following the approach shown in the obj viewer example.
