diff --git a/Broadcom-Videocore-IV-Performance-Recommendations.md b/Broadcom-Videocore-IV-Performance-Recommendations.md index 288d7e2..38a70e8 100644 --- a/Broadcom-Videocore-IV-Performance-Recommendations.md +++ b/Broadcom-Videocore-IV-Performance-Recommendations.md @@ -14,6 +14,7 @@ Triangles that cover very few pixels (think less than 32) will be rasterized ver ## Tile based architecture The Broadcom Videocore IV GPU is a tile based GPU (but not deferred) therefore it's important to sort your geometry front-to-back to avoid any unnecessary overdraw. +Face culling can also help eliminating unnecessary rasterization costs. ## ALU architecture The Broadcom Videocore IV GPU has a dual-issue scalar FP32 ALU. This means that it can execute up to two instructions per cycle using its ADD and MUL ALUs. To maximize utilization it's important to fully saturate both ADD and MUL pipelines.