Floating point

8 posts, 5 voices

Jun 14, 2011 9:14pm Matthew Phillips (473) 721 posts	Could we have a bounty to do with exploiting floating point support in the new hardware? We probably need several things: Firstly a replacement FPE module for Beagleboard etc. which will reinterpret old-style FPA instructions and execute them on the VFP or NEON units (whichever gives more benefit). This would immediately give a speed benefit to any software using FPA instructions. This would include hand-crafted stuff, BASIC VI, and presumably C programs using floating point. I’m not sure what compiler and OS support is like for VFP or NEON, but once software starts being compiled to use these instruction sets there may be a need for a VFPE module for older hardware to make it easier for developers to support the wide range of machines still in use.

Nov 2, 2011 6:47am Matthew Phillips (473) 721 posts	Is no-one interested in this bounty proposal? It would be good to get wider exploitation of the hardware floating point in the BeagleBoard, rather than just a few demos. Is my analysis of the requirements way off the mark, or something?

Nov 2, 2011 1:47pm Jeffrey Lee (213) 6048 posts	Here’s my 2p:
“Firstly a replacement FPE module for Beagleboard etc. which will reinterpret old-style FPA instructions and execute them on the VFP or NEON units (whichever gives more benefit).”
Although I’ve suggested before that we create a replacement FPEmulator, I’m not actually sure how much of a performance benefit it would give. It could also take a lot of effort to make sure the new code is as accurate as the original code.
“BASIC VI”
I think the better choice for BASIC64 would be to modify it to use the VFP instructions directly. Unfortunately I think it’s possible that some programs use knowledge of the inner workings of BASIC to modify the state directly (e.g. in assembler routines) – so if BASIC64 suddenly switched from using FPA instructions and registers to VFP instructions and registers (and to storing the floating point values in little-endian word order instead of big-endian word order) then those programs could break.
“C programs using floating point.”
I don’t think we should worry too much about existing C programs. Any programs which make heavy use of floating point (or where floating point was a major performance issue) will have long ago switched to using GCC’s softfloat support, or to hand-written fixed point routines. Therefore any remaining programs will only be making light use of floating point, so wouldn’t see any significant gains from using a new FPEmulator. And any new programs which need to make heavy use of floating point should surely take into account the fact that all future machines will have VFP/NEON available – i.e. they should be using VFP/NEON by default (and relying on a VFPE module for running on old machines), or there should be two (or more) different versions of the program available depending on the user’s machine type (not ideal, but it’s the only way you’d get the best performance for everyone).
“I’m not sure what compiler and OS support is like for VFP or NEON”
Basic OS support for VFP/NEON has been available for about a year now. Assembler support for VFP/NEON is pretty good (extASM, objasm & GCC 4.6 support the full ARMv7 instruction set, and BASIC is in testing). C compiler support is a bit lacking though – there’s been no announcement from ROOL as to when they’re aiming to add support for it to their tools (although it might be somewhat dependent on this bounty), and the patch I sent to the GCC team near the start of the year – that would have enabled full VFP/NEON support in C/C++ – doesn’t seem to have made it into their source repo yet (I should probably chase them up on that).
“but once software starts being compiled to use these instruction sets there may be a need for a VFPE module for older hardware to make it easier for developers to support the wide range of machines still in use.”
This is something I’ve suggested myself in the past as well, but it is a lot of work to create a full-blown floating point emulator, and the performance wouldn’t be as good as if a separate non-VFP/NEON version of the program was produced. So it’s tempting to say that people should just deal with the fact that some machines will have VFP/NEON while others won’t. But until “proper” programs start appearing that make use of VFP/NEON we won’t really know if that approach is something programmers/users will be happy with.
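The word-order point raised above for BASIC64 can be illustrated with a small C sketch. It assumes that an FPA double keeps the most significant 32-bit word first in memory while a VFP double keeps the least significant word first, and that the host's native double uses the VFP layout. The function names are hypothetical and are not taken from BASIC, FPEmulator or any RISC OS API.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Swap the two 32-bit halves of a 64-bit double image.
   Assumption: FPA and VFP doubles differ only in the order of their two
   words; the byte order inside each word is left untouched. */
static uint64_t swap_double_words(uint64_t bits)
{
    return (bits << 32) | (bits >> 32);
}

/* Hypothetical helper: write a native (VFP-order) double as an
   FPA-order 8-byte memory image. */
static void double_to_fpa_image(double value, uint8_t out[8])
{
    uint64_t bits;
    assert(sizeof value == sizeof bits);  /* sketch assumes 64-bit doubles */
    memcpy(&bits, &value, sizeof bits);
    bits = swap_double_words(bits);
    memcpy(out, &bits, sizeof bits);
}

/* Hypothetical helper: read an FPA-order 8-byte memory image back into
   a native (VFP-order) double. */
static double fpa_image_to_double(const uint8_t in[8])
{
    uint64_t bits;
    double value;
    memcpy(&bits, in, sizeof bits);
    bits = swap_double_words(bits);
    memcpy(&value, &bits, sizeof value);
    return value;
}
```

Any assembler routine that reads or writes BASIC's floating point variables directly would need a conversion of this kind, or would silently see the two words in the wrong order, which is the breakage described in the post.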

Nov 3, 2011 6:51am Matthew Phillips (473) 721 posts	So perhaps we need a bounty for a “proper” program to be produced as a demonstrator of the possibilities. What would benefit most? Would FFmpeg or KinoAMP benefit from hardware floating point? Does anyone have any other suggestions? I just feel that now we’ve finally got hardware floating point for the first time since the ARM3, more should be done to exploit it.

Nov 3, 2011 8:22am Dave Higton (281) 668 posts	“Would FFmpeg or KinoAMP benefit from hardware floating point?”
I’d have thought that FFmpeg, KinoAMP and the like would benefit most from using the DSP. I’m probably not a mainstream user, but I can’t think of much that would make significant use of FP at all, other than media decoding, the latter perhaps being best performed on DSPs with the main CPU getting the stream and putting the decoded result to the screen. I may be wrong.

Nov 3, 2011 2:06pm Jeffrey Lee (213) 6048 posts	“I’d have thought that FFmpeg, KinoAMP and the like would benefit most from using the DSP.”
Yes, very true. Unfortunately DSP coprocessors are very machine-specific – although it might be possible to make the RISC OS side fairly generic, any code that runs on the DSP needs to be tailored to that specific machine. The DSP in the OMAP3 isn’t even compatible with the one in the OMAP4 (they’ve changed it from a generic fully programmable processor to a set of fixed-function components designed for decoding current video codecs like H264).
Something else that would result in a large performance boost to movie players would be an API to allow access to the YUV video overlay(s). Different machines might require slightly different pixel formats, but overall there’ll be much less variation compared to the DSPs, so it’ll be much easier to write code that will work with everything. This is something I might have done by now, if I wasn’t at a loss as to how to best extend GraphicsV (see here). Maybe things will be a bit clearer once I/we find out the capabilities of the Raspberry Pi.
But getting back to the topic of VFP/NEON… Yes, media encoders/decoders like FFmpeg and KinoAMP would benefit. FFmpeg will undoubtedly already contain code to use VFP/NEON, so all you’d need to do is change a few compiler options. KinoAMP will likely need more work, as I believe most/all of the existing code is hand-optimised assembler working in fixed point.
Games will benefit as well. An obvious example would be Quake 2, but it also wouldn’t surprise me if a few of the ports on riscos.info would benefit from floating point, since they’ll all have been ported from platforms where hardware floating point was the norm.
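To give a rough idea of the per-pixel work a YUV overlay would take off the CPU, here is a minimal C sketch of a software YUV to RGB conversion in fixed point. It assumes BT.601 video-range input and the widely used 8.8 integer approximation of the conversion coefficients. The function names and the 0x00RRGGBB output layout are illustrative assumptions only; this is not code from KinoAMP, FFmpeg or GraphicsV.

```c
#include <stdint.h>

/* Clamp an intermediate result to the 0..255 range of an 8-bit channel. */
static uint8_t clamp_u8(int v)
{
    if (v < 0) return 0;
    if (v > 255) return 255;
    return (uint8_t)v;
}

/* Convert one YUV (YCbCr) pixel to a packed 0x00RRGGBB word.
   Assumptions: BT.601 video range (Y 16..235, U/V centred on 128) and
   coefficients scaled by 256, e.g. 298/256 ~ 1.164, 409/256 ~ 1.596. */
static uint32_t yuv_to_rgb(uint8_t y, uint8_t u, uint8_t v)
{
    int c = (int)y - 16;
    int d = (int)u - 128;
    int e = (int)v - 128;

    /* +128 rounds to nearest before the divide by 256. */
    int r = (298 * c + 409 * e + 128) / 256;
    int g = (298 * c - 100 * d - 208 * e + 128) / 256;
    int b = (298 * c + 516 * d + 128) / 256;

    return ((uint32_t)clamp_u8(r) << 16)
         | ((uint32_t)clamp_u8(g) << 8)
         |  (uint32_t)clamp_u8(b);
}
```

A software player has to run something like this for every pixel of every frame, and usually scale the result as well. With an overlay the decoder could hand over the YUV planes as they are, which is why the gain could be large regardless of floating point support.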

Nov 3, 2011 3:24pm Trevor Johnson (329) 1645 posts	“Yes, media encoders/decoders like FFmpeg and KinoAMP would benefit.”
What about DigitalCD? Or does that use MAD or some other integer algorithm?

Nov 3, 2011 3:50pm André Timmermans (100) 655 posts	“Yes, media encoders/decoders like FFmpeg and KinoAMP would benefit.”
“What about DigitalCD? Or does that use MAD or some other integer algorithm?”
I had a quick look at the FFmpeg sources for NEON optimisations that could be relevant to KinoAmp or DigitalCD, and I only found a NEON version of the IDCT routine, so I guess the corresponding KinoAmp routine could be somewhat optimised. I suspect that KinoAmp would gain far more speed from having access to YUV overlays or non-blocking I/O APIs.
