Proposed GraphicsV enhancements
Steve Pampling (1551) 8154 posts |
Poke nemo to wake him up and there may be an opinion on what should be in place. Mind you, you did mention the magic “gamma curve” phrase:
Me? I’m just happy seeing things happen faster. |
Jeffrey Lee (213) 6048 posts |
I now have a rough prototype of this in BASIC, so expect to see it soon! |
Jon Abbott (1421) 2641 posts |
“(r1² + g1² + b1²) – (r2² + g2² + b2²) != ((r1-r2)² + (g1-g2)² + (b1-b2)²)” – it did in my head yesterday morning! LOL. “(abs(r1-r2) + abs(g1-g2) + abs(b1-b2))” is Manhattan distance and is a useful approximation. Might be worth trying on a 256-entry palette to see what the result is like. I did consider a tree structure, but thought it was probably going to be slower for 256 values. Perhaps the many variants of the equation need to be centralised and a user-controlled weighting put into the Screen configuration.
Considering the number of MULs going on, would a 256-entry SQR table help? It should all fit in the cache, although it would be slow initially.
Screen caching will make a big difference – the “chocolate” code is still in the source, from memory. The OS_ScreenMode call to turn it off doesn’t work though, not in RO4 at any rate, and I doubt it’s been touched since then. IIRC it put the screen memory into another domain and watched aborts to know if a cache flush was needed at VSync – a nasty hack to stop all the visual problems screen caching causes. |
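For reference, a rough C sketch of the two metrics being compared above – the palette layout and names are purely illustrative, not from any RISC OS source:

    #include <stdlib.h>

    typedef struct { unsigned char r, g, b; } rgb;

    /* Squared Euclidean distance: no square root needed when we only
       ever compare distances against each other. 3 MULs per candidate. */
    static int dist_sq(rgb a, rgb b)
    {
        int dr = a.r - b.r, dg = a.g - b.g, db = a.b - b.b;
        return dr*dr + dg*dg + db*db;
    }

    /* Manhattan distance: the cheaper approximation - no MULs at all. */
    static int dist_manhattan(rgb a, rgb b)
    {
        return abs(a.r - b.r) + abs(a.g - b.g) + abs(a.b - b.b);
    }

    /* Brute-force nearest match over a 256-entry palette. */
    static int nearest(rgb c, const rgb pal[256])
    {
        int best = 0, bestd = dist_sq(c, pal[0]);
        for (int i = 1; i < 256; i++) {
            int d = dist_sq(c, pal[i]);   /* or dist_manhattan(c, pal[i]) */
            if (d < bestd) { bestd = d; best = i; }
        }
        return best;
    }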
Jeffrey Lee (213) 6048 posts |
My gut says it’ll be about the same, but I haven’t looked at the instruction timings to be sure. One downside to using a lookup table is that you’d have to find a spare register to hold its address, which could be tricky for some of the code. |
Jon Abbott (1421) 2641 posts |
Pre-weight three SQR tables for R, G and B, and drop the weight-loading registers, to remove 3 MULs and get back 3 registers? You’d probably have to drop the table to 8-bit though, to keep the size down and increase the cache hit ratio, which would band low-intensity colours. You could use 16-bit values, but a 1.5KB table is probably pushing it speed-wise; 6 MULs may be quicker than 3 LDRHs due to the cache hit rate. “(abs(r1-r2) + abs(g1-g2) + abs(b1-b2))” might be a good compromise. |
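A sketch of the pre-weighted table idea in C – one 256-entry table per channel holding weight × difference², indexed by the absolute difference. Three tables of 16-bit entries is the 1.5KB mentioned above; the weights (2, 4, 3) and the scaling shift are illustrative assumptions, not values from the thread:

    #include <stdlib.h>

    static unsigned short wsq_r[256], wsq_g[256], wsq_b[256];

    static void init_tables(void)
    {
        for (int d = 0; d < 256; d++) {
            /* Scaled down by 4 so the largest weighted square
               (4 * 255 * 255) still fits in an unsigned 16-bit entry. */
            wsq_r[d] = (unsigned short)((2 * d * d) >> 2);
            wsq_g[d] = (unsigned short)((4 * d * d) >> 2);
            wsq_b[d] = (unsigned short)((3 * d * d) >> 2);
        }
    }

    /* Replaces the 3 MULs per candidate with 3 LDRH-class table loads. */
    static int dist_weighted(int r1, int g1, int b1, int r2, int g2, int b2)
    {
        return wsq_r[abs(r1 - r2)] + wsq_g[abs(g1 - g2)] + wsq_b[abs(b1 - b2)];
    }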
Jon Abbott (1421) 2641 posts |
Speaking of weighting, the ColourDistance function near the bottom of the page below looks like an interesting approximation of Luv and might be worth investigating: |
Jeffrey Lee (213) 6048 posts |
My implementation of the tiled OS_SpriteOp is now in CVS, along with Wimp and Pinboard updates to make use of it where possible. On the Iyonix and Pi it’ll make use of hardware acceleration, resulting in significant performance gains (particularly for the Iyonix – redrawing an empty desktop in 16M colours now takes 4cs instead of 50cs, and the “window full of files” test in 16M colours has dropped from 70cs to 20cs).
However, on OMAP (or at least on a BB-xM) I found that the hardware acceleration made things slower – the DMA engine/memory bus just doesn’t seem to be designed for high-throughput transfers. So on OMAP (and IOMD) it avoids trying to use hardware acceleration and just renders all the sprites manually.
I also found a couple of bugs with sprite rendering in general, which were causing some common sprite operations/types to render significantly slower than they should. |
Sprow (202) 1155 posts |
Should the “EXIT VC” be outside the “standalonemessages” switch? Otherwise it’d plot twice – once via the tiled sprite op, then again by falling through (in the ROM case). |
Jeffrey Lee (213) 6048 posts |
Well spotted! For the ROM case it was actually meant to be exiting immediately, under the assumption that OS_SpriteOp 65 would always be available and therefore retrying with a different sprite op would be futile and result in the same error. But you’ve also led me to a deeper issue – the code was relying on r1 being a pointer to the redraw block, despite the source comments indicating that the function takes no arguments. So for the ROM case it looks like it was successfully drawing using OS_SpriteOp 65 and then skipping the manual redraw loop due to using a duff redraw block pointer. |
Rick Murray (539) 13806 posts |
Oh, well… That’ll be TI’s wonderful graphics capabilities. <mumble> <mumble> <mumble> |
Jeffrey Lee (213) 6048 posts |
Doug: Re: Jeffrey’s submission 15th Dec. I’ve now added a new *Configure item, *Configure NVidia. This will show up in tomorrow’s ROM and allow you to control the red/blue swapping that the driver/OS is performing. E.g.:
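(Judging from the options referred to in the replies, something along these lines – the exact syntax is an assumption:)

    *Configure NVidia -auto
    *Configure NVidia -manual -swap
    *Configure NVidia -device 123 -manual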
The settings are stored in CMOS on a per-card basis, so if you’ve got multiple cards which need different settings you can find the PCI device number (the first column in *PCIDevices) and then do something like ‘*Configure NVidia -device 123 -auto’ to change its settings (*Configure NVidia without specifying the device will set the settings for all cards). Also note that new settings will only take effect after a mode change.
At some point I’ll be updating the screen setup plugin in !Configure to allow these settings to be controlled, but I’m not sure if that will be before or after RISC OS 5.22 is released. However, there shouldn’t be any more ROM changes needed to get the plugin working, so it doesn’t matter too much if it’s only available once 5.22 is released. |
Doug Webb (190) 1158 posts |
Hi Jeffrey,
Thanks for the update. With my non-hardware-swapped-colours card I get the following:
With it set to -manual and Geminus loaded, 256 and 16M colours are OK, but 64K colours still have a funky look and 32K are swapped.
With the additional manual -swap options, I correctly get 256 colours swapped but, as expected, 64K is still garbled/funky colours, 32K and 16M are swapped colours, and the JPEG background is garbled in monochrome colours.
So I have a number of options due to the update, but I think I’ll stick with built-in swapping as it seems more consistent with my non-hardware-modified card, though I lose the JPEG acceleration.
Thanks once again for the update and, in general, for the great additions you have made to RISC OS. |
Jeffrey Lee (213) 6048 posts |
As requested quite a while ago, I’ve now updated the GraphicsV documentation with a summary of the contexts under which each driver entry point may be called. Most of this is based around how the OS calls the drivers – basically anything which the OS calls from an interrupt handler is flagged as a background call, and all the others I’ve flagged as being foreground-only in order to ease driver implementation. Let me know if you spot any glaring errors, or think things could do with being explained a bit clearer. |
Malcolm Hussain-Gambles (1596) 811 posts |
Thanks for that Jeffrey. I read your post thinking “yeah, who’s going to be able to actually read and understand that ****”. Surprisingly enough, I’m one of them, although I’ll admit I don’t understand how it all hangs together – which is entirely down to my lack of time to read it properly. |
Jon Abbott (1421) 2641 posts |
Looks good. My only suggestion would be to detail whether IRQs are enabled or disabled during the calls. What caused me lots of issues was the fact that GraphicsV 2 executes with IRQs enabled – games were crashing when writing to screen memory whilst it was being remapped, if they updated the screen via an IRQ. |
Jeffrey Lee (213) 6048 posts |
Done. Basically:
|
Jeffrey Lee (213) 6048 posts |
There are a couple of issues coming to light due to the EDID bounty that could do with some discussion here.
Interlace revisited
At the moment we don’t really have any drivers which honour the interlaced flag properly, and in the past I’ve been pretty confused as to why we’ve got two ways of specifying interlace – one via a control list setting, and one via the sync polarity flags (https://www.riscosopen.org/forum/forums/3/topics/309#posts-23312). With us about to start pulling interlaced mode definitions from EDID, we need to come up with some hard and fast rules as to how interlace should and shouldn’t be handled. E.g. if a display supports 1080i but not 1080p (or vice versa) then we need to make sure the driver honours the interlace flag it’s given, and doesn’t start doing its own thing like doubling the pixel rate and vertical timings to convert an interlaced mode to a progressive one. As far as I can see, we need to decide what to do about the following:
EDID extension blocks
To read anything beyond the first 256 bytes of EDID you need to do an IIC write to a device at a different address from the main EDID EPROM, in order to select the bank that the EPROM will return for future reads. We don’t have any clear way of handling this with the current GraphicsV IIC API – if it becomes the driver’s responsibility to write to the bank register then the driver will end up doing much more than just accessing the IIC address that the caller provides. On the other hand, if it’s the caller’s responsibility to write to the bank register then there’s the possibility of conflict if two callers try to access different banks.
The lower-level OS_IICOp API doesn’t have this problem, as you’re able to provide your own sequence of bus transactions which other callers won’t be able to interrupt. But with the GraphicsV API the only operations that are possible are the basic write-read or write-write sequences that are used to access the EPROM-like device that’s assumed to be on the other end of the bus.
Personally I’d like to see the current API replaced (or extended) to support a lower-level, OS_IICOp-like API. But as we currently make light use of GraphicsV IIC ops, I’m happy with a quick-fix interim solution of making it the caller’s responsibility to set the bank register correctly.
Anyone else have any thoughts on the above two topics? |
Dave Higton (1515) 3497 posts |
I have experience with IIC but not EDID. If you simply keep reading, you read through the entire device, don’t you? (Writes wrap within the write page; reads don’t, in general.) So why is it necessary to write an address at all? Assuming that it is necessary, though, I’d suggest adding a function to set the read base address, rather than adding a function with arbitrary write capability. The idea is to limit the damage – blitzing the EDID info – that can be done by an inexpert call. |
Jeffrey Lee (213) 6048 posts |
Because that’s what the spec says! Basically, the EPROM at bus address &50 only allows access to up to 256 bytes of data at a time, using the standard protocol (write one byte containing the start address, then read/write N bytes to access the data). For devices with more than 256 bytes of EDID there’s a second device at address &30 which has a ‘segment pointer’ register that controls which 256-byte page you’re accessing. Also, after double-checking the spec, it doesn’t look like it is possible to use the current GraphicsV API to program the segment pointer – it doesn’t use the standard register-based addressing scheme that most IIC devices use, and it needs to be a repeated-start transfer because the value resets to zero at the end of each transaction.
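A C sketch of the bus sequence the spec describes, using hypothetical iic_start/iic_write_byte/iic_read_byte/iic_stop primitives as stand-ins for a driver’s low-level bus access – only the protocol itself is from the spec:

    /* Hypothetical low-level IIC primitives - not a real RISC OS API. */
    extern void iic_start(void);                     /* (repeated) start  */
    extern void iic_stop(void);
    extern int  iic_write_byte(unsigned char b);     /* returns ack state */
    extern unsigned char iic_read_byte(int ack);     /* ack = more wanted */

    /* Read 128 bytes of EDID from 'offset' within 256-byte page 'segment'. */
    void read_edid(int segment, unsigned char offset, unsigned char buf[128])
    {
        if (segment != 0) {
            iic_start();
            iic_write_byte(0x60);      /* segment pointer device, &30 write */
            iic_write_byte((unsigned char)segment);
            /* No stop here: the register resets to zero at the end of a
               transaction, so we must chain straight into the EDID read
               with a repeated start. */
        }
        iic_start();
        iic_write_byte(0xA0);          /* EDID EPROM, &50 write */
        iic_write_byte(offset);        /* start address within the page */
        iic_start();                   /* repeated start */
        iic_write_byte(0xA1);          /* EDID EPROM, &50 read */
        for (int i = 0; i < 128; i++)
            buf[i] = iic_read_byte(i < 127);   /* NAK the final byte */
        iic_stop();
    }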
Good point, I’ve been bitten by that in the past. So it looks like the options are:
AFAIK EDID is the only interesting thing available via DDC (there’s DDC/CI, but it doesn’t appear to be widely supported by monitors). So although a full OS_IICOp-like interface would be nice, it probably isn’t worth implementing, and we’d be better off going down the route of option 1. |
Dave Higton (1515) 3497 posts |
Disallow them. There is no reason ever to write anything other than a segment pointer to the EDID device. All anything else will do is cause damage to the information stored there. A GraphicsV_IICOp operation can by all means specify the segment address, and handle writing it internally. |
Sprow (202) 1155 posts |
If (1) is chosen, in GraphicsV 14 the offset is handily defined as 16 bit, so the page register poke could happen by magic and the caller could assume the address space is flat; or bits 16-23 could accept writes at 0x30; or one of the 8 reserved bits could be used to flag something or other. Options, options. |
Jeffrey Lee (213) 6048 posts |
I’d be in favour of simply specifying it in bits 0-15. Requiring two separate calls (one to set the segment pointer, one to read the EDID) won’t really work due to the way the segment pointer is implemented in hardware. So it would be a case of ‘if bits 16-23 == 0xa1 and bits 8-15 != 0 then write the segment pointer at the start of the transfer’. |
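In C terms, the decode being proposed would presumably look something like this – the field layout is my reading of the two posts above, not the documented GraphicsV 14 API:

    /* Unpack the GraphicsV 14 address word: bits 0-15 a flat EDID offset
       (low byte = offset within page, high byte = DDC segment), bits 16-23
       the 8-bit IIC address. Assumed layout, for illustration only. */
    void iic_op(unsigned int addr_word)
    {
        unsigned int offset  = addr_word & 0xFF;
        unsigned int segment = (addr_word >> 8) & 0xFF;
        unsigned int address = (addr_word >> 16) & 0xFF;

        if (address == 0xA1 && segment != 0) {
            /* Write the segment pointer to &30 at the start of the
               transfer, chained by a repeated start into the EDID read. */
        }
        /* ...then perform the transfer to 'address'/'offset' as before... */
        (void)offset;
    }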
Rick Murray (539) 13806 posts |
+1
Tale of woe. I normally run my Pi hooked to an analogue monitor (1280×1024) by way of an HDMI→VGA adaptor. There’s a tiny MCU in the adaptor that reads status information via the IIC lines of the monitor cable (I presume it transposes these into an EDID block). So the other weekend, I hook up a crappy Vista machine. The owner, a woman from work, uses MSIE, has no anti-virus, and complains that her mouse freezes when she visits Facebook. Somehow it was decided that I was the best guy to fix it (or maybe my French and/or politeness is not up to replying “casse toi!” – “get lost!”). Whatever. I plug the Pi back in.
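For reference, the usual config.txt lines for forcing that mode would be something like the following (assuming hdmi_mode 36 is the 1280×1024@75 DMT entry):

    hdmi_group=2
    hdmi_mode=36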
This is supposed to force a DMT mode that is 1280×1024 at 75Hz. Only, it looks as if the GPU is receiving something completely different from the monitor. I power everything down, leave it for fifteen minutes, then power up. 800×600. I hate Microsoft. Thankfully there is a “fix”. I need to set my Pi’s configuration as follows:
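Presumably the same two lines plus the two documented overrides (0xa5000080 being the magic “ignore EDID” value):

    hdmi_group=2
    hdmi_mode=36
    hdmi_ignore_edid=0xa5000080
    hdmi_force_hotplug=1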
This instructs the GPU to ignore any EDID that is being read, and to specify that an HDMI display is connected. Power up like that and I’m back to RISC OS in a 1280×1024 mode. Comment out the additional two lines, reboot, and it stays in 1280×1024 – so I guess either the monitor has sorted itself out, or the Pi’s GPU has updated the EDID? I kind of wish the EPROM used was an old-fashioned one so I could just pop out the R/!W leg and pull it high, so stuff leaves the settings the hell alone.
Maybe this weekend I have to put that Vista box back onto the network (which means I’ll be stuck with just the iPad – no way I’m having that on the intranet at the same time as any of my PCs) to install Avast! and Firefox. If the mouse doesn’t mess up, I’ll send the machine back. I’d like to run a check on the box using a Pendrive Linux with A/V, but the box is too dumb to know how to boot off of that sort of media. Pffft!
Can anybody here (given what I have described above) come up with a useful real-world case for having the EDID writeable?¹
¹ Discount the case of “manufacturer is an idiot and messed up all the timings”; this is a production fault at best. It shouldn’t be up to random operating systems to “fix” what they perceive as being incorrect. |
Chris Evans (457) 1614 posts |
I’ve just realised that for our Pi Laptop project it will be a lot easier for us and for users if we include an EDID EEPROM on the HDMI bus and are able to write to it. So please include a way of writing to it. |
Dave Higton (1515) 3497 posts |
Chris: you, as manufacturer, are in a different position from all the users. I believe the correct way forward is:
|