2014-12-09 00:09:10 +08:00
/*
* Copyright © 2014 Intel Corporation
*
* Permission is hereby granted , free of charge , to any person obtaining a
* copy of this software and associated documentation files ( the " Software " ) ,
* to deal in the Software without restriction , including without limitation
* the rights to use , copy , modify , merge , publish , distribute , sublicense ,
* and / or sell copies of the Software , and to permit persons to whom the
* Software is furnished to do so , subject to the following conditions :
*
* The above copyright notice and this permission notice ( including the next
* paragraph ) shall be included in all copies or substantial portions of the
* Software .
*
* THE SOFTWARE IS PROVIDED " AS IS " , WITHOUT WARRANTY OF ANY KIND , EXPRESS OR
* IMPLIED , INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY ,
* FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT . IN NO EVENT SHALL
* THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM , DAMAGES OR OTHER
* LIABILITY , WHETHER IN AN ACTION OF CONTRACT , TORT OR OTHERWISE , ARISING
* FROM , OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER
* DEALINGS IN THE SOFTWARE .
*/
2014-12-08 22:46:31 +08:00
/**
* DOC : Frame Buffer Compression ( FBC )
*
* FBC tries to save memory bandwidth ( and so power consumption ) by
* compressing the amount of memory used by the display . It is total
* transparent to user space and completely handled in the kernel .
2014-12-09 00:09:10 +08:00
*
* The benefits of FBC are mostly visible with solid backgrounds and
2014-12-08 22:46:31 +08:00
* variation - less patterns . It comes from keeping the memory footprint small
* and having fewer memory pages opened and accessed for refreshing the display .
2014-12-09 00:09:10 +08:00
*
2014-12-08 22:46:31 +08:00
* i915 is responsible to reserve stolen memory for FBC and configure its
* offset on proper registers . The hardware takes care of all
* compress / decompress . However there are many known cases where we have to
* forcibly disable it to allow proper screen updates .
2014-12-09 00:09:10 +08:00
*/
2014-12-08 22:46:31 +08:00
# include "intel_drv.h"
# include "i915_drv.h"
2015-09-23 23:52:27 +08:00
static inline bool fbc_supported ( struct drm_i915_private * dev_priv )
{
2015-10-15 04:45:36 +08:00
return dev_priv - > fbc . activate ! = NULL ;
2015-09-23 23:52:27 +08:00
}
2015-11-05 03:10:46 +08:00
static inline bool fbc_on_pipe_a_only ( struct drm_i915_private * dev_priv )
{
return IS_HASWELL ( dev_priv ) | | INTEL_INFO ( dev_priv ) - > gen > = 8 ;
}
2015-10-17 04:55:40 +08:00
static inline bool fbc_on_plane_a_only ( struct drm_i915_private * dev_priv )
{
return INTEL_INFO ( dev_priv ) - > gen < 4 ;
}
2015-09-15 02:20:03 +08:00
/*
* In some platforms where the CRTC ' s x : 0 / y : 0 coordinates doesn ' t match the
* frontbuffer ' s x : 0 / y : 0 coordinates we lie to the hardware about the plane ' s
* origin so the x and y offsets can actually fit the registers . As a
* consequence , the fence doesn ' t really start exactly at the display plane
* address we program because it starts at the real start of the buffer , so we
* have to take this into consideration here .
*/
static unsigned int get_crtc_fence_y_offset ( struct intel_crtc * crtc )
{
return crtc - > base . y - crtc - > adjusted_y ;
}
drm/i915: alloc/free the FBC CFB during enable/disable
One of the problems with the current code is that it frees the CFB and
releases its drm_mm node as soon as we flip FBC's enable bit. This is
bad because after we disable FBC the hardware may still use the CFB
for the rest of the frame, so in theory we should only release the
drm_mm node one frame after we disable FBC. Otherwise, a stolen memory
allocation done right after an FBC disable may result in either
corrupted memory for the new owner of that memory region or corrupted
screen/underruns in case the new owner changes it while the hardware
is still reading it. This case is not exactly easy to reproduce since
we currently don't do a lot of stolen memory allocations, but I see
patches on the mailing list trying to expose stolen memory to user
space, so races will be possible.
I thought about three different approaches to solve this, and they all
have downsides.
The first approach would be to simply use multiple drm_mm nodes and
freeing the unused ones only after a frame has passed. The problem
with this approach is that since stolen memory is rather small,
there's a risk we just won't be able to allocate a new CFB from stolen
if the previous one was not freed yet. This could happen in case we
quickly disable FBC from pipe A and decide to enable it on pipe B, or
just if we change pipe A's fb stride while FBC is enabled.
The second approach would be similar to the first one, but maintaining
a single drm_mm node and keeping track of when it can be reused. This
would remove the disadvantage of not having enough space for two
nodes, but would create the new problem where we may not be able to
enable FBC at the point intel_fbc_update() is called, so we would have
to add more code to retry updating FBC after the time has passed. And
that can quickly get too complex since we can get invalidate, flush,
disable and other calls in the middle of the wait.
Both solutions above - and also the current code - have the problem
that we unnecessarily free+realloc FBC during invalidate+flush
operations even if the CFB size doesn't change.
The third option would be to move the allocation/deallocation to
enable/disable. This makes sure that the pipe is always disabled when
we allocate/deallocate the CFB, so there's no risk that the FBC
hardware may read or write to the memory right after it is freed from
drm_mm. The downside is that it is possible for user space to change
the buffer stride without triggering a disable/enable - only
deactivate/activate -, so we'll have to handle this case somehow - see
igt's kms_frontbuffer_tracking test, fbc-stridechange subtest. It
could be possible to implement a way to free+alloc the CFB during said
stride change, but it would involve a lot of book-keeping - exactly as
mentioned above - just for on case, so for now I'll keep it simple and
just deactivate FBC. Besides, we may not even need to disable FBC
since we do CFB over-allocation.
Note from Chris: "Starting a fullscreen client that covers a single
monitor in a multi-monitor setup will trigger a change in stride on
one of the CRTCs (the monitors will be flipped independently).". It
shouldn't be a huge problem if we lose FBC on multi-monitor setups
since these setups already have problems reaching deep PC states
anyway.
v2: Rebase after changing the patch order.
v3:
- Remove references to the stride change case being "uncommon" and
paste Chris' example.
- Rebase after a change in a previous patch.
Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk>
Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com>
Link: http://patchwork.freedesktop.org/patch/msgid/
2015-10-16 01:19:21 +08:00
/*
* For SKL + , the plane source size used by the hardware is based on the value we
* write to the PLANE_SIZE register . For BDW - , the hardware looks at the value
* we wrote to PIPESRC .
*/
static void intel_fbc_get_plane_source_size ( struct intel_crtc * crtc ,
int * width , int * height )
{
struct intel_plane_state * plane_state =
to_intel_plane_state ( crtc - > base . primary - > state ) ;
int w , h ;
if ( intel_rotation_90_or_270 ( plane_state - > base . rotation ) ) {
w = drm_rect_height ( & plane_state - > src ) > > 16 ;
h = drm_rect_width ( & plane_state - > src ) > > 16 ;
} else {
w = drm_rect_width ( & plane_state - > src ) > > 16 ;
h = drm_rect_height ( & plane_state - > src ) > > 16 ;
}
if ( width )
* width = w ;
if ( height )
* height = h ;
}
static int intel_fbc_calculate_cfb_size ( struct intel_crtc * crtc ,
struct drm_framebuffer * fb )
{
struct drm_i915_private * dev_priv = crtc - > base . dev - > dev_private ;
int lines ;
intel_fbc_get_plane_source_size ( crtc , NULL , & lines ) ;
if ( INTEL_INFO ( dev_priv ) - > gen > = 7 )
lines = min ( lines , 2048 ) ;
/* Hardware needs the full buffer stride, not just the active area. */
return lines * fb - > pitches [ 0 ] ;
}
2015-10-15 04:45:36 +08:00
static void i8xx_fbc_deactivate ( struct drm_i915_private * dev_priv )
2014-12-09 00:09:10 +08:00
{
u32 fbc_ctl ;
2015-10-15 04:45:36 +08:00
dev_priv - > fbc . active = false ;
2014-12-09 00:09:10 +08:00
/* Disable compression */
fbc_ctl = I915_READ ( FBC_CONTROL ) ;
if ( ( fbc_ctl & FBC_CTL_EN ) = = 0 )
return ;
fbc_ctl & = ~ FBC_CTL_EN ;
I915_WRITE ( FBC_CONTROL , fbc_ctl ) ;
/* Wait for compressing bit to clear */
if ( wait_for ( ( I915_READ ( FBC_STATUS ) & FBC_STAT_COMPRESSING ) = = 0 , 10 ) ) {
DRM_DEBUG_KMS ( " FBC idle timed out \n " ) ;
return ;
}
}
2015-12-24 04:28:11 +08:00
static void i8xx_fbc_activate ( struct drm_i915_private * dev_priv )
2014-12-09 00:09:10 +08:00
{
2015-12-24 04:28:11 +08:00
struct intel_fbc_reg_params * params = & dev_priv - > fbc . params ;
2014-12-09 00:09:10 +08:00
int cfb_pitch ;
int i ;
u32 fbc_ctl ;
2015-10-15 04:45:36 +08:00
dev_priv - > fbc . active = true ;
2014-12-09 00:09:10 +08:00
2015-02-05 18:04:27 +08:00
/* Note: fbc.threshold == 1 for i8xx */
2015-12-24 04:28:11 +08:00
cfb_pitch = params - > cfb_size / FBC_LL_SIZE ;
if ( params - > fb . stride < cfb_pitch )
cfb_pitch = params - > fb . stride ;
2014-12-09 00:09:10 +08:00
/* FBC_CTL wants 32B or 64B units */
2015-07-08 02:26:04 +08:00
if ( IS_GEN2 ( dev_priv ) )
2014-12-09 00:09:10 +08:00
cfb_pitch = ( cfb_pitch / 32 ) - 1 ;
else
cfb_pitch = ( cfb_pitch / 64 ) - 1 ;
/* Clear old tags */
for ( i = 0 ; i < ( FBC_LL_SIZE / 32 ) + 1 ; i + + )
2015-09-19 01:03:18 +08:00
I915_WRITE ( FBC_TAG ( i ) , 0 ) ;
2014-12-09 00:09:10 +08:00
2015-07-08 02:26:04 +08:00
if ( IS_GEN4 ( dev_priv ) ) {
2014-12-09 00:09:10 +08:00
u32 fbc_ctl2 ;
/* Set it up... */
fbc_ctl2 = FBC_CTL_FENCE_DBL | FBC_CTL_IDLE_IMM | FBC_CTL_CPU_FENCE ;
2015-12-24 04:28:11 +08:00
fbc_ctl2 | = FBC_CTL_PLANE ( params - > crtc . plane ) ;
2014-12-09 00:09:10 +08:00
I915_WRITE ( FBC_CONTROL2 , fbc_ctl2 ) ;
2015-12-24 04:28:11 +08:00
I915_WRITE ( FBC_FENCE_OFF , params - > crtc . fence_y_offset ) ;
2014-12-09 00:09:10 +08:00
}
/* enable it... */
fbc_ctl = I915_READ ( FBC_CONTROL ) ;
fbc_ctl & = 0x3fff < < FBC_CTL_INTERVAL_SHIFT ;
fbc_ctl | = FBC_CTL_EN | FBC_CTL_PERIODIC ;
2015-07-08 02:26:04 +08:00
if ( IS_I945GM ( dev_priv ) )
2014-12-09 00:09:10 +08:00
fbc_ctl | = FBC_CTL_C3_IDLE ; /* 945 needs special SR handling */
fbc_ctl | = ( cfb_pitch & 0xff ) < < FBC_CTL_STRIDE_SHIFT ;
2015-12-24 04:28:11 +08:00
fbc_ctl | = params - > fb . fence_reg ;
2014-12-09 00:09:10 +08:00
I915_WRITE ( FBC_CONTROL , fbc_ctl ) ;
}
2015-10-15 04:45:36 +08:00
static bool i8xx_fbc_is_active ( struct drm_i915_private * dev_priv )
2014-12-09 00:09:10 +08:00
{
return I915_READ ( FBC_CONTROL ) & FBC_CTL_EN ;
}
2015-12-24 04:28:11 +08:00
static void g4x_fbc_activate ( struct drm_i915_private * dev_priv )
2014-12-09 00:09:10 +08:00
{
2015-12-24 04:28:11 +08:00
struct intel_fbc_reg_params * params = & dev_priv - > fbc . params ;
2014-12-09 00:09:10 +08:00
u32 dpfc_ctl ;
2015-10-15 04:45:36 +08:00
dev_priv - > fbc . active = true ;
2014-12-09 00:09:10 +08:00
2015-12-24 04:28:11 +08:00
dpfc_ctl = DPFC_CTL_PLANE ( params - > crtc . plane ) | DPFC_SR_EN ;
if ( drm_format_plane_cpp ( params - > fb . pixel_format , 0 ) = = 2 )
2014-12-09 00:09:10 +08:00
dpfc_ctl | = DPFC_CTL_LIMIT_2X ;
else
dpfc_ctl | = DPFC_CTL_LIMIT_1X ;
2015-12-24 04:28:11 +08:00
dpfc_ctl | = DPFC_CTL_FENCE_EN | params - > fb . fence_reg ;
2014-12-09 00:09:10 +08:00
2015-12-24 04:28:11 +08:00
I915_WRITE ( DPFC_FENCE_YOFF , params - > crtc . fence_y_offset ) ;
2014-12-09 00:09:10 +08:00
/* enable it... */
I915_WRITE ( DPFC_CONTROL , dpfc_ctl | DPFC_CTL_EN ) ;
}
2015-10-15 04:45:36 +08:00
static void g4x_fbc_deactivate ( struct drm_i915_private * dev_priv )
2014-12-09 00:09:10 +08:00
{
u32 dpfc_ctl ;
2015-10-15 04:45:36 +08:00
dev_priv - > fbc . active = false ;
2014-12-09 00:09:10 +08:00
/* Disable compression */
dpfc_ctl = I915_READ ( DPFC_CONTROL ) ;
if ( dpfc_ctl & DPFC_CTL_EN ) {
dpfc_ctl & = ~ DPFC_CTL_EN ;
I915_WRITE ( DPFC_CONTROL , dpfc_ctl ) ;
}
}
2015-10-15 04:45:36 +08:00
static bool g4x_fbc_is_active ( struct drm_i915_private * dev_priv )
2014-12-09 00:09:10 +08:00
{
return I915_READ ( DPFC_CONTROL ) & DPFC_CTL_EN ;
}
2015-11-05 03:10:45 +08:00
/* This function forces a CFB recompression through the nuke operation. */
static void intel_fbc_recompress ( struct drm_i915_private * dev_priv )
2014-12-09 00:09:10 +08:00
{
2015-02-14 03:23:46 +08:00
I915_WRITE ( MSG_FBC_REND_STATE , FBC_REND_NUKE ) ;
POSTING_READ ( MSG_FBC_REND_STATE ) ;
2014-12-09 00:09:10 +08:00
}
2015-12-24 04:28:11 +08:00
static void ilk_fbc_activate ( struct drm_i915_private * dev_priv )
2014-12-09 00:09:10 +08:00
{
2015-12-24 04:28:11 +08:00
struct intel_fbc_reg_params * params = & dev_priv - > fbc . params ;
2014-12-09 00:09:10 +08:00
u32 dpfc_ctl ;
2015-06-30 21:53:05 +08:00
int threshold = dev_priv - > fbc . threshold ;
2014-12-09 00:09:10 +08:00
2015-10-15 04:45:36 +08:00
dev_priv - > fbc . active = true ;
2014-12-09 00:09:10 +08:00
2015-12-24 04:28:11 +08:00
dpfc_ctl = DPFC_CTL_PLANE ( params - > crtc . plane ) ;
if ( drm_format_plane_cpp ( params - > fb . pixel_format , 0 ) = = 2 )
2015-06-30 21:53:05 +08:00
threshold + + ;
2014-12-09 00:09:10 +08:00
2015-06-30 21:53:05 +08:00
switch ( threshold ) {
2014-12-09 00:09:10 +08:00
case 4 :
case 3 :
dpfc_ctl | = DPFC_CTL_LIMIT_4X ;
break ;
case 2 :
dpfc_ctl | = DPFC_CTL_LIMIT_2X ;
break ;
case 1 :
dpfc_ctl | = DPFC_CTL_LIMIT_1X ;
break ;
}
dpfc_ctl | = DPFC_CTL_FENCE_EN ;
2015-07-08 02:26:04 +08:00
if ( IS_GEN5 ( dev_priv ) )
2015-12-24 04:28:11 +08:00
dpfc_ctl | = params - > fb . fence_reg ;
2014-12-09 00:09:10 +08:00
2015-12-24 04:28:11 +08:00
I915_WRITE ( ILK_DPFC_FENCE_YOFF , params - > crtc . fence_y_offset ) ;
I915_WRITE ( ILK_FBC_RT_BASE , params - > fb . ggtt_offset | ILK_FBC_RT_VALID ) ;
2014-12-09 00:09:10 +08:00
/* enable it... */
I915_WRITE ( ILK_DPFC_CONTROL , dpfc_ctl | DPFC_CTL_EN ) ;
2015-07-08 02:26:04 +08:00
if ( IS_GEN6 ( dev_priv ) ) {
2014-12-09 00:09:10 +08:00
I915_WRITE ( SNB_DPFC_CTL_SA ,
2015-12-24 04:28:11 +08:00
SNB_CPU_FENCE_ENABLE | params - > fb . fence_reg ) ;
I915_WRITE ( DPFC_CPU_FENCE_OFFSET , params - > crtc . fence_y_offset ) ;
2014-12-09 00:09:10 +08:00
}
2015-11-05 03:10:45 +08:00
intel_fbc_recompress ( dev_priv ) ;
2014-12-09 00:09:10 +08:00
}
2015-10-15 04:45:36 +08:00
static void ilk_fbc_deactivate ( struct drm_i915_private * dev_priv )
2014-12-09 00:09:10 +08:00
{
u32 dpfc_ctl ;
2015-10-15 04:45:36 +08:00
dev_priv - > fbc . active = false ;
2014-12-09 00:09:10 +08:00
/* Disable compression */
dpfc_ctl = I915_READ ( ILK_DPFC_CONTROL ) ;
if ( dpfc_ctl & DPFC_CTL_EN ) {
dpfc_ctl & = ~ DPFC_CTL_EN ;
I915_WRITE ( ILK_DPFC_CONTROL , dpfc_ctl ) ;
}
}
2015-10-15 04:45:36 +08:00
static bool ilk_fbc_is_active ( struct drm_i915_private * dev_priv )
2014-12-09 00:09:10 +08:00
{
return I915_READ ( ILK_DPFC_CONTROL ) & DPFC_CTL_EN ;
}
2015-12-24 04:28:11 +08:00
static void gen7_fbc_activate ( struct drm_i915_private * dev_priv )
2014-12-09 00:09:10 +08:00
{
2015-12-24 04:28:11 +08:00
struct intel_fbc_reg_params * params = & dev_priv - > fbc . params ;
2014-12-09 00:09:10 +08:00
u32 dpfc_ctl ;
2015-06-30 21:53:05 +08:00
int threshold = dev_priv - > fbc . threshold ;
2014-12-09 00:09:10 +08:00
2015-10-15 04:45:36 +08:00
dev_priv - > fbc . active = true ;
2014-12-09 00:09:10 +08:00
2015-06-13 01:36:21 +08:00
dpfc_ctl = 0 ;
2015-07-08 02:26:04 +08:00
if ( IS_IVYBRIDGE ( dev_priv ) )
2015-12-24 04:28:11 +08:00
dpfc_ctl | = IVB_DPFC_CTL_PLANE ( params - > crtc . plane ) ;
2015-06-13 01:36:21 +08:00
2015-12-24 04:28:11 +08:00
if ( drm_format_plane_cpp ( params - > fb . pixel_format , 0 ) = = 2 )
2015-06-30 21:53:05 +08:00
threshold + + ;
2014-12-09 00:09:10 +08:00
2015-06-30 21:53:05 +08:00
switch ( threshold ) {
2014-12-09 00:09:10 +08:00
case 4 :
case 3 :
dpfc_ctl | = DPFC_CTL_LIMIT_4X ;
break ;
case 2 :
dpfc_ctl | = DPFC_CTL_LIMIT_2X ;
break ;
case 1 :
dpfc_ctl | = DPFC_CTL_LIMIT_1X ;
break ;
}
dpfc_ctl | = IVB_DPFC_CTL_FENCE_EN ;
if ( dev_priv - > fbc . false_color )
dpfc_ctl | = FBC_CTL_FALSE_COLOR ;
2015-07-08 02:26:04 +08:00
if ( IS_IVYBRIDGE ( dev_priv ) ) {
2014-12-09 00:09:10 +08:00
/* WaFbcAsynchFlipDisableFbcQueue:ivb */
I915_WRITE ( ILK_DISPLAY_CHICKEN1 ,
I915_READ ( ILK_DISPLAY_CHICKEN1 ) |
ILK_FBCQ_DIS ) ;
2015-09-15 02:20:01 +08:00
} else if ( IS_HASWELL ( dev_priv ) | | IS_BROADWELL ( dev_priv ) ) {
2014-12-09 00:09:10 +08:00
/* WaFbcAsynchFlipDisableFbcQueue:hsw,bdw */
2015-12-24 04:28:11 +08:00
I915_WRITE ( CHICKEN_PIPESL_1 ( params - > crtc . pipe ) ,
I915_READ ( CHICKEN_PIPESL_1 ( params - > crtc . pipe ) ) |
2014-12-09 00:09:10 +08:00
HSW_FBCQ_DIS ) ;
}
2015-09-15 02:20:00 +08:00
I915_WRITE ( ILK_DPFC_CONTROL , dpfc_ctl | DPFC_CTL_EN ) ;
2014-12-09 00:09:10 +08:00
I915_WRITE ( SNB_DPFC_CTL_SA ,
2015-12-24 04:28:11 +08:00
SNB_CPU_FENCE_ENABLE | params - > fb . fence_reg ) ;
I915_WRITE ( DPFC_CPU_FENCE_OFFSET , params - > crtc . fence_y_offset ) ;
2014-12-09 00:09:10 +08:00
2015-11-05 03:10:45 +08:00
intel_fbc_recompress ( dev_priv ) ;
2014-12-09 00:09:10 +08:00
}
2014-12-08 22:46:31 +08:00
/**
2015-10-15 04:45:36 +08:00
* intel_fbc_is_active - Is FBC active ?
2015-07-08 02:26:04 +08:00
* @ dev_priv : i915 device instance
2014-12-08 22:46:31 +08:00
*
* This function is used to verify the current state of FBC .
* FIXME : This should be tracked in the plane config eventually
* instead of queried at runtime for most callers .
*/
2015-10-15 04:45:36 +08:00
bool intel_fbc_is_active ( struct drm_i915_private * dev_priv )
2014-12-09 00:09:10 +08:00
{
2015-10-15 04:45:36 +08:00
return dev_priv - > fbc . active ;
2014-12-09 00:09:10 +08:00
}
static void intel_fbc_work_fn ( struct work_struct * __work )
{
drm/i915: use a single intel_fbc_work struct
This was already on my TODO list, and was requested both by Chris and
Ville, for different reasons. The advantages are avoiding a frequent
malloc/free pair, and the locality of having the work structure
embedded in dev_priv. The maximum used memory is also smaller since
previously we could have multiple allocated intel_fbc_work structs at
the same time, and now we'll always have a single one - the one
embedded on dev_priv. Of course, we're now using a little more memory
on the cases where there's nothing scheduled.
The biggest challenge here is to keep everything synchronized the way
it was before.
Currently, when we try to activate FBC, we allocate a new
intel_fbc_work structure. Then later when we conclude we must delay
the FBC activation a little more, we allocate a new intel_fbc_work
struct, and then adjust dev_priv->fbc.fbc_work to point to the new
struct. So when the old work runs - at intel_fbc_work_fn() - it will
check that dev_priv->fbc.fbc_work points to something else, so it does
nothing. Everything is also protected by fbc.lock.
Just cancelling the old delayed work doesn't work because we might
just cancel it after the work function already started to run, but
while it is still waiting to grab fbc.lock. That's why we use the
"dev_priv->fbc.fbc_work == work" check described in the paragraph
above.
So now that we have a single work struct we have to introduce a new
way to synchronize everything. So we're making the work function a
normal work instead of a delayed work, and it will be responsible for
sleeping the appropriate amount of time itself. This way, after it
wakes up it can grab the lock, ask "were we delayed or cancelled?" and
then go back to sleep, enable FBC or give up.
v2:
- Spelling fixes.
- Rebase after changing the patch order.
- Fix ms/jiffies confusion.
Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk> (v1)
Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com>
Link: http://patchwork.freedesktop.org/patch/msgid/
2015-10-27 02:27:49 +08:00
struct drm_i915_private * dev_priv =
container_of ( __work , struct drm_i915_private , fbc . work . work ) ;
2016-01-12 03:44:36 +08:00
struct intel_fbc * fbc = & dev_priv - > fbc ;
struct intel_fbc_work * work = & fbc - > work ;
struct intel_crtc * crtc = fbc - > crtc ;
2016-01-22 04:03:05 +08:00
struct drm_vblank_crtc * vblank = & dev_priv - > dev - > vblank [ crtc - > pipe ] ;
if ( drm_crtc_vblank_get ( & crtc - > base ) ) {
DRM_ERROR ( " vblank not available for FBC on pipe %c \n " ,
pipe_name ( crtc - > pipe ) ) ;
2016-01-12 03:44:36 +08:00
mutex_lock ( & fbc - > lock ) ;
2016-01-22 04:03:05 +08:00
work - > scheduled = false ;
2016-01-12 03:44:36 +08:00
mutex_unlock ( & fbc - > lock ) ;
2016-01-22 04:03:05 +08:00
return ;
}
drm/i915: use a single intel_fbc_work struct
This was already on my TODO list, and was requested both by Chris and
Ville, for different reasons. The advantages are avoiding a frequent
malloc/free pair, and the locality of having the work structure
embedded in dev_priv. The maximum used memory is also smaller since
previously we could have multiple allocated intel_fbc_work structs at
the same time, and now we'll always have a single one - the one
embedded on dev_priv. Of course, we're now using a little more memory
on the cases where there's nothing scheduled.
The biggest challenge here is to keep everything synchronized the way
it was before.
Currently, when we try to activate FBC, we allocate a new
intel_fbc_work structure. Then later when we conclude we must delay
the FBC activation a little more, we allocate a new intel_fbc_work
struct, and then adjust dev_priv->fbc.fbc_work to point to the new
struct. So when the old work runs - at intel_fbc_work_fn() - it will
check that dev_priv->fbc.fbc_work points to something else, so it does
nothing. Everything is also protected by fbc.lock.
Just cancelling the old delayed work doesn't work because we might
just cancel it after the work function already started to run, but
while it is still waiting to grab fbc.lock. That's why we use the
"dev_priv->fbc.fbc_work == work" check described in the paragraph
above.
So now that we have a single work struct we have to introduce a new
way to synchronize everything. So we're making the work function a
normal work instead of a delayed work, and it will be responsible for
sleeping the appropriate amount of time itself. This way, after it
wakes up it can grab the lock, ask "were we delayed or cancelled?" and
then go back to sleep, enable FBC or give up.
v2:
- Spelling fixes.
- Rebase after changing the patch order.
- Fix ms/jiffies confusion.
Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk> (v1)
Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com>
Link: http://patchwork.freedesktop.org/patch/msgid/
2015-10-27 02:27:49 +08:00
retry :
/* Delay the actual enabling to let pageflipping cease and the
* display to settle before starting the compression . Note that
* this delay also serves a second purpose : it allows for a
* vblank to pass after disabling the FBC before we attempt
* to modify the control registers .
*
* WaFbcWaitForVBlankBeforeEnable : ilk , snb
2016-01-22 04:03:05 +08:00
*
* It is also worth mentioning that since work - > scheduled_vblank can be
* updated multiple times by the other threads , hitting the timeout is
* not an error condition . We ' ll just end up hitting the " goto retry "
* case below .
drm/i915: use a single intel_fbc_work struct
This was already on my TODO list, and was requested both by Chris and
Ville, for different reasons. The advantages are avoiding a frequent
malloc/free pair, and the locality of having the work structure
embedded in dev_priv. The maximum used memory is also smaller since
previously we could have multiple allocated intel_fbc_work structs at
the same time, and now we'll always have a single one - the one
embedded on dev_priv. Of course, we're now using a little more memory
on the cases where there's nothing scheduled.
The biggest challenge here is to keep everything synchronized the way
it was before.
Currently, when we try to activate FBC, we allocate a new
intel_fbc_work structure. Then later when we conclude we must delay
the FBC activation a little more, we allocate a new intel_fbc_work
struct, and then adjust dev_priv->fbc.fbc_work to point to the new
struct. So when the old work runs - at intel_fbc_work_fn() - it will
check that dev_priv->fbc.fbc_work points to something else, so it does
nothing. Everything is also protected by fbc.lock.
Just cancelling the old delayed work doesn't work because we might
just cancel it after the work function already started to run, but
while it is still waiting to grab fbc.lock. That's why we use the
"dev_priv->fbc.fbc_work == work" check described in the paragraph
above.
So now that we have a single work struct we have to introduce a new
way to synchronize everything. So we're making the work function a
normal work instead of a delayed work, and it will be responsible for
sleeping the appropriate amount of time itself. This way, after it
wakes up it can grab the lock, ask "were we delayed or cancelled?" and
then go back to sleep, enable FBC or give up.
v2:
- Spelling fixes.
- Rebase after changing the patch order.
- Fix ms/jiffies confusion.
Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk> (v1)
Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com>
Link: http://patchwork.freedesktop.org/patch/msgid/
2015-10-27 02:27:49 +08:00
*/
2016-01-22 04:03:05 +08:00
wait_event_timeout ( vblank - > queue ,
drm_crtc_vblank_count ( & crtc - > base ) ! = work - > scheduled_vblank ,
msecs_to_jiffies ( 50 ) ) ;
2014-12-09 00:09:10 +08:00
2016-01-12 03:44:36 +08:00
mutex_lock ( & fbc - > lock ) ;
2014-12-09 00:09:10 +08:00
drm/i915: use a single intel_fbc_work struct
This was already on my TODO list, and was requested both by Chris and
Ville, for different reasons. The advantages are avoiding a frequent
malloc/free pair, and the locality of having the work structure
embedded in dev_priv. The maximum used memory is also smaller since
previously we could have multiple allocated intel_fbc_work structs at
the same time, and now we'll always have a single one - the one
embedded on dev_priv. Of course, we're now using a little more memory
on the cases where there's nothing scheduled.
The biggest challenge here is to keep everything synchronized the way
it was before.
Currently, when we try to activate FBC, we allocate a new
intel_fbc_work structure. Then later when we conclude we must delay
the FBC activation a little more, we allocate a new intel_fbc_work
struct, and then adjust dev_priv->fbc.fbc_work to point to the new
struct. So when the old work runs - at intel_fbc_work_fn() - it will
check that dev_priv->fbc.fbc_work points to something else, so it does
nothing. Everything is also protected by fbc.lock.
Just cancelling the old delayed work doesn't work because we might
just cancel it after the work function already started to run, but
while it is still waiting to grab fbc.lock. That's why we use the
"dev_priv->fbc.fbc_work == work" check described in the paragraph
above.
So now that we have a single work struct we have to introduce a new
way to synchronize everything. So we're making the work function a
normal work instead of a delayed work, and it will be responsible for
sleeping the appropriate amount of time itself. This way, after it
wakes up it can grab the lock, ask "were we delayed or cancelled?" and
then go back to sleep, enable FBC or give up.
v2:
- Spelling fixes.
- Rebase after changing the patch order.
- Fix ms/jiffies confusion.
Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk> (v1)
Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com>
Link: http://patchwork.freedesktop.org/patch/msgid/
2015-10-27 02:27:49 +08:00
/* Were we cancelled? */
if ( ! work - > scheduled )
goto out ;
/* Were we delayed again while this function was sleeping? */
2016-01-22 04:03:05 +08:00
if ( drm_crtc_vblank_count ( & crtc - > base ) = = work - > scheduled_vblank ) {
2016-01-12 03:44:36 +08:00
mutex_unlock ( & fbc - > lock ) ;
drm/i915: use a single intel_fbc_work struct
This was already on my TODO list, and was requested both by Chris and
Ville, for different reasons. The advantages are avoiding a frequent
malloc/free pair, and the locality of having the work structure
embedded in dev_priv. The maximum used memory is also smaller since
previously we could have multiple allocated intel_fbc_work structs at
the same time, and now we'll always have a single one - the one
embedded on dev_priv. Of course, we're now using a little more memory
on the cases where there's nothing scheduled.
The biggest challenge here is to keep everything synchronized the way
it was before.
Currently, when we try to activate FBC, we allocate a new
intel_fbc_work structure. Then later when we conclude we must delay
the FBC activation a little more, we allocate a new intel_fbc_work
struct, and then adjust dev_priv->fbc.fbc_work to point to the new
struct. So when the old work runs - at intel_fbc_work_fn() - it will
check that dev_priv->fbc.fbc_work points to something else, so it does
nothing. Everything is also protected by fbc.lock.
Just cancelling the old delayed work doesn't work because we might
just cancel it after the work function already started to run, but
while it is still waiting to grab fbc.lock. That's why we use the
"dev_priv->fbc.fbc_work == work" check described in the paragraph
above.
So now that we have a single work struct we have to introduce a new
way to synchronize everything. So we're making the work function a
normal work instead of a delayed work, and it will be responsible for
sleeping the appropriate amount of time itself. This way, after it
wakes up it can grab the lock, ask "were we delayed or cancelled?" and
then go back to sleep, enable FBC or give up.
v2:
- Spelling fixes.
- Rebase after changing the patch order.
- Fix ms/jiffies confusion.
Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk> (v1)
Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com>
Link: http://patchwork.freedesktop.org/patch/msgid/
2015-10-27 02:27:49 +08:00
goto retry ;
2014-12-09 00:09:10 +08:00
}
drm/i915: use a single intel_fbc_work struct
This was already on my TODO list, and was requested both by Chris and
Ville, for different reasons. The advantages are avoiding a frequent
malloc/free pair, and the locality of having the work structure
embedded in dev_priv. The maximum used memory is also smaller since
previously we could have multiple allocated intel_fbc_work structs at
the same time, and now we'll always have a single one - the one
embedded on dev_priv. Of course, we're now using a little more memory
on the cases where there's nothing scheduled.
The biggest challenge here is to keep everything synchronized the way
it was before.
Currently, when we try to activate FBC, we allocate a new
intel_fbc_work structure. Then later when we conclude we must delay
the FBC activation a little more, we allocate a new intel_fbc_work
struct, and then adjust dev_priv->fbc.fbc_work to point to the new
struct. So when the old work runs - at intel_fbc_work_fn() - it will
check that dev_priv->fbc.fbc_work points to something else, so it does
nothing. Everything is also protected by fbc.lock.
Just cancelling the old delayed work doesn't work because we might
just cancel it after the work function already started to run, but
while it is still waiting to grab fbc.lock. That's why we use the
"dev_priv->fbc.fbc_work == work" check described in the paragraph
above.
So now that we have a single work struct we have to introduce a new
way to synchronize everything. So we're making the work function a
normal work instead of a delayed work, and it will be responsible for
sleeping the appropriate amount of time itself. This way, after it
wakes up it can grab the lock, ask "were we delayed or cancelled?" and
then go back to sleep, enable FBC or give up.
v2:
- Spelling fixes.
- Rebase after changing the patch order.
- Fix ms/jiffies confusion.
Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk> (v1)
Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com>
Link: http://patchwork.freedesktop.org/patch/msgid/
2015-10-27 02:27:49 +08:00
if ( crtc - > base . primary - > fb = = work - > fb )
2016-01-12 03:44:36 +08:00
fbc - > activate ( dev_priv ) ;
drm/i915: use a single intel_fbc_work struct
This was already on my TODO list, and was requested both by Chris and
Ville, for different reasons. The advantages are avoiding a frequent
malloc/free pair, and the locality of having the work structure
embedded in dev_priv. The maximum used memory is also smaller since
previously we could have multiple allocated intel_fbc_work structs at
the same time, and now we'll always have a single one - the one
embedded on dev_priv. Of course, we're now using a little more memory
on the cases where there's nothing scheduled.
The biggest challenge here is to keep everything synchronized the way
it was before.
Currently, when we try to activate FBC, we allocate a new
intel_fbc_work structure. Then later when we conclude we must delay
the FBC activation a little more, we allocate a new intel_fbc_work
struct, and then adjust dev_priv->fbc.fbc_work to point to the new
struct. So when the old work runs - at intel_fbc_work_fn() - it will
check that dev_priv->fbc.fbc_work points to something else, so it does
nothing. Everything is also protected by fbc.lock.
Just cancelling the old delayed work doesn't work because we might
just cancel it after the work function already started to run, but
while it is still waiting to grab fbc.lock. That's why we use the
"dev_priv->fbc.fbc_work == work" check described in the paragraph
above.
So now that we have a single work struct we have to introduce a new
way to synchronize everything. So we're making the work function a
normal work instead of a delayed work, and it will be responsible for
sleeping the appropriate amount of time itself. This way, after it
wakes up it can grab the lock, ask "were we delayed or cancelled?" and
then go back to sleep, enable FBC or give up.
v2:
- Spelling fixes.
- Rebase after changing the patch order.
- Fix ms/jiffies confusion.
Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk> (v1)
Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com>
Link: http://patchwork.freedesktop.org/patch/msgid/
2015-10-27 02:27:49 +08:00
work - > scheduled = false ;
out :
2016-01-12 03:44:36 +08:00
mutex_unlock ( & fbc - > lock ) ;
2016-01-22 04:03:05 +08:00
drm_crtc_vblank_put ( & crtc - > base ) ;
2014-12-09 00:09:10 +08:00
}
static void intel_fbc_cancel_work ( struct drm_i915_private * dev_priv )
{
2016-01-12 03:44:36 +08:00
struct intel_fbc * fbc = & dev_priv - > fbc ;
WARN_ON ( ! mutex_is_locked ( & fbc - > lock ) ) ;
fbc - > work . scheduled = false ;
2014-12-09 00:09:10 +08:00
}
2015-10-15 04:45:36 +08:00
static void intel_fbc_schedule_activation ( struct intel_crtc * crtc )
2014-12-09 00:09:10 +08:00
{
2015-07-08 02:26:05 +08:00
struct drm_i915_private * dev_priv = crtc - > base . dev - > dev_private ;
2016-01-12 03:44:36 +08:00
struct intel_fbc * fbc = & dev_priv - > fbc ;
struct intel_fbc_work * work = & fbc - > work ;
2014-12-09 00:09:10 +08:00
2016-01-12 03:44:36 +08:00
WARN_ON ( ! mutex_is_locked ( & fbc - > lock ) ) ;
2015-07-03 06:25:10 +08:00
2016-01-22 04:03:05 +08:00
if ( drm_crtc_vblank_get ( & crtc - > base ) ) {
DRM_ERROR ( " vblank not available for FBC on pipe %c \n " ,
pipe_name ( crtc - > pipe ) ) ;
return ;
}
drm/i915: use a single intel_fbc_work struct
This was already on my TODO list, and was requested both by Chris and
Ville, for different reasons. The advantages are avoiding a frequent
malloc/free pair, and the locality of having the work structure
embedded in dev_priv. The maximum used memory is also smaller since
previously we could have multiple allocated intel_fbc_work structs at
the same time, and now we'll always have a single one - the one
embedded on dev_priv. Of course, we're now using a little more memory
on the cases where there's nothing scheduled.
The biggest challenge here is to keep everything synchronized the way
it was before.
Currently, when we try to activate FBC, we allocate a new
intel_fbc_work structure. Then later when we conclude we must delay
the FBC activation a little more, we allocate a new intel_fbc_work
struct, and then adjust dev_priv->fbc.fbc_work to point to the new
struct. So when the old work runs - at intel_fbc_work_fn() - it will
check that dev_priv->fbc.fbc_work points to something else, so it does
nothing. Everything is also protected by fbc.lock.
Just cancelling the old delayed work doesn't work because we might
just cancel it after the work function already started to run, but
while it is still waiting to grab fbc.lock. That's why we use the
"dev_priv->fbc.fbc_work == work" check described in the paragraph
above.
So now that we have a single work struct we have to introduce a new
way to synchronize everything. So we're making the work function a
normal work instead of a delayed work, and it will be responsible for
sleeping the appropriate amount of time itself. This way, after it
wakes up it can grab the lock, ask "were we delayed or cancelled?" and
then go back to sleep, enable FBC or give up.
v2:
- Spelling fixes.
- Rebase after changing the patch order.
- Fix ms/jiffies confusion.
Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk> (v1)
Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com>
Link: http://patchwork.freedesktop.org/patch/msgid/
2015-10-27 02:27:49 +08:00
/* It is useless to call intel_fbc_cancel_work() in this function since
* we ' re not releasing fbc . lock , so it won ' t have an opportunity to grab
* it to discover that it was cancelled . So we just update the expected
* jiffy count . */
2015-07-08 02:26:05 +08:00
work - > fb = crtc - > base . primary - > fb ;
drm/i915: use a single intel_fbc_work struct
This was already on my TODO list, and was requested both by Chris and
Ville, for different reasons. The advantages are avoiding a frequent
malloc/free pair, and the locality of having the work structure
embedded in dev_priv. The maximum used memory is also smaller since
previously we could have multiple allocated intel_fbc_work structs at
the same time, and now we'll always have a single one - the one
embedded on dev_priv. Of course, we're now using a little more memory
on the cases where there's nothing scheduled.
The biggest challenge here is to keep everything synchronized the way
it was before.
Currently, when we try to activate FBC, we allocate a new
intel_fbc_work structure. Then later when we conclude we must delay
the FBC activation a little more, we allocate a new intel_fbc_work
struct, and then adjust dev_priv->fbc.fbc_work to point to the new
struct. So when the old work runs - at intel_fbc_work_fn() - it will
check that dev_priv->fbc.fbc_work points to something else, so it does
nothing. Everything is also protected by fbc.lock.
Just cancelling the old delayed work doesn't work because we might
just cancel it after the work function already started to run, but
while it is still waiting to grab fbc.lock. That's why we use the
"dev_priv->fbc.fbc_work == work" check described in the paragraph
above.
So now that we have a single work struct we have to introduce a new
way to synchronize everything. So we're making the work function a
normal work instead of a delayed work, and it will be responsible for
sleeping the appropriate amount of time itself. This way, after it
wakes up it can grab the lock, ask "were we delayed or cancelled?" and
then go back to sleep, enable FBC or give up.
v2:
- Spelling fixes.
- Rebase after changing the patch order.
- Fix ms/jiffies confusion.
Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk> (v1)
Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com>
Link: http://patchwork.freedesktop.org/patch/msgid/
2015-10-27 02:27:49 +08:00
work - > scheduled = true ;
2016-01-22 04:03:05 +08:00
work - > scheduled_vblank = drm_crtc_vblank_count ( & crtc - > base ) ;
drm_crtc_vblank_put ( & crtc - > base ) ;
2014-12-09 00:09:10 +08:00
drm/i915: use a single intel_fbc_work struct
This was already on my TODO list, and was requested both by Chris and
Ville, for different reasons. The advantages are avoiding a frequent
malloc/free pair, and the locality of having the work structure
embedded in dev_priv. The maximum used memory is also smaller since
previously we could have multiple allocated intel_fbc_work structs at
the same time, and now we'll always have a single one - the one
embedded on dev_priv. Of course, we're now using a little more memory
on the cases where there's nothing scheduled.
The biggest challenge here is to keep everything synchronized the way
it was before.
Currently, when we try to activate FBC, we allocate a new
intel_fbc_work structure. Then later when we conclude we must delay
the FBC activation a little more, we allocate a new intel_fbc_work
struct, and then adjust dev_priv->fbc.fbc_work to point to the new
struct. So when the old work runs - at intel_fbc_work_fn() - it will
check that dev_priv->fbc.fbc_work points to something else, so it does
nothing. Everything is also protected by fbc.lock.
Just cancelling the old delayed work doesn't work because we might
just cancel it after the work function already started to run, but
while it is still waiting to grab fbc.lock. That's why we use the
"dev_priv->fbc.fbc_work == work" check described in the paragraph
above.
So now that we have a single work struct we have to introduce a new
way to synchronize everything. So we're making the work function a
normal work instead of a delayed work, and it will be responsible for
sleeping the appropriate amount of time itself. This way, after it
wakes up it can grab the lock, ask "were we delayed or cancelled?" and
then go back to sleep, enable FBC or give up.
v2:
- Spelling fixes.
- Rebase after changing the patch order.
- Fix ms/jiffies confusion.
Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk> (v1)
Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com>
Link: http://patchwork.freedesktop.org/patch/msgid/
2015-10-27 02:27:49 +08:00
schedule_work ( & work - > work ) ;
2014-12-09 00:09:10 +08:00
}
2015-10-15 21:44:46 +08:00
static void __intel_fbc_deactivate ( struct drm_i915_private * dev_priv )
2015-07-03 06:25:10 +08:00
{
2016-01-12 03:44:36 +08:00
struct intel_fbc * fbc = & dev_priv - > fbc ;
WARN_ON ( ! mutex_is_locked ( & fbc - > lock ) ) ;
2015-07-03 06:25:10 +08:00
intel_fbc_cancel_work ( dev_priv ) ;
2016-01-12 03:44:36 +08:00
if ( fbc - > active )
fbc - > deactivate ( dev_priv ) ;
2015-10-14 06:13:25 +08:00
}
2015-07-03 06:25:10 +08:00
/*
2015-10-15 21:44:46 +08:00
* intel_fbc_deactivate - deactivate FBC if it ' s associated with crtc
2015-07-03 06:25:10 +08:00
* @ crtc : the CRTC
*
2015-10-15 21:44:46 +08:00
* This function deactivates FBC if it ' s associated with the provided CRTC .
2015-07-03 06:25:10 +08:00
*/
2015-10-15 21:44:46 +08:00
void intel_fbc_deactivate ( struct intel_crtc * crtc )
2015-07-03 06:25:10 +08:00
{
2015-07-08 02:26:04 +08:00
struct drm_i915_private * dev_priv = crtc - > base . dev - > dev_private ;
2016-01-12 03:44:36 +08:00
struct intel_fbc * fbc = & dev_priv - > fbc ;
2014-12-09 00:09:10 +08:00
2015-09-23 23:52:27 +08:00
if ( ! fbc_supported ( dev_priv ) )
2015-07-04 02:40:54 +08:00
return ;
2016-01-12 03:44:36 +08:00
mutex_lock ( & fbc - > lock ) ;
if ( fbc - > crtc = = crtc )
2015-10-15 21:44:46 +08:00
__intel_fbc_deactivate ( dev_priv ) ;
2016-01-12 03:44:36 +08:00
mutex_unlock ( & fbc - > lock ) ;
2014-12-09 00:09:10 +08:00
}
2015-06-13 01:36:20 +08:00
static void set_no_fbc_reason ( struct drm_i915_private * dev_priv ,
2015-10-28 00:50:03 +08:00
const char * reason )
2014-12-09 00:09:10 +08:00
{
2016-01-12 03:44:36 +08:00
struct intel_fbc * fbc = & dev_priv - > fbc ;
if ( fbc - > no_fbc_reason = = reason )
2015-06-13 01:36:20 +08:00
return ;
2014-12-09 00:09:10 +08:00
2016-01-12 03:44:36 +08:00
fbc - > no_fbc_reason = reason ;
2015-10-28 00:50:03 +08:00
DRM_DEBUG_KMS ( " Disabling FBC: %s \n " , reason ) ;
2014-12-09 00:09:10 +08:00
}
2015-10-15 21:44:46 +08:00
static bool crtc_can_fbc ( struct intel_crtc * crtc )
2015-11-05 03:10:48 +08:00
{
struct drm_i915_private * dev_priv = crtc - > base . dev - > dev_private ;
if ( fbc_on_pipe_a_only ( dev_priv ) & & crtc - > pipe ! = PIPE_A )
return false ;
2015-10-17 04:55:40 +08:00
if ( fbc_on_plane_a_only ( dev_priv ) & & crtc - > plane ! = PLANE_A )
return false ;
2015-10-15 21:44:46 +08:00
return true ;
}
2015-07-08 02:26:07 +08:00
static bool multiple_pipes_ok ( struct drm_i915_private * dev_priv )
{
enum pipe pipe ;
int n_pipes = 0 ;
struct drm_crtc * crtc ;
if ( INTEL_INFO ( dev_priv ) - > gen > 4 )
return true ;
for_each_pipe ( dev_priv , pipe ) {
crtc = dev_priv - > pipe_to_crtc_mapping [ pipe ] ;
if ( intel_crtc_active ( crtc ) & &
to_intel_plane_state ( crtc - > primary - > state ) - > visible )
n_pipes + + ;
}
return ( n_pipes < 2 ) ;
}
2015-07-08 02:26:04 +08:00
static int find_compression_threshold ( struct drm_i915_private * dev_priv ,
2015-07-03 06:25:08 +08:00
struct drm_mm_node * node ,
int size ,
int fb_cpp )
{
int compression_threshold = 1 ;
int ret ;
2015-09-15 02:19:57 +08:00
u64 end ;
/* The FBC hardware for BDW/SKL doesn't have access to the stolen
* reserved range size , so it always assumes the maximum ( 8 mb ) is used .
* If we enable FBC using a CFB on that memory range we ' ll get FIFO
* underruns , even if that range is not reserved by the BIOS . */
2015-10-28 19:16:45 +08:00
if ( IS_BROADWELL ( dev_priv ) | |
IS_SKYLAKE ( dev_priv ) | | IS_KABYLAKE ( dev_priv ) )
2015-09-15 02:19:57 +08:00
end = dev_priv - > gtt . stolen_size - 8 * 1024 * 1024 ;
else
end = dev_priv - > gtt . stolen_usable_size ;
2015-07-03 06:25:08 +08:00
/* HACK: This code depends on what we will do in *_enable_fbc. If that
* code changes , this code needs to change as well .
*
* The enable_fbc code will attempt to use one of our 2 compression
* thresholds , therefore , in that case , we only have 1 resort .
*/
/* Try to over-allocate to reduce reallocations and fragmentation. */
2015-09-15 02:19:57 +08:00
ret = i915_gem_stolen_insert_node_in_range ( dev_priv , node , size < < = 1 ,
4096 , 0 , end ) ;
2015-07-03 06:25:08 +08:00
if ( ret = = 0 )
return compression_threshold ;
again :
/* HW's ability to limit the CFB is 1:4 */
if ( compression_threshold > 4 | |
( fb_cpp = = 2 & & compression_threshold = = 2 ) )
return 0 ;
2015-09-15 02:19:57 +08:00
ret = i915_gem_stolen_insert_node_in_range ( dev_priv , node , size > > = 1 ,
4096 , 0 , end ) ;
2015-07-08 02:26:04 +08:00
if ( ret & & INTEL_INFO ( dev_priv ) - > gen < = 4 ) {
2015-07-03 06:25:08 +08:00
return 0 ;
} else if ( ret ) {
compression_threshold < < = 1 ;
goto again ;
} else {
return compression_threshold ;
}
}
drm/i915: alloc/free the FBC CFB during enable/disable
One of the problems with the current code is that it frees the CFB and
releases its drm_mm node as soon as we flip FBC's enable bit. This is
bad because after we disable FBC the hardware may still use the CFB
for the rest of the frame, so in theory we should only release the
drm_mm node one frame after we disable FBC. Otherwise, a stolen memory
allocation done right after an FBC disable may result in either
corrupted memory for the new owner of that memory region or corrupted
screen/underruns in case the new owner changes it while the hardware
is still reading it. This case is not exactly easy to reproduce since
we currently don't do a lot of stolen memory allocations, but I see
patches on the mailing list trying to expose stolen memory to user
space, so races will be possible.
I thought about three different approaches to solve this, and they all
have downsides.
The first approach would be to simply use multiple drm_mm nodes and
freeing the unused ones only after a frame has passed. The problem
with this approach is that since stolen memory is rather small,
there's a risk we just won't be able to allocate a new CFB from stolen
if the previous one was not freed yet. This could happen in case we
quickly disable FBC from pipe A and decide to enable it on pipe B, or
just if we change pipe A's fb stride while FBC is enabled.
The second approach would be similar to the first one, but maintaining
a single drm_mm node and keeping track of when it can be reused. This
would remove the disadvantage of not having enough space for two
nodes, but would create the new problem where we may not be able to
enable FBC at the point intel_fbc_update() is called, so we would have
to add more code to retry updating FBC after the time has passed. And
that can quickly get too complex since we can get invalidate, flush,
disable and other calls in the middle of the wait.
Both solutions above - and also the current code - have the problem
that we unnecessarily free+realloc FBC during invalidate+flush
operations even if the CFB size doesn't change.
The third option would be to move the allocation/deallocation to
enable/disable. This makes sure that the pipe is always disabled when
we allocate/deallocate the CFB, so there's no risk that the FBC
hardware may read or write to the memory right after it is freed from
drm_mm. The downside is that it is possible for user space to change
the buffer stride without triggering a disable/enable - only
deactivate/activate -, so we'll have to handle this case somehow - see
igt's kms_frontbuffer_tracking test, fbc-stridechange subtest. It
could be possible to implement a way to free+alloc the CFB during said
stride change, but it would involve a lot of book-keeping - exactly as
mentioned above - just for on case, so for now I'll keep it simple and
just deactivate FBC. Besides, we may not even need to disable FBC
since we do CFB over-allocation.
Note from Chris: "Starting a fullscreen client that covers a single
monitor in a multi-monitor setup will trigger a change in stride on
one of the CRTCs (the monitors will be flipped independently).". It
shouldn't be a huge problem if we lose FBC on multi-monitor setups
since these setups already have problems reaching deep PC states
anyway.
v2: Rebase after changing the patch order.
v3:
- Remove references to the stride change case being "uncommon" and
paste Chris' example.
- Rebase after a change in a previous patch.
Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk>
Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com>
Link: http://patchwork.freedesktop.org/patch/msgid/
2015-10-16 01:19:21 +08:00
static int intel_fbc_alloc_cfb ( struct intel_crtc * crtc )
2015-07-03 06:25:08 +08:00
{
drm/i915: alloc/free the FBC CFB during enable/disable
One of the problems with the current code is that it frees the CFB and
releases its drm_mm node as soon as we flip FBC's enable bit. This is
bad because after we disable FBC the hardware may still use the CFB
for the rest of the frame, so in theory we should only release the
drm_mm node one frame after we disable FBC. Otherwise, a stolen memory
allocation done right after an FBC disable may result in either
corrupted memory for the new owner of that memory region or corrupted
screen/underruns in case the new owner changes it while the hardware
is still reading it. This case is not exactly easy to reproduce since
we currently don't do a lot of stolen memory allocations, but I see
patches on the mailing list trying to expose stolen memory to user
space, so races will be possible.
I thought about three different approaches to solve this, and they all
have downsides.
The first approach would be to simply use multiple drm_mm nodes and
freeing the unused ones only after a frame has passed. The problem
with this approach is that since stolen memory is rather small,
there's a risk we just won't be able to allocate a new CFB from stolen
if the previous one was not freed yet. This could happen in case we
quickly disable FBC from pipe A and decide to enable it on pipe B, or
just if we change pipe A's fb stride while FBC is enabled.
The second approach would be similar to the first one, but maintaining
a single drm_mm node and keeping track of when it can be reused. This
would remove the disadvantage of not having enough space for two
nodes, but would create the new problem where we may not be able to
enable FBC at the point intel_fbc_update() is called, so we would have
to add more code to retry updating FBC after the time has passed. And
that can quickly get too complex since we can get invalidate, flush,
disable and other calls in the middle of the wait.
Both solutions above - and also the current code - have the problem
that we unnecessarily free+realloc FBC during invalidate+flush
operations even if the CFB size doesn't change.
The third option would be to move the allocation/deallocation to
enable/disable. This makes sure that the pipe is always disabled when
we allocate/deallocate the CFB, so there's no risk that the FBC
hardware may read or write to the memory right after it is freed from
drm_mm. The downside is that it is possible for user space to change
the buffer stride without triggering a disable/enable - only
deactivate/activate -, so we'll have to handle this case somehow - see
igt's kms_frontbuffer_tracking test, fbc-stridechange subtest. It
could be possible to implement a way to free+alloc the CFB during said
stride change, but it would involve a lot of book-keeping - exactly as
mentioned above - just for on case, so for now I'll keep it simple and
just deactivate FBC. Besides, we may not even need to disable FBC
since we do CFB over-allocation.
Note from Chris: "Starting a fullscreen client that covers a single
monitor in a multi-monitor setup will trigger a change in stride on
one of the CRTCs (the monitors will be flipped independently).". It
shouldn't be a huge problem if we lose FBC on multi-monitor setups
since these setups already have problems reaching deep PC states
anyway.
v2: Rebase after changing the patch order.
v3:
- Remove references to the stride change case being "uncommon" and
paste Chris' example.
- Rebase after a change in a previous patch.
Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk>
Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com>
Link: http://patchwork.freedesktop.org/patch/msgid/
2015-10-16 01:19:21 +08:00
struct drm_i915_private * dev_priv = crtc - > base . dev - > dev_private ;
2016-01-12 03:44:36 +08:00
struct intel_fbc * fbc = & dev_priv - > fbc ;
drm/i915: alloc/free the FBC CFB during enable/disable
One of the problems with the current code is that it frees the CFB and
releases its drm_mm node as soon as we flip FBC's enable bit. This is
bad because after we disable FBC the hardware may still use the CFB
for the rest of the frame, so in theory we should only release the
drm_mm node one frame after we disable FBC. Otherwise, a stolen memory
allocation done right after an FBC disable may result in either
corrupted memory for the new owner of that memory region or corrupted
screen/underruns in case the new owner changes it while the hardware
is still reading it. This case is not exactly easy to reproduce since
we currently don't do a lot of stolen memory allocations, but I see
patches on the mailing list trying to expose stolen memory to user
space, so races will be possible.
I thought about three different approaches to solve this, and they all
have downsides.
The first approach would be to simply use multiple drm_mm nodes and
freeing the unused ones only after a frame has passed. The problem
with this approach is that since stolen memory is rather small,
there's a risk we just won't be able to allocate a new CFB from stolen
if the previous one was not freed yet. This could happen in case we
quickly disable FBC from pipe A and decide to enable it on pipe B, or
just if we change pipe A's fb stride while FBC is enabled.
The second approach would be similar to the first one, but maintaining
a single drm_mm node and keeping track of when it can be reused. This
would remove the disadvantage of not having enough space for two
nodes, but would create the new problem where we may not be able to
enable FBC at the point intel_fbc_update() is called, so we would have
to add more code to retry updating FBC after the time has passed. And
that can quickly get too complex since we can get invalidate, flush,
disable and other calls in the middle of the wait.
Both solutions above - and also the current code - have the problem
that we unnecessarily free+realloc FBC during invalidate+flush
operations even if the CFB size doesn't change.
The third option would be to move the allocation/deallocation to
enable/disable. This makes sure that the pipe is always disabled when
we allocate/deallocate the CFB, so there's no risk that the FBC
hardware may read or write to the memory right after it is freed from
drm_mm. The downside is that it is possible for user space to change
the buffer stride without triggering a disable/enable - only
deactivate/activate -, so we'll have to handle this case somehow - see
igt's kms_frontbuffer_tracking test, fbc-stridechange subtest. It
could be possible to implement a way to free+alloc the CFB during said
stride change, but it would involve a lot of book-keeping - exactly as
mentioned above - just for on case, so for now I'll keep it simple and
just deactivate FBC. Besides, we may not even need to disable FBC
since we do CFB over-allocation.
Note from Chris: "Starting a fullscreen client that covers a single
monitor in a multi-monitor setup will trigger a change in stride on
one of the CRTCs (the monitors will be flipped independently).". It
shouldn't be a huge problem if we lose FBC on multi-monitor setups
since these setups already have problems reaching deep PC states
anyway.
v2: Rebase after changing the patch order.
v3:
- Remove references to the stride change case being "uncommon" and
paste Chris' example.
- Rebase after a change in a previous patch.
Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk>
Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com>
Link: http://patchwork.freedesktop.org/patch/msgid/
2015-10-16 01:19:21 +08:00
struct drm_framebuffer * fb = crtc - > base . primary - > state - > fb ;
2015-07-03 06:25:08 +08:00
struct drm_mm_node * uninitialized_var ( compressed_llb ) ;
drm/i915: alloc/free the FBC CFB during enable/disable
One of the problems with the current code is that it frees the CFB and
releases its drm_mm node as soon as we flip FBC's enable bit. This is
bad because after we disable FBC the hardware may still use the CFB
for the rest of the frame, so in theory we should only release the
drm_mm node one frame after we disable FBC. Otherwise, a stolen memory
allocation done right after an FBC disable may result in either
corrupted memory for the new owner of that memory region or corrupted
screen/underruns in case the new owner changes it while the hardware
is still reading it. This case is not exactly easy to reproduce since
we currently don't do a lot of stolen memory allocations, but I see
patches on the mailing list trying to expose stolen memory to user
space, so races will be possible.
I thought about three different approaches to solve this, and they all
have downsides.
The first approach would be to simply use multiple drm_mm nodes and
freeing the unused ones only after a frame has passed. The problem
with this approach is that since stolen memory is rather small,
there's a risk we just won't be able to allocate a new CFB from stolen
if the previous one was not freed yet. This could happen in case we
quickly disable FBC from pipe A and decide to enable it on pipe B, or
just if we change pipe A's fb stride while FBC is enabled.
The second approach would be similar to the first one, but maintaining
a single drm_mm node and keeping track of when it can be reused. This
would remove the disadvantage of not having enough space for two
nodes, but would create the new problem where we may not be able to
enable FBC at the point intel_fbc_update() is called, so we would have
to add more code to retry updating FBC after the time has passed. And
that can quickly get too complex since we can get invalidate, flush,
disable and other calls in the middle of the wait.
Both solutions above - and also the current code - have the problem
that we unnecessarily free+realloc FBC during invalidate+flush
operations even if the CFB size doesn't change.
The third option would be to move the allocation/deallocation to
enable/disable. This makes sure that the pipe is always disabled when
we allocate/deallocate the CFB, so there's no risk that the FBC
hardware may read or write to the memory right after it is freed from
drm_mm. The downside is that it is possible for user space to change
the buffer stride without triggering a disable/enable - only
deactivate/activate -, so we'll have to handle this case somehow - see
igt's kms_frontbuffer_tracking test, fbc-stridechange subtest. It
could be possible to implement a way to free+alloc the CFB during said
stride change, but it would involve a lot of book-keeping - exactly as
mentioned above - just for on case, so for now I'll keep it simple and
just deactivate FBC. Besides, we may not even need to disable FBC
since we do CFB over-allocation.
Note from Chris: "Starting a fullscreen client that covers a single
monitor in a multi-monitor setup will trigger a change in stride on
one of the CRTCs (the monitors will be flipped independently).". It
shouldn't be a huge problem if we lose FBC on multi-monitor setups
since these setups already have problems reaching deep PC states
anyway.
v2: Rebase after changing the patch order.
v3:
- Remove references to the stride change case being "uncommon" and
paste Chris' example.
- Rebase after a change in a previous patch.
Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk>
Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com>
Link: http://patchwork.freedesktop.org/patch/msgid/
2015-10-16 01:19:21 +08:00
int size , fb_cpp , ret ;
2016-01-12 03:44:36 +08:00
WARN_ON ( drm_mm_node_allocated ( & fbc - > compressed_fb ) ) ;
drm/i915: alloc/free the FBC CFB during enable/disable
One of the problems with the current code is that it frees the CFB and
releases its drm_mm node as soon as we flip FBC's enable bit. This is
bad because after we disable FBC the hardware may still use the CFB
for the rest of the frame, so in theory we should only release the
drm_mm node one frame after we disable FBC. Otherwise, a stolen memory
allocation done right after an FBC disable may result in either
corrupted memory for the new owner of that memory region or corrupted
screen/underruns in case the new owner changes it while the hardware
is still reading it. This case is not exactly easy to reproduce since
we currently don't do a lot of stolen memory allocations, but I see
patches on the mailing list trying to expose stolen memory to user
space, so races will be possible.
I thought about three different approaches to solve this, and they all
have downsides.
The first approach would be to simply use multiple drm_mm nodes and
freeing the unused ones only after a frame has passed. The problem
with this approach is that since stolen memory is rather small,
there's a risk we just won't be able to allocate a new CFB from stolen
if the previous one was not freed yet. This could happen in case we
quickly disable FBC from pipe A and decide to enable it on pipe B, or
just if we change pipe A's fb stride while FBC is enabled.
The second approach would be similar to the first one, but maintaining
a single drm_mm node and keeping track of when it can be reused. This
would remove the disadvantage of not having enough space for two
nodes, but would create the new problem where we may not be able to
enable FBC at the point intel_fbc_update() is called, so we would have
to add more code to retry updating FBC after the time has passed. And
that can quickly get too complex since we can get invalidate, flush,
disable and other calls in the middle of the wait.
Both solutions above - and also the current code - have the problem
that we unnecessarily free+realloc FBC during invalidate+flush
operations even if the CFB size doesn't change.
The third option would be to move the allocation/deallocation to
enable/disable. This makes sure that the pipe is always disabled when
we allocate/deallocate the CFB, so there's no risk that the FBC
hardware may read or write to the memory right after it is freed from
drm_mm. The downside is that it is possible for user space to change
the buffer stride without triggering a disable/enable - only
deactivate/activate -, so we'll have to handle this case somehow - see
igt's kms_frontbuffer_tracking test, fbc-stridechange subtest. It
could be possible to implement a way to free+alloc the CFB during said
stride change, but it would involve a lot of book-keeping - exactly as
mentioned above - just for on case, so for now I'll keep it simple and
just deactivate FBC. Besides, we may not even need to disable FBC
since we do CFB over-allocation.
Note from Chris: "Starting a fullscreen client that covers a single
monitor in a multi-monitor setup will trigger a change in stride on
one of the CRTCs (the monitors will be flipped independently).". It
shouldn't be a huge problem if we lose FBC on multi-monitor setups
since these setups already have problems reaching deep PC states
anyway.
v2: Rebase after changing the patch order.
v3:
- Remove references to the stride change case being "uncommon" and
paste Chris' example.
- Rebase after a change in a previous patch.
Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk>
Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com>
Link: http://patchwork.freedesktop.org/patch/msgid/
2015-10-16 01:19:21 +08:00
size = intel_fbc_calculate_cfb_size ( crtc , fb ) ;
fb_cpp = drm_format_plane_cpp ( fb - > pixel_format , 0 ) ;
2015-07-03 06:25:08 +08:00
2016-01-12 03:44:36 +08:00
ret = find_compression_threshold ( dev_priv , & fbc - > compressed_fb ,
2015-07-03 06:25:08 +08:00
size , fb_cpp ) ;
if ( ! ret )
goto err_llb ;
else if ( ret > 1 ) {
DRM_INFO ( " Reducing the compressed framebuffer size. This may lead to less power savings than a non-reduced-size. Try to increase stolen memory size if available in BIOS. \n " ) ;
}
2016-01-12 03:44:36 +08:00
fbc - > threshold = ret ;
2015-07-03 06:25:08 +08:00
if ( INTEL_INFO ( dev_priv ) - > gen > = 5 )
2016-01-12 03:44:36 +08:00
I915_WRITE ( ILK_DPFC_CB_BASE , fbc - > compressed_fb . start ) ;
2015-07-08 02:26:04 +08:00
else if ( IS_GM45 ( dev_priv ) ) {
2016-01-12 03:44:36 +08:00
I915_WRITE ( DPFC_CB_BASE , fbc - > compressed_fb . start ) ;
2015-07-03 06:25:08 +08:00
} else {
compressed_llb = kzalloc ( sizeof ( * compressed_llb ) , GFP_KERNEL ) ;
if ( ! compressed_llb )
goto err_fb ;
ret = i915_gem_stolen_insert_node ( dev_priv , compressed_llb ,
4096 , 4096 ) ;
if ( ret )
goto err_fb ;
2016-01-12 03:44:36 +08:00
fbc - > compressed_llb = compressed_llb ;
2015-07-03 06:25:08 +08:00
I915_WRITE ( FBC_CFB_BASE ,
2016-01-12 03:44:36 +08:00
dev_priv - > mm . stolen_base + fbc - > compressed_fb . start ) ;
2015-07-03 06:25:08 +08:00
I915_WRITE ( FBC_LL_BASE ,
dev_priv - > mm . stolen_base + compressed_llb - > start ) ;
}
2015-09-15 02:19:58 +08:00
DRM_DEBUG_KMS ( " reserved %llu bytes of contiguous stolen space for FBC, threshold: %d \n " ,
2016-01-12 03:44:36 +08:00
fbc - > compressed_fb . size , fbc - > threshold ) ;
2015-07-03 06:25:08 +08:00
return 0 ;
err_fb :
kfree ( compressed_llb ) ;
2016-01-12 03:44:36 +08:00
i915_gem_stolen_remove_node ( dev_priv , & fbc - > compressed_fb ) ;
2015-07-03 06:25:08 +08:00
err_llb :
pr_info_once ( " drm: not enough stolen space for compressed buffer (need %d more bytes), disabling. Hint: you may be able to increase stolen memory size in the BIOS to avoid this. \n " , size ) ;
return - ENOSPC ;
}
2015-07-08 02:26:04 +08:00
static void __intel_fbc_cleanup_cfb ( struct drm_i915_private * dev_priv )
2015-07-03 06:25:08 +08:00
{
2016-01-12 03:44:36 +08:00
struct intel_fbc * fbc = & dev_priv - > fbc ;
if ( drm_mm_node_allocated ( & fbc - > compressed_fb ) )
i915_gem_stolen_remove_node ( dev_priv , & fbc - > compressed_fb ) ;
if ( fbc - > compressed_llb ) {
i915_gem_stolen_remove_node ( dev_priv , fbc - > compressed_llb ) ;
kfree ( fbc - > compressed_llb ) ;
2015-07-03 06:25:08 +08:00
}
}
2015-07-08 02:26:04 +08:00
void intel_fbc_cleanup_cfb ( struct drm_i915_private * dev_priv )
2015-07-03 06:25:10 +08:00
{
2016-01-12 03:44:36 +08:00
struct intel_fbc * fbc = & dev_priv - > fbc ;
2015-09-23 23:52:27 +08:00
if ( ! fbc_supported ( dev_priv ) )
2015-07-04 02:40:54 +08:00
return ;
2016-01-12 03:44:36 +08:00
mutex_lock ( & fbc - > lock ) ;
2015-07-08 02:26:04 +08:00
__intel_fbc_cleanup_cfb ( dev_priv ) ;
2016-01-12 03:44:36 +08:00
mutex_unlock ( & fbc - > lock ) ;
2015-07-03 06:25:10 +08:00
}
2015-09-15 02:19:56 +08:00
static bool stride_is_valid ( struct drm_i915_private * dev_priv ,
unsigned int stride )
{
/* These should have been caught earlier. */
WARN_ON ( stride < 512 ) ;
WARN_ON ( ( stride & ( 64 - 1 ) ) ! = 0 ) ;
/* Below are the additional FBC restrictions. */
if ( IS_GEN2 ( dev_priv ) | | IS_GEN3 ( dev_priv ) )
return stride = = 4096 | | stride = = 8192 ;
if ( IS_GEN4 ( dev_priv ) & & ! IS_G4X ( dev_priv ) & & stride < 2048 )
return false ;
if ( stride > 16384 )
return false ;
return true ;
}
2015-09-22 06:48:06 +08:00
static bool pixel_format_is_valid ( struct drm_framebuffer * fb )
{
struct drm_device * dev = fb - > dev ;
struct drm_i915_private * dev_priv = dev - > dev_private ;
switch ( fb - > pixel_format ) {
case DRM_FORMAT_XRGB8888 :
case DRM_FORMAT_XBGR8888 :
return true ;
case DRM_FORMAT_XRGB1555 :
case DRM_FORMAT_RGB565 :
/* 16bpp not supported on gen2 */
if ( IS_GEN2 ( dev ) )
return false ;
/* WaFbcOnly1to1Ratio:ctg */
if ( IS_G4X ( dev_priv ) )
return false ;
return true ;
default :
return false ;
}
}
2015-10-02 06:57:12 +08:00
/*
* For some reason , the hardware tracking starts looking at whatever we
* programmed as the display plane base address register . It does not look at
* the X and Y offset registers . That ' s why we look at the crtc - > adjusted { x , y }
* variables instead of just looking at the pipe / plane size .
*/
static bool intel_fbc_hw_tracking_covers_screen ( struct intel_crtc * crtc )
2015-09-23 23:52:24 +08:00
{
struct drm_i915_private * dev_priv = crtc - > base . dev - > dev_private ;
2015-10-02 06:57:12 +08:00
unsigned int effective_w , effective_h , max_w , max_h ;
2015-09-23 23:52:24 +08:00
if ( INTEL_INFO ( dev_priv ) - > gen > = 8 | | IS_HASWELL ( dev_priv ) ) {
max_w = 4096 ;
max_h = 4096 ;
} else if ( IS_G4X ( dev_priv ) | | INTEL_INFO ( dev_priv ) - > gen > = 5 ) {
max_w = 4096 ;
max_h = 2048 ;
} else {
max_w = 2048 ;
max_h = 1536 ;
}
2015-10-02 06:57:12 +08:00
intel_fbc_get_plane_source_size ( crtc , & effective_w , & effective_h ) ;
effective_w + = crtc - > adjusted_x ;
effective_h + = crtc - > adjusted_y ;
return effective_w < = max_w & & effective_h < = max_h ;
2015-09-23 23:52:24 +08:00
}
2016-01-19 21:35:35 +08:00
static bool intel_fbc_can_activate ( struct intel_crtc * crtc )
2014-12-09 00:09:10 +08:00
{
2015-10-14 06:13:25 +08:00
struct drm_i915_private * dev_priv = crtc - > base . dev - > dev_private ;
2016-01-12 03:44:36 +08:00
struct intel_fbc * fbc = & dev_priv - > fbc ;
2016-01-19 21:35:35 +08:00
struct drm_plane * primary ;
2014-12-09 00:09:10 +08:00
struct drm_framebuffer * fb ;
2016-01-19 21:35:35 +08:00
struct intel_plane_state * plane_state ;
2014-12-09 00:09:10 +08:00
struct drm_i915_gem_object * obj ;
const struct drm_display_mode * adjusted_mode ;
2016-01-19 21:35:35 +08:00
if ( ! intel_crtc_active ( & crtc - > base ) ) {
set_no_fbc_reason ( dev_priv , " CRTC not active " ) ;
return false ;
2015-07-08 02:26:08 +08:00
}
2014-12-09 00:09:10 +08:00
2016-01-19 21:35:35 +08:00
primary = crtc - > base . primary ;
fb = primary - > fb ;
2014-12-09 00:09:10 +08:00
obj = intel_fb_obj ( fb ) ;
2015-11-05 03:10:49 +08:00
adjusted_mode = & crtc - > config - > base . adjusted_mode ;
2016-01-19 21:35:35 +08:00
plane_state = to_intel_plane_state ( primary - > state ) ;
if ( ! plane_state - > visible ) {
set_no_fbc_reason ( dev_priv , " primary plane not visible " ) ;
return false ;
}
2014-12-09 00:09:10 +08:00
if ( ( adjusted_mode - > flags & DRM_MODE_FLAG_INTERLACE ) | |
( adjusted_mode - > flags & DRM_MODE_FLAG_DBLSCAN ) ) {
2015-10-28 00:50:03 +08:00
set_no_fbc_reason ( dev_priv , " incompatible mode " ) ;
2016-01-19 21:35:35 +08:00
return false ;
2014-12-09 00:09:10 +08:00
}
2015-11-05 03:10:49 +08:00
if ( ! intel_fbc_hw_tracking_covers_screen ( crtc ) ) {
2015-10-28 00:50:03 +08:00
set_no_fbc_reason ( dev_priv , " mode too large for compression " ) ;
2016-01-19 21:35:35 +08:00
return false ;
2014-12-09 00:09:10 +08:00
}
2015-09-23 23:52:24 +08:00
2014-12-09 00:09:10 +08:00
/* The use of a CPU fence is mandatory in order to detect writes
* by the CPU to the scanout and trigger updates to the FBC .
*/
if ( obj - > tiling_mode ! = I915_TILING_X | |
obj - > fence_reg = = I915_FENCE_REG_NONE ) {
2015-10-28 00:50:03 +08:00
set_no_fbc_reason ( dev_priv , " framebuffer not tiled or fenced " ) ;
2016-01-19 21:35:35 +08:00
return false ;
2014-12-09 00:09:10 +08:00
}
2015-07-08 02:26:04 +08:00
if ( INTEL_INFO ( dev_priv ) - > gen < = 4 & & ! IS_G4X ( dev_priv ) & &
2016-01-19 21:35:35 +08:00
plane_state - > base . rotation ! = BIT ( DRM_ROTATE_0 ) ) {
2015-10-28 00:50:03 +08:00
set_no_fbc_reason ( dev_priv , " rotation unsupported " ) ;
2016-01-19 21:35:35 +08:00
return false ;
2014-12-09 00:09:10 +08:00
}
2015-09-15 02:19:56 +08:00
if ( ! stride_is_valid ( dev_priv , fb - > pitches [ 0 ] ) ) {
2015-10-28 00:50:03 +08:00
set_no_fbc_reason ( dev_priv , " framebuffer stride not supported " ) ;
2016-01-19 21:35:35 +08:00
return false ;
2015-09-15 02:19:56 +08:00
}
2015-09-22 06:48:06 +08:00
if ( ! pixel_format_is_valid ( fb ) ) {
2015-10-28 00:50:03 +08:00
set_no_fbc_reason ( dev_priv , " pixel format is invalid " ) ;
2016-01-19 21:35:35 +08:00
return false ;
2015-09-22 06:48:06 +08:00
}
2015-09-15 02:19:59 +08:00
/* WaFbcExceedCdClockThreshold:hsw,bdw */
if ( ( IS_HASWELL ( dev_priv ) | | IS_BROADWELL ( dev_priv ) ) & &
2015-11-05 03:10:49 +08:00
ilk_pipe_pixel_rate ( crtc - > config ) > =
2015-09-15 02:19:59 +08:00
dev_priv - > cdclk_freq * 95 / 100 ) {
2015-10-28 00:50:03 +08:00
set_no_fbc_reason ( dev_priv , " pixel rate is too big " ) ;
2016-01-19 21:35:35 +08:00
return false ;
2015-09-15 02:19:59 +08:00
}
drm/i915: alloc/free the FBC CFB during enable/disable
One of the problems with the current code is that it frees the CFB and
releases its drm_mm node as soon as we flip FBC's enable bit. This is
bad because after we disable FBC the hardware may still use the CFB
for the rest of the frame, so in theory we should only release the
drm_mm node one frame after we disable FBC. Otherwise, a stolen memory
allocation done right after an FBC disable may result in either
corrupted memory for the new owner of that memory region or corrupted
screen/underruns in case the new owner changes it while the hardware
is still reading it. This case is not exactly easy to reproduce since
we currently don't do a lot of stolen memory allocations, but I see
patches on the mailing list trying to expose stolen memory to user
space, so races will be possible.
I thought about three different approaches to solve this, and they all
have downsides.
The first approach would be to simply use multiple drm_mm nodes and
freeing the unused ones only after a frame has passed. The problem
with this approach is that since stolen memory is rather small,
there's a risk we just won't be able to allocate a new CFB from stolen
if the previous one was not freed yet. This could happen in case we
quickly disable FBC from pipe A and decide to enable it on pipe B, or
just if we change pipe A's fb stride while FBC is enabled.
The second approach would be similar to the first one, but maintaining
a single drm_mm node and keeping track of when it can be reused. This
would remove the disadvantage of not having enough space for two
nodes, but would create the new problem where we may not be able to
enable FBC at the point intel_fbc_update() is called, so we would have
to add more code to retry updating FBC after the time has passed. And
that can quickly get too complex since we can get invalidate, flush,
disable and other calls in the middle of the wait.
Both solutions above - and also the current code - have the problem
that we unnecessarily free+realloc FBC during invalidate+flush
operations even if the CFB size doesn't change.
The third option would be to move the allocation/deallocation to
enable/disable. This makes sure that the pipe is always disabled when
we allocate/deallocate the CFB, so there's no risk that the FBC
hardware may read or write to the memory right after it is freed from
drm_mm. The downside is that it is possible for user space to change
the buffer stride without triggering a disable/enable - only
deactivate/activate -, so we'll have to handle this case somehow - see
igt's kms_frontbuffer_tracking test, fbc-stridechange subtest. It
could be possible to implement a way to free+alloc the CFB during said
stride change, but it would involve a lot of book-keeping - exactly as
mentioned above - just for on case, so for now I'll keep it simple and
just deactivate FBC. Besides, we may not even need to disable FBC
since we do CFB over-allocation.
Note from Chris: "Starting a fullscreen client that covers a single
monitor in a multi-monitor setup will trigger a change in stride on
one of the CRTCs (the monitors will be flipped independently).". It
shouldn't be a huge problem if we lose FBC on multi-monitor setups
since these setups already have problems reaching deep PC states
anyway.
v2: Rebase after changing the patch order.
v3:
- Remove references to the stride change case being "uncommon" and
paste Chris' example.
- Rebase after a change in a previous patch.
Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk>
Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com>
Link: http://patchwork.freedesktop.org/patch/msgid/
2015-10-16 01:19:21 +08:00
/* It is possible for the required CFB size change without a
* crtc - > disable + crtc - > enable since it is possible to change the
* stride without triggering a full modeset . Since we try to
* over - allocate the CFB , there ' s a chance we may keep FBC enabled even
* if this happens , but if we exceed the current CFB size we ' ll have to
* disable FBC . Notice that it would be possible to disable FBC , wait
* for a frame , free the stolen node , then try to reenable FBC in case
* we didn ' t get any invalidate / deactivate calls , but this would require
* a lot of tracking just for a specific case . If we conclude it ' s an
* important case , we can implement it later . */
if ( intel_fbc_calculate_cfb_size ( crtc , fb ) >
2016-01-12 03:44:36 +08:00
fbc - > compressed_fb . size * fbc - > threshold ) {
drm/i915: alloc/free the FBC CFB during enable/disable
One of the problems with the current code is that it frees the CFB and
releases its drm_mm node as soon as we flip FBC's enable bit. This is
bad because after we disable FBC the hardware may still use the CFB
for the rest of the frame, so in theory we should only release the
drm_mm node one frame after we disable FBC. Otherwise, a stolen memory
allocation done right after an FBC disable may result in either
corrupted memory for the new owner of that memory region or corrupted
screen/underruns in case the new owner changes it while the hardware
is still reading it. This case is not exactly easy to reproduce since
we currently don't do a lot of stolen memory allocations, but I see
patches on the mailing list trying to expose stolen memory to user
space, so races will be possible.
I thought about three different approaches to solve this, and they all
have downsides.
The first approach would be to simply use multiple drm_mm nodes and
freeing the unused ones only after a frame has passed. The problem
with this approach is that since stolen memory is rather small,
there's a risk we just won't be able to allocate a new CFB from stolen
if the previous one was not freed yet. This could happen in case we
quickly disable FBC from pipe A and decide to enable it on pipe B, or
just if we change pipe A's fb stride while FBC is enabled.
The second approach would be similar to the first one, but maintaining
a single drm_mm node and keeping track of when it can be reused. This
would remove the disadvantage of not having enough space for two
nodes, but would create the new problem where we may not be able to
enable FBC at the point intel_fbc_update() is called, so we would have
to add more code to retry updating FBC after the time has passed. And
that can quickly get too complex since we can get invalidate, flush,
disable and other calls in the middle of the wait.
Both solutions above - and also the current code - have the problem
that we unnecessarily free+realloc FBC during invalidate+flush
operations even if the CFB size doesn't change.
The third option would be to move the allocation/deallocation to
enable/disable. This makes sure that the pipe is always disabled when
we allocate/deallocate the CFB, so there's no risk that the FBC
hardware may read or write to the memory right after it is freed from
drm_mm. The downside is that it is possible for user space to change
the buffer stride without triggering a disable/enable - only
deactivate/activate -, so we'll have to handle this case somehow - see
igt's kms_frontbuffer_tracking test, fbc-stridechange subtest. It
could be possible to implement a way to free+alloc the CFB during said
stride change, but it would involve a lot of book-keeping - exactly as
mentioned above - just for on case, so for now I'll keep it simple and
just deactivate FBC. Besides, we may not even need to disable FBC
since we do CFB over-allocation.
Note from Chris: "Starting a fullscreen client that covers a single
monitor in a multi-monitor setup will trigger a change in stride on
one of the CRTCs (the monitors will be flipped independently).". It
shouldn't be a huge problem if we lose FBC on multi-monitor setups
since these setups already have problems reaching deep PC states
anyway.
v2: Rebase after changing the patch order.
v3:
- Remove references to the stride change case being "uncommon" and
paste Chris' example.
- Rebase after a change in a previous patch.
Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk>
Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com>
Link: http://patchwork.freedesktop.org/patch/msgid/
2015-10-16 01:19:21 +08:00
set_no_fbc_reason ( dev_priv , " CFB requirements changed " ) ;
2016-01-19 21:35:35 +08:00
return false ;
}
return true ;
}
2016-01-19 21:35:36 +08:00
static bool intel_fbc_can_enable ( struct intel_crtc * crtc )
{
struct drm_i915_private * dev_priv = crtc - > base . dev - > dev_private ;
if ( intel_vgpu_active ( dev_priv - > dev ) ) {
set_no_fbc_reason ( dev_priv , " VGPU is active " ) ;
return false ;
}
if ( i915 . enable_fbc < 0 ) {
set_no_fbc_reason ( dev_priv , " disabled per chip default " ) ;
return false ;
}
if ( ! i915 . enable_fbc ) {
set_no_fbc_reason ( dev_priv , " disabled per module param " ) ;
return false ;
}
if ( ! crtc_can_fbc ( crtc ) ) {
set_no_fbc_reason ( dev_priv , " no enabled pipes can have FBC " ) ;
return false ;
}
return true ;
}
2015-12-24 04:28:11 +08:00
static void intel_fbc_get_reg_params ( struct intel_crtc * crtc ,
struct intel_fbc_reg_params * params )
{
struct drm_i915_private * dev_priv = crtc - > base . dev - > dev_private ;
struct drm_framebuffer * fb = crtc - > base . primary - > fb ;
struct drm_i915_gem_object * obj = intel_fb_obj ( fb ) ;
/* Since all our fields are integer types, use memset here so the
* comparison function can rely on memcmp because the padding will be
* zero . */
memset ( params , 0 , sizeof ( * params ) ) ;
params - > crtc . pipe = crtc - > pipe ;
params - > crtc . plane = crtc - > plane ;
params - > crtc . fence_y_offset = get_crtc_fence_y_offset ( crtc ) ;
params - > fb . id = fb - > base . id ;
params - > fb . pixel_format = fb - > pixel_format ;
params - > fb . stride = fb - > pitches [ 0 ] ;
params - > fb . fence_reg = obj - > fence_reg ;
params - > cfb_size = intel_fbc_calculate_cfb_size ( crtc , fb ) ;
/* FIXME: We lack the proper locking here, so only run this on the
* platforms that need . */
if ( dev_priv - > fbc . activate = = ilk_fbc_activate )
params - > fb . ggtt_offset = i915_gem_obj_ggtt_offset ( obj ) ;
}
static bool intel_fbc_reg_params_equal ( struct intel_fbc_reg_params * params1 ,
struct intel_fbc_reg_params * params2 )
{
/* We can use this since intel_fbc_get_reg_params() does a memset. */
return memcmp ( params1 , params2 , sizeof ( * params1 ) ) = = 0 ;
}
2016-01-19 21:35:35 +08:00
/**
* __intel_fbc_update - activate / deactivate FBC as needed , unlocked
* @ crtc : the CRTC that triggered the update
*
* This function completely reevaluates the status of FBC , then activates ,
* deactivates or maintains it on the same state .
*/
static void __intel_fbc_update ( struct intel_crtc * crtc )
{
struct drm_i915_private * dev_priv = crtc - > base . dev - > dev_private ;
2016-01-12 03:44:36 +08:00
struct intel_fbc * fbc = & dev_priv - > fbc ;
2015-12-24 04:28:11 +08:00
struct intel_fbc_reg_params old_params ;
2016-01-19 21:35:35 +08:00
2016-01-12 03:44:36 +08:00
WARN_ON ( ! mutex_is_locked ( & fbc - > lock ) ) ;
2016-01-19 21:35:35 +08:00
if ( ! multiple_pipes_ok ( dev_priv ) ) {
set_no_fbc_reason ( dev_priv , " more than one pipe active " ) ;
2014-12-09 00:09:10 +08:00
goto out_disable ;
}
2016-01-12 03:44:36 +08:00
if ( ! fbc - > enabled | | fbc - > crtc ! = crtc )
2016-01-19 21:35:35 +08:00
return ;
if ( ! intel_fbc_can_activate ( crtc ) )
goto out_disable ;
2016-01-12 03:44:36 +08:00
old_params = fbc - > params ;
intel_fbc_get_reg_params ( crtc , & fbc - > params ) ;
2015-12-24 04:28:11 +08:00
2014-12-09 00:09:10 +08:00
/* If the scanout has not changed, don't modify the FBC settings.
* Note that we make the fundamental assumption that the fb - > obj
* cannot be unpinned ( and have its GTT offset and fence revoked )
* without first being decoupled from the scanout and FBC disabled .
*/
2016-01-12 03:44:36 +08:00
if ( fbc - > active & &
intel_fbc_reg_params_equal ( & old_params , & fbc - > params ) )
2014-12-09 00:09:10 +08:00
return ;
2015-10-15 04:45:36 +08:00
if ( intel_fbc_is_active ( dev_priv ) ) {
2014-12-09 00:09:10 +08:00
/* We update FBC along two paths, after changing fb/crtc
* configuration ( modeswitching ) and after page - flipping
* finishes . For the latter , we know that not only did
* we disable the FBC at the start of the page - flip
* sequence , but also more than one vblank has passed .
*
* For the former case of modeswitching , it is possible
* to switch between two FBC valid configurations
* instantaneously so we do need to disable the FBC
* before we can modify its control registers . We also
* have to wait for the next vblank for that to take
* effect . However , since we delay enabling FBC we can
* assume that a vblank has passed since disabling and
* that we can safely alter the registers in the deferred
* callback .
*
* In the scenario that we go from a valid to invalid
* and then back to valid FBC configuration we have
* no strict enforcement that a vblank occurred since
* disabling the FBC . However , along all current pipe
* disabling paths we do need to wait for a vblank at
* some point . And we wait before enabling FBC anyway .
*/
2015-10-15 21:44:46 +08:00
DRM_DEBUG_KMS ( " deactivating FBC for update \n " ) ;
__intel_fbc_deactivate ( dev_priv ) ;
2014-12-09 00:09:10 +08:00
}
2015-10-15 04:45:36 +08:00
intel_fbc_schedule_activation ( crtc ) ;
2016-01-12 03:44:36 +08:00
fbc - > no_fbc_reason = " FBC enabled (not necessarily active) " ;
2014-12-09 00:09:10 +08:00
return ;
out_disable :
/* Multiple disables should be harmless */
2015-10-15 04:45:36 +08:00
if ( intel_fbc_is_active ( dev_priv ) ) {
2015-10-15 21:44:46 +08:00
DRM_DEBUG_KMS ( " unsupported config, deactivating FBC \n " ) ;
__intel_fbc_deactivate ( dev_priv ) ;
2014-12-09 00:09:10 +08:00
}
2015-07-03 06:25:10 +08:00
}
/*
2015-10-15 21:44:46 +08:00
* intel_fbc_update - activate / deactivate FBC as needed
2015-10-14 06:13:25 +08:00
* @ crtc : the CRTC that triggered the update
2015-07-03 06:25:10 +08:00
*
2015-10-15 21:44:46 +08:00
* This function reevaluates the overall state and activates or deactivates FBC .
2015-07-03 06:25:10 +08:00
*/
2015-10-14 06:13:25 +08:00
void intel_fbc_update ( struct intel_crtc * crtc )
2015-07-03 06:25:10 +08:00
{
2015-10-14 06:13:25 +08:00
struct drm_i915_private * dev_priv = crtc - > base . dev - > dev_private ;
2016-01-12 03:44:36 +08:00
struct intel_fbc * fbc = & dev_priv - > fbc ;
2015-10-14 06:13:25 +08:00
2015-09-23 23:52:27 +08:00
if ( ! fbc_supported ( dev_priv ) )
2015-07-04 02:40:54 +08:00
return ;
2016-01-12 03:44:36 +08:00
mutex_lock ( & fbc - > lock ) ;
2015-10-14 06:13:25 +08:00
__intel_fbc_update ( crtc ) ;
2016-01-12 03:44:36 +08:00
mutex_unlock ( & fbc - > lock ) ;
2014-12-09 00:09:10 +08:00
}
2015-02-14 03:23:46 +08:00
void intel_fbc_invalidate ( struct drm_i915_private * dev_priv ,
unsigned int frontbuffer_bits ,
enum fb_op_origin origin )
{
2016-01-12 03:44:36 +08:00
struct intel_fbc * fbc = & dev_priv - > fbc ;
2015-02-14 03:23:46 +08:00
unsigned int fbc_bits ;
2015-09-23 23:52:27 +08:00
if ( ! fbc_supported ( dev_priv ) )
2015-07-04 02:40:54 +08:00
return ;
2015-02-14 03:23:46 +08:00
if ( origin = = ORIGIN_GTT )
return ;
2016-01-12 03:44:36 +08:00
mutex_lock ( & fbc - > lock ) ;
2015-07-03 06:25:10 +08:00
2016-01-12 03:44:36 +08:00
if ( fbc - > enabled )
fbc_bits = INTEL_FRONTBUFFER_PRIMARY ( fbc - > crtc - > pipe ) ;
2015-02-14 03:23:46 +08:00
else
2016-01-12 03:44:36 +08:00
fbc_bits = fbc - > possible_framebuffer_bits ;
2015-02-14 03:23:46 +08:00
2016-01-12 03:44:36 +08:00
fbc - > busy_bits | = ( fbc_bits & frontbuffer_bits ) ;
2015-02-14 03:23:46 +08:00
2016-01-12 03:44:36 +08:00
if ( fbc - > busy_bits )
2015-10-15 21:44:46 +08:00
__intel_fbc_deactivate ( dev_priv ) ;
2015-07-03 06:25:10 +08:00
2016-01-12 03:44:36 +08:00
mutex_unlock ( & fbc - > lock ) ;
2015-02-14 03:23:46 +08:00
}
void intel_fbc_flush ( struct drm_i915_private * dev_priv ,
2015-07-15 03:29:10 +08:00
unsigned int frontbuffer_bits , enum fb_op_origin origin )
2015-02-14 03:23:46 +08:00
{
2016-01-12 03:44:36 +08:00
struct intel_fbc * fbc = & dev_priv - > fbc ;
2015-09-23 23:52:27 +08:00
if ( ! fbc_supported ( dev_priv ) )
2015-07-04 02:40:54 +08:00
return ;
2015-07-15 03:29:10 +08:00
if ( origin = = ORIGIN_GTT )
return ;
2015-07-03 06:25:10 +08:00
2016-01-12 03:44:36 +08:00
mutex_lock ( & fbc - > lock ) ;
2015-02-14 03:23:46 +08:00
2016-01-12 03:44:36 +08:00
fbc - > busy_bits & = ~ frontbuffer_bits ;
2015-02-14 03:23:46 +08:00
2016-01-12 03:44:36 +08:00
if ( ! fbc - > busy_bits & & fbc - > enabled ) {
if ( origin ! = ORIGIN_FLIP & & fbc - > active ) {
2015-11-12 00:46:22 +08:00
intel_fbc_recompress ( dev_priv ) ;
} else {
__intel_fbc_deactivate ( dev_priv ) ;
2016-01-12 03:44:36 +08:00
__intel_fbc_update ( fbc - > crtc ) ;
2015-11-12 00:46:22 +08:00
}
2015-07-15 03:29:10 +08:00
}
2015-07-03 06:25:10 +08:00
2016-01-12 03:44:36 +08:00
mutex_unlock ( & fbc - > lock ) ;
2015-02-14 03:23:46 +08:00
}
2015-10-15 21:44:46 +08:00
/**
* intel_fbc_enable : tries to enable FBC on the CRTC
* @ crtc : the CRTC
*
* This function checks if it ' s possible to enable FBC on the following CRTC ,
* then enables it . Notice that it doesn ' t activate FBC .
*/
void intel_fbc_enable ( struct intel_crtc * crtc )
{
struct drm_i915_private * dev_priv = crtc - > base . dev - > dev_private ;
2016-01-12 03:44:36 +08:00
struct intel_fbc * fbc = & dev_priv - > fbc ;
2015-10-15 21:44:46 +08:00
if ( ! fbc_supported ( dev_priv ) )
return ;
2016-01-12 03:44:36 +08:00
mutex_lock ( & fbc - > lock ) ;
2015-10-15 21:44:46 +08:00
2016-01-12 03:44:36 +08:00
if ( fbc - > enabled ) {
WARN_ON ( fbc - > crtc = = crtc ) ;
2015-10-15 21:44:46 +08:00
goto out ;
}
2016-01-12 03:44:36 +08:00
WARN_ON ( fbc - > active ) ;
WARN_ON ( fbc - > crtc ! = NULL ) ;
2015-10-15 21:44:46 +08:00
2016-01-19 21:35:36 +08:00
if ( ! intel_fbc_can_enable ( crtc ) )
2015-10-15 21:44:46 +08:00
goto out ;
drm/i915: alloc/free the FBC CFB during enable/disable
One of the problems with the current code is that it frees the CFB and
releases its drm_mm node as soon as we flip FBC's enable bit. This is
bad because after we disable FBC the hardware may still use the CFB
for the rest of the frame, so in theory we should only release the
drm_mm node one frame after we disable FBC. Otherwise, a stolen memory
allocation done right after an FBC disable may result in either
corrupted memory for the new owner of that memory region or corrupted
screen/underruns in case the new owner changes it while the hardware
is still reading it. This case is not exactly easy to reproduce since
we currently don't do a lot of stolen memory allocations, but I see
patches on the mailing list trying to expose stolen memory to user
space, so races will be possible.
I thought about three different approaches to solve this, and they all
have downsides.
The first approach would be to simply use multiple drm_mm nodes and
freeing the unused ones only after a frame has passed. The problem
with this approach is that since stolen memory is rather small,
there's a risk we just won't be able to allocate a new CFB from stolen
if the previous one was not freed yet. This could happen in case we
quickly disable FBC from pipe A and decide to enable it on pipe B, or
just if we change pipe A's fb stride while FBC is enabled.
The second approach would be similar to the first one, but maintaining
a single drm_mm node and keeping track of when it can be reused. This
would remove the disadvantage of not having enough space for two
nodes, but would create the new problem where we may not be able to
enable FBC at the point intel_fbc_update() is called, so we would have
to add more code to retry updating FBC after the time has passed. And
that can quickly get too complex since we can get invalidate, flush,
disable and other calls in the middle of the wait.
Both solutions above - and also the current code - have the problem
that we unnecessarily free+realloc FBC during invalidate+flush
operations even if the CFB size doesn't change.
The third option would be to move the allocation/deallocation to
enable/disable. This makes sure that the pipe is always disabled when
we allocate/deallocate the CFB, so there's no risk that the FBC
hardware may read or write to the memory right after it is freed from
drm_mm. The downside is that it is possible for user space to change
the buffer stride without triggering a disable/enable - only
deactivate/activate -, so we'll have to handle this case somehow - see
igt's kms_frontbuffer_tracking test, fbc-stridechange subtest. It
could be possible to implement a way to free+alloc the CFB during said
stride change, but it would involve a lot of book-keeping - exactly as
mentioned above - just for on case, so for now I'll keep it simple and
just deactivate FBC. Besides, we may not even need to disable FBC
since we do CFB over-allocation.
Note from Chris: "Starting a fullscreen client that covers a single
monitor in a multi-monitor setup will trigger a change in stride on
one of the CRTCs (the monitors will be flipped independently).". It
shouldn't be a huge problem if we lose FBC on multi-monitor setups
since these setups already have problems reaching deep PC states
anyway.
v2: Rebase after changing the patch order.
v3:
- Remove references to the stride change case being "uncommon" and
paste Chris' example.
- Rebase after a change in a previous patch.
Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk>
Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com>
Link: http://patchwork.freedesktop.org/patch/msgid/
2015-10-16 01:19:21 +08:00
if ( intel_fbc_alloc_cfb ( crtc ) ) {
set_no_fbc_reason ( dev_priv , " not enough stolen memory " ) ;
goto out ;
}
2015-10-15 21:44:46 +08:00
DRM_DEBUG_KMS ( " Enabling FBC on pipe %c \n " , pipe_name ( crtc - > pipe ) ) ;
2016-01-12 03:44:36 +08:00
fbc - > no_fbc_reason = " FBC enabled but not active yet \n " ;
2015-10-15 21:44:46 +08:00
2016-01-12 03:44:36 +08:00
fbc - > enabled = true ;
fbc - > crtc = crtc ;
2015-10-15 21:44:46 +08:00
out :
2016-01-12 03:44:36 +08:00
mutex_unlock ( & fbc - > lock ) ;
2015-10-15 21:44:46 +08:00
}
/**
* __intel_fbc_disable - disable FBC
* @ dev_priv : i915 device instance
*
* This is the low level function that actually disables FBC . Callers should
* grab the FBC lock .
*/
static void __intel_fbc_disable ( struct drm_i915_private * dev_priv )
{
2016-01-12 03:44:36 +08:00
struct intel_fbc * fbc = & dev_priv - > fbc ;
struct intel_crtc * crtc = fbc - > crtc ;
2015-10-15 21:44:46 +08:00
2016-01-12 03:44:36 +08:00
WARN_ON ( ! mutex_is_locked ( & fbc - > lock ) ) ;
WARN_ON ( ! fbc - > enabled ) ;
WARN_ON ( fbc - > active ) ;
2015-10-15 21:44:46 +08:00
assert_pipe_disabled ( dev_priv , crtc - > pipe ) ;
DRM_DEBUG_KMS ( " Disabling FBC on pipe %c \n " , pipe_name ( crtc - > pipe ) ) ;
drm/i915: alloc/free the FBC CFB during enable/disable
One of the problems with the current code is that it frees the CFB and
releases its drm_mm node as soon as we flip FBC's enable bit. This is
bad because after we disable FBC the hardware may still use the CFB
for the rest of the frame, so in theory we should only release the
drm_mm node one frame after we disable FBC. Otherwise, a stolen memory
allocation done right after an FBC disable may result in either
corrupted memory for the new owner of that memory region or corrupted
screen/underruns in case the new owner changes it while the hardware
is still reading it. This case is not exactly easy to reproduce since
we currently don't do a lot of stolen memory allocations, but I see
patches on the mailing list trying to expose stolen memory to user
space, so races will be possible.
I thought about three different approaches to solve this, and they all
have downsides.
The first approach would be to simply use multiple drm_mm nodes and
freeing the unused ones only after a frame has passed. The problem
with this approach is that since stolen memory is rather small,
there's a risk we just won't be able to allocate a new CFB from stolen
if the previous one was not freed yet. This could happen in case we
quickly disable FBC from pipe A and decide to enable it on pipe B, or
just if we change pipe A's fb stride while FBC is enabled.
The second approach would be similar to the first one, but maintaining
a single drm_mm node and keeping track of when it can be reused. This
would remove the disadvantage of not having enough space for two
nodes, but would create the new problem where we may not be able to
enable FBC at the point intel_fbc_update() is called, so we would have
to add more code to retry updating FBC after the time has passed. And
that can quickly get too complex since we can get invalidate, flush,
disable and other calls in the middle of the wait.
Both solutions above - and also the current code - have the problem
that we unnecessarily free+realloc FBC during invalidate+flush
operations even if the CFB size doesn't change.
The third option would be to move the allocation/deallocation to
enable/disable. This makes sure that the pipe is always disabled when
we allocate/deallocate the CFB, so there's no risk that the FBC
hardware may read or write to the memory right after it is freed from
drm_mm. The downside is that it is possible for user space to change
the buffer stride without triggering a disable/enable - only
deactivate/activate -, so we'll have to handle this case somehow - see
igt's kms_frontbuffer_tracking test, fbc-stridechange subtest. It
could be possible to implement a way to free+alloc the CFB during said
stride change, but it would involve a lot of book-keeping - exactly as
mentioned above - just for on case, so for now I'll keep it simple and
just deactivate FBC. Besides, we may not even need to disable FBC
since we do CFB over-allocation.
Note from Chris: "Starting a fullscreen client that covers a single
monitor in a multi-monitor setup will trigger a change in stride on
one of the CRTCs (the monitors will be flipped independently).". It
shouldn't be a huge problem if we lose FBC on multi-monitor setups
since these setups already have problems reaching deep PC states
anyway.
v2: Rebase after changing the patch order.
v3:
- Remove references to the stride change case being "uncommon" and
paste Chris' example.
- Rebase after a change in a previous patch.
Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk>
Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com>
Link: http://patchwork.freedesktop.org/patch/msgid/
2015-10-16 01:19:21 +08:00
__intel_fbc_cleanup_cfb ( dev_priv ) ;
2016-01-12 03:44:36 +08:00
fbc - > enabled = false ;
fbc - > crtc = NULL ;
2015-10-15 21:44:46 +08:00
}
/**
* intel_fbc_disable_crtc - disable FBC if it ' s associated with crtc
* @ crtc : the CRTC
*
* This function disables FBC if it ' s associated with the provided CRTC .
*/
void intel_fbc_disable_crtc ( struct intel_crtc * crtc )
{
struct drm_i915_private * dev_priv = crtc - > base . dev - > dev_private ;
2016-01-12 03:44:36 +08:00
struct intel_fbc * fbc = & dev_priv - > fbc ;
2015-10-15 21:44:46 +08:00
if ( ! fbc_supported ( dev_priv ) )
return ;
2016-01-12 03:44:36 +08:00
mutex_lock ( & fbc - > lock ) ;
if ( fbc - > crtc = = crtc ) {
WARN_ON ( ! fbc - > enabled ) ;
WARN_ON ( fbc - > active ) ;
2015-10-15 21:44:46 +08:00
__intel_fbc_disable ( dev_priv ) ;
}
2016-01-12 03:44:36 +08:00
mutex_unlock ( & fbc - > lock ) ;
2015-10-15 21:44:46 +08:00
}
/**
* intel_fbc_disable - globally disable FBC
* @ dev_priv : i915 device instance
*
* This function disables FBC regardless of which CRTC is associated with it .
*/
void intel_fbc_disable ( struct drm_i915_private * dev_priv )
{
2016-01-12 03:44:36 +08:00
struct intel_fbc * fbc = & dev_priv - > fbc ;
2015-10-15 21:44:46 +08:00
if ( ! fbc_supported ( dev_priv ) )
return ;
2016-01-12 03:44:36 +08:00
mutex_lock ( & fbc - > lock ) ;
if ( fbc - > enabled )
2015-10-15 21:44:46 +08:00
__intel_fbc_disable ( dev_priv ) ;
2016-01-12 03:44:36 +08:00
mutex_unlock ( & fbc - > lock ) ;
2015-10-15 21:44:46 +08:00
}
2014-12-08 22:46:31 +08:00
/**
* intel_fbc_init - Initialize FBC
* @ dev_priv : the i915 device
*
* This function might be called during PM init process .
*/
2014-12-09 00:09:10 +08:00
void intel_fbc_init ( struct drm_i915_private * dev_priv )
{
2016-01-12 03:44:36 +08:00
struct intel_fbc * fbc = & dev_priv - > fbc ;
2015-02-14 03:23:46 +08:00
enum pipe pipe ;
2016-01-12 03:44:36 +08:00
INIT_WORK ( & fbc - > work . work , intel_fbc_work_fn ) ;
mutex_init ( & fbc - > lock ) ;
fbc - > enabled = false ;
fbc - > active = false ;
fbc - > work . scheduled = false ;
2015-07-03 06:25:10 +08:00
2014-12-09 00:09:10 +08:00
if ( ! HAS_FBC ( dev_priv ) ) {
2016-01-12 03:44:36 +08:00
fbc - > no_fbc_reason = " unsupported by this chipset " ;
2014-12-09 00:09:10 +08:00
return ;
}
2015-02-14 03:23:46 +08:00
for_each_pipe ( dev_priv , pipe ) {
2016-01-12 03:44:36 +08:00
fbc - > possible_framebuffer_bits | =
2015-02-14 03:23:46 +08:00
INTEL_FRONTBUFFER_PRIMARY ( pipe ) ;
2015-11-05 03:10:46 +08:00
if ( fbc_on_pipe_a_only ( dev_priv ) )
2015-02-14 03:23:46 +08:00
break ;
}
2014-12-09 00:09:10 +08:00
if ( INTEL_INFO ( dev_priv ) - > gen > = 7 ) {
2016-01-12 03:44:36 +08:00
fbc - > is_active = ilk_fbc_is_active ;
fbc - > activate = gen7_fbc_activate ;
fbc - > deactivate = ilk_fbc_deactivate ;
2014-12-09 00:09:10 +08:00
} else if ( INTEL_INFO ( dev_priv ) - > gen > = 5 ) {
2016-01-12 03:44:36 +08:00
fbc - > is_active = ilk_fbc_is_active ;
fbc - > activate = ilk_fbc_activate ;
fbc - > deactivate = ilk_fbc_deactivate ;
2014-12-09 00:09:10 +08:00
} else if ( IS_GM45 ( dev_priv ) ) {
2016-01-12 03:44:36 +08:00
fbc - > is_active = g4x_fbc_is_active ;
fbc - > activate = g4x_fbc_activate ;
fbc - > deactivate = g4x_fbc_deactivate ;
2014-12-09 00:09:10 +08:00
} else {
2016-01-12 03:44:36 +08:00
fbc - > is_active = i8xx_fbc_is_active ;
fbc - > activate = i8xx_fbc_activate ;
fbc - > deactivate = i8xx_fbc_deactivate ;
2014-12-09 00:09:10 +08:00
/* This value was pulled out of someone's hat */
I915_WRITE ( FBC_CONTROL , 500 < < FBC_CTL_INTERVAL_SHIFT ) ;
}
2015-11-05 03:10:52 +08:00
/* We still don't have any sort of hardware state readout for FBC, so
2015-10-15 04:45:36 +08:00
* deactivate it in case the BIOS activated it to make sure software
* matches the hardware state . */
2016-01-12 03:44:36 +08:00
if ( fbc - > is_active ( dev_priv ) )
fbc - > deactivate ( dev_priv ) ;
2014-12-09 00:09:10 +08:00
}