lockdep-splat.txt 4.8 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110
  1. Lockdep-RCU was added to the Linux kernel in early 2010
  2. (http://lwn.net/Articles/371986/). This facility checks for some common
  3. misuses of the RCU API, most notably using one of the rcu_dereference()
  4. family to access an RCU-protected pointer without the proper protection.
  5. When such misuse is detected, an lockdep-RCU splat is emitted.
  6. The usual cause of a lockdep-RCU slat is someone accessing an
  7. RCU-protected data structure without either (1) being in the right kind of
  8. RCU read-side critical section or (2) holding the right update-side lock.
  9. This problem can therefore be serious: it might result in random memory
  10. overwriting or worse. There can of course be false positives, this
  11. being the real world and all that.
  12. So let's look at an example RCU lockdep splat from 3.0-rc5, one that
  13. has long since been fixed:
  14. ===============================
  15. [ INFO: suspicious RCU usage. ]
  16. -------------------------------
  17. block/cfq-iosched.c:2776 suspicious rcu_dereference_protected() usage!
  18. other info that might help us debug this:
  19. rcu_scheduler_active = 1, debug_locks = 0
  20. 3 locks held by scsi_scan_6/1552:
  21. #0: (&shost->scan_mutex){+.+.+.}, at: [<ffffffff8145efca>]
  22. scsi_scan_host_selected+0x5a/0x150
  23. #1: (&eq->sysfs_lock){+.+...}, at: [<ffffffff812a5032>]
  24. elevator_exit+0x22/0x60
  25. #2: (&(&q->__queue_lock)->rlock){-.-...}, at: [<ffffffff812b6233>]
  26. cfq_exit_queue+0x43/0x190
  27. stack backtrace:
  28. Pid: 1552, comm: scsi_scan_6 Not tainted 3.0.0-rc5 #17
  29. Call Trace:
  30. [<ffffffff810abb9b>] lockdep_rcu_dereference+0xbb/0xc0
  31. [<ffffffff812b6139>] __cfq_exit_single_io_context+0xe9/0x120
  32. [<ffffffff812b626c>] cfq_exit_queue+0x7c/0x190
  33. [<ffffffff812a5046>] elevator_exit+0x36/0x60
  34. [<ffffffff812a802a>] blk_cleanup_queue+0x4a/0x60
  35. [<ffffffff8145cc09>] scsi_free_queue+0x9/0x10
  36. [<ffffffff81460944>] __scsi_remove_device+0x84/0xd0
  37. [<ffffffff8145dca3>] scsi_probe_and_add_lun+0x353/0xb10
  38. [<ffffffff817da069>] ? error_exit+0x29/0xb0
  39. [<ffffffff817d98ed>] ? _raw_spin_unlock_irqrestore+0x3d/0x80
  40. [<ffffffff8145e722>] __scsi_scan_target+0x112/0x680
  41. [<ffffffff812c690d>] ? trace_hardirqs_off_thunk+0x3a/0x3c
  42. [<ffffffff817da069>] ? error_exit+0x29/0xb0
  43. [<ffffffff812bcc60>] ? kobject_del+0x40/0x40
  44. [<ffffffff8145ed16>] scsi_scan_channel+0x86/0xb0
  45. [<ffffffff8145f0b0>] scsi_scan_host_selected+0x140/0x150
  46. [<ffffffff8145f149>] do_scsi_scan_host+0x89/0x90
  47. [<ffffffff8145f170>] do_scan_async+0x20/0x160
  48. [<ffffffff8145f150>] ? do_scsi_scan_host+0x90/0x90
  49. [<ffffffff810975b6>] kthread+0xa6/0xb0
  50. [<ffffffff817db154>] kernel_thread_helper+0x4/0x10
  51. [<ffffffff81066430>] ? finish_task_switch+0x80/0x110
  52. [<ffffffff817d9c04>] ? retint_restore_args+0xe/0xe
  53. [<ffffffff81097510>] ? __init_kthread_worker+0x70/0x70
  54. [<ffffffff817db150>] ? gs_change+0xb/0xb
  55. Line 2776 of block/cfq-iosched.c in v3.0-rc5 is as follows:
  56. if (rcu_dereference(ioc->ioc_data) == cic) {
  57. This form says that it must be in a plain vanilla RCU read-side critical
  58. section, but the "other info" list above shows that this is not the
  59. case. Instead, we hold three locks, one of which might be RCU related.
  60. And maybe that lock really does protect this reference. If so, the fix
  61. is to inform RCU, perhaps by changing __cfq_exit_single_io_context() to
  62. take the struct request_queue "q" from cfq_exit_queue() as an argument,
  63. which would permit us to invoke rcu_dereference_protected as follows:
  64. if (rcu_dereference_protected(ioc->ioc_data,
  65. lockdep_is_held(&q->queue_lock)) == cic) {
  66. With this change, there would be no lockdep-RCU splat emitted if this
  67. code was invoked either from within an RCU read-side critical section
  68. or with the ->queue_lock held. In particular, this would have suppressed
  69. the above lockdep-RCU splat because ->queue_lock is held (see #2 in the
  70. list above).
  71. On the other hand, perhaps we really do need an RCU read-side critical
  72. section. In this case, the critical section must span the use of the
  73. return value from rcu_dereference(), or at least until there is some
  74. reference count incremented or some such. One way to handle this is to
  75. add rcu_read_lock() and rcu_read_unlock() as follows:
  76. rcu_read_lock();
  77. if (rcu_dereference(ioc->ioc_data) == cic) {
  78. spin_lock(&ioc->lock);
  79. rcu_assign_pointer(ioc->ioc_data, NULL);
  80. spin_unlock(&ioc->lock);
  81. }
  82. rcu_read_unlock();
  83. With this change, the rcu_dereference() is always within an RCU
  84. read-side critical section, which again would have suppressed the
  85. above lockdep-RCU splat.
  86. But in this particular case, we don't actually deference the pointer
  87. returned from rcu_dereference(). Instead, that pointer is just compared
  88. to the cic pointer, which means that the rcu_dereference() can be replaced
  89. by rcu_access_pointer() as follows:
  90. if (rcu_access_pointer(ioc->ioc_data) == cic) {
  91. Because it is legal to invoke rcu_access_pointer() without protection,
  92. this change would also suppress the above lockdep-RCU splat.