selection-api.xml 13 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324
  1. <section id="selection-api">
  2. <title>Experimental API for cropping, composing and scaling</title>
  3. <note>
  4. <title>Experimental</title>
  5. <para>This is an <link linkend="experimental">experimental</link>
  6. interface and may change in the future.</para>
  7. </note>
  8. <section>
  9. <title>Introduction</title>
  10. <para>Some video capture devices can sample a subsection of a picture and
  11. shrink or enlarge it to an image of arbitrary size. Next, the devices can
  12. insert the image into larger one. Some video output devices can crop part of an
  13. input image, scale it up or down and insert it at an arbitrary scan line and
  14. horizontal offset into a video signal. We call these abilities cropping,
  15. scaling and composing.</para>
  16. <para>On a video <emphasis>capture</emphasis> device the source is a video
  17. signal, and the cropping target determine the area actually sampled. The sink
  18. is an image stored in a memory buffer. The composing area specifies which part
  19. of the buffer is actually written to by the hardware. </para>
  20. <para>On a video <emphasis>output</emphasis> device the source is an image in a
  21. memory buffer, and the cropping target is a part of an image to be shown on a
  22. display. The sink is the display or the graphics screen. The application may
  23. select the part of display where the image should be displayed. The size and
  24. position of such a window is controlled by the compose target.</para>
  25. <para>Rectangles for all cropping and composing targets are defined even if the
  26. device does supports neither cropping nor composing. Their size and position
  27. will be fixed in such a case. If the device does not support scaling then the
  28. cropping and composing rectangles have the same size.</para>
  29. </section>
  30. <section>
  31. <title>Selection targets</title>
  32. <para>
  33. <figure id="sel-targets-capture">
  34. <title>Cropping and composing targets</title>
  35. <mediaobject>
  36. <imageobject>
  37. <imagedata fileref="selection.png" format="PNG" />
  38. </imageobject>
  39. <textobject>
  40. <phrase>Targets used by a cropping, composing and scaling
  41. process</phrase>
  42. </textobject>
  43. </mediaobject>
  44. </figure>
  45. </para>
  46. <para>See <xref linkend="v4l2-selection-targets" /> for more
  47. information.</para>
  48. </section>
  49. <section>
  50. <title>Configuration</title>
  51. <para>Applications can use the <link linkend="vidioc-g-selection">selection
  52. API</link> to select an area in a video signal or a buffer, and to query for
  53. default settings and hardware limits.</para>
  54. <para>Video hardware can have various cropping, composing and scaling
  55. limitations. It may only scale up or down, support only discrete scaling
  56. factors, or have different scaling abilities in the horizontal and vertical
  57. directions. Also it may not support scaling at all. At the same time the
  58. cropping/composing rectangles may have to be aligned, and both the source and
  59. the sink may have arbitrary upper and lower size limits. Therefore, as usual,
  60. drivers are expected to adjust the requested parameters and return the actual
  61. values selected. An application can control the rounding behaviour using <link
  62. linkend="v4l2-selection-flags"> constraint flags </link>.</para>
  63. <section>
  64. <title>Configuration of video capture</title>
  65. <para>See figure <xref linkend="sel-targets-capture" /> for examples of the
  66. selection targets available for a video capture device. It is recommended to
  67. configure the cropping targets before to the composing targets.</para>
  68. <para>The range of coordinates of the top left corner, width and height of
  69. areas that can be sampled is given by the <constant>V4L2_SEL_TGT_CROP_BOUNDS</constant>
  70. target. It is recommended for the driver developers to put the
  71. top/left corner at position <constant>(0,0)</constant>. The rectangle's
  72. coordinates are expressed in pixels.</para>
  73. <para>The top left corner, width and height of the source rectangle, that is
  74. the area actually sampled, is given by the <constant>V4L2_SEL_TGT_CROP</constant>
  75. target. It uses the same coordinate system as <constant>V4L2_SEL_TGT_CROP_BOUNDS</constant>.
  76. The active cropping area must lie completely inside the capture boundaries. The
  77. driver may further adjust the requested size and/or position according to hardware
  78. limitations.</para>
  79. <para>Each capture device has a default source rectangle, given by the
  80. <constant>V4L2_SEL_TGT_CROP_DEFAULT</constant> target. This rectangle shall
  81. over what the driver writer considers the complete picture. Drivers shall set
  82. the active crop rectangle to the default when the driver is first loaded, but
  83. not later.</para>
  84. <para>The composing targets refer to a memory buffer. The limits of composing
  85. coordinates are obtained using <constant>V4L2_SEL_TGT_COMPOSE_BOUNDS</constant>.
  86. All coordinates are expressed in pixels. The rectangle's top/left
  87. corner must be located at position <constant>(0,0)</constant>. The width and
  88. height are equal to the image size set by <constant>VIDIOC_S_FMT</constant>.
  89. </para>
  90. <para>The part of a buffer into which the image is inserted by the hardware is
  91. controlled by the <constant>V4L2_SEL_TGT_COMPOSE</constant> target.
  92. The rectangle's coordinates are also expressed in the same coordinate system as
  93. the bounds rectangle. The composing rectangle must lie completely inside bounds
  94. rectangle. The driver must adjust the composing rectangle to fit to the
  95. bounding limits. Moreover, the driver can perform other adjustments according
  96. to hardware limitations. The application can control rounding behaviour using
  97. <link linkend="v4l2-selection-flags"> constraint flags</link>.</para>
  98. <para>For capture devices the default composing rectangle is queried using
  99. <constant>V4L2_SEL_TGT_COMPOSE_DEFAULT</constant>. It is usually equal to the
  100. bounding rectangle.</para>
  101. <para>The part of a buffer that is modified by the hardware is given by
  102. <constant>V4L2_SEL_TGT_COMPOSE_PADDED</constant>. It contains all pixels
  103. defined using <constant>V4L2_SEL_TGT_COMPOSE</constant> plus all
  104. padding data modified by hardware during insertion process. All pixels outside
  105. this rectangle <emphasis>must not</emphasis> be changed by the hardware. The
  106. content of pixels that lie inside the padded area but outside active area is
  107. undefined. The application can use the padded and active rectangles to detect
  108. where the rubbish pixels are located and remove them if needed.</para>
  109. </section>
  110. <section>
  111. <title>Configuration of video output</title>
  112. <para>For output devices targets and ioctls are used similarly to the video
  113. capture case. The <emphasis>composing</emphasis> rectangle refers to the
  114. insertion of an image into a video signal. The cropping rectangles refer to a
  115. memory buffer. It is recommended to configure the composing targets before to
  116. the cropping targets.</para>
  117. <para>The cropping targets refer to the memory buffer that contains an image to
  118. be inserted into a video signal or graphical screen. The limits of cropping
  119. coordinates are obtained using <constant>V4L2_SEL_TGT_CROP_BOUNDS</constant>.
  120. All coordinates are expressed in pixels. The top/left corner is always point
  121. <constant>(0,0)</constant>. The width and height is equal to the image size
  122. specified using <constant>VIDIOC_S_FMT</constant> ioctl.</para>
  123. <para>The top left corner, width and height of the source rectangle, that is
  124. the area from which image date are processed by the hardware, is given by the
  125. <constant>V4L2_SEL_TGT_CROP</constant>. Its coordinates are expressed
  126. in in the same coordinate system as the bounds rectangle. The active cropping
  127. area must lie completely inside the crop boundaries and the driver may further
  128. adjust the requested size and/or position according to hardware
  129. limitations.</para>
  130. <para>For output devices the default cropping rectangle is queried using
  131. <constant>V4L2_SEL_TGT_CROP_DEFAULT</constant>. It is usually equal to the
  132. bounding rectangle.</para>
  133. <para>The part of a video signal or graphics display where the image is
  134. inserted by the hardware is controlled by <constant>V4L2_SEL_TGT_COMPOSE</constant>
  135. target. The rectangle's coordinates are expressed in pixels. The composing
  136. rectangle must lie completely inside the bounds rectangle. The driver must
  137. adjust the area to fit to the bounding limits. Moreover, the driver can
  138. perform other adjustments according to hardware limitations.</para>
  139. <para>The device has a default composing rectangle, given by the
  140. <constant>V4L2_SEL_TGT_COMPOSE_DEFAULT</constant> target. This rectangle shall cover what
  141. the driver writer considers the complete picture. It is recommended for the
  142. driver developers to put the top/left corner at position <constant>(0,0)</constant>.
  143. Drivers shall set the active composing rectangle to the default
  144. one when the driver is first loaded.</para>
  145. <para>The devices may introduce additional content to video signal other than
  146. an image from memory buffers. It includes borders around an image. However,
  147. such a padded area is driver-dependent feature not covered by this document.
  148. Driver developers are encouraged to keep padded rectangle equal to active one.
  149. The padded target is accessed by the <constant>V4L2_SEL_TGT_COMPOSE_PADDED</constant>
  150. identifier. It must contain all pixels from the <constant>V4L2_SEL_TGT_COMPOSE</constant>
  151. target.</para>
  152. </section>
  153. <section>
  154. <title>Scaling control</title>
  155. <para>An application can detect if scaling is performed by comparing the width
  156. and the height of rectangles obtained using <constant>V4L2_SEL_TGT_CROP</constant>
  157. and <constant>V4L2_SEL_TGT_COMPOSE</constant> targets. If
  158. these are not equal then the scaling is applied. The application can compute
  159. the scaling ratios using these values.</para>
  160. </section>
  161. </section>
  162. <section>
  163. <title>Comparison with old cropping API</title>
  164. <para>The selection API was introduced to cope with deficiencies of previous
  165. <link linkend="crop"> API</link>, that was designed to control simple capture
  166. devices. Later the cropping API was adopted by video output drivers. The ioctls
  167. are used to select a part of the display were the video signal is inserted. It
  168. should be considered as an API abuse because the described operation is
  169. actually the composing. The selection API makes a clear distinction between
  170. composing and cropping operations by setting the appropriate targets. The V4L2
  171. API lacks any support for composing to and cropping from an image inside a
  172. memory buffer. The application could configure a capture device to fill only a
  173. part of an image by abusing V4L2 API. Cropping a smaller image from a larger
  174. one is achieved by setting the field
  175. &v4l2-pix-format;<structfield>::bytesperline</structfield>. Introducing an image offsets
  176. could be done by modifying field &v4l2-buffer;<structfield>::m_userptr</structfield>
  177. before calling <constant>VIDIOC_QBUF</constant>. Those
  178. operations should be avoided because they are not portable (endianness), and do
  179. not work for macroblock and Bayer formats and mmap buffers. The selection API
  180. deals with configuration of buffer cropping/composing in a clear, intuitive and
  181. portable way. Next, with the selection API the concepts of the padded target
  182. and constraints flags are introduced. Finally, &v4l2-crop; and &v4l2-cropcap;
  183. have no reserved fields. Therefore there is no way to extend their functionality.
  184. The new &v4l2-selection; provides a lot of place for future
  185. extensions. Driver developers are encouraged to implement only selection API.
  186. The former cropping API would be simulated using the new one.</para>
  187. </section>
  188. <section>
  189. <title>Examples</title>
  190. <example>
  191. <title>Resetting the cropping parameters</title>
  192. <para>(A video capture device is assumed; change
  193. <constant>V4L2_BUF_TYPE_VIDEO_CAPTURE</constant> for other devices; change target to
  194. <constant>V4L2_SEL_TGT_COMPOSE_*</constant> family to configure composing
  195. area)</para>
  196. <programlisting>
  197. &v4l2-selection; sel = {
  198. .type = V4L2_BUF_TYPE_VIDEO_CAPTURE,
  199. .target = V4L2_SEL_TGT_CROP_DEFAULT,
  200. };
  201. ret = ioctl(fd, &VIDIOC-G-SELECTION;, &amp;sel);
  202. if (ret)
  203. exit(-1);
  204. sel.target = V4L2_SEL_TGT_CROP;
  205. ret = ioctl(fd, &VIDIOC-S-SELECTION;, &amp;sel);
  206. if (ret)
  207. exit(-1);
  208. </programlisting>
  209. </example>
  210. <example>
  211. <title>Simple downscaling</title>
  212. <para>Setting a composing area on output of size of <emphasis> at most
  213. </emphasis> half of limit placed at a center of a display.</para>
  214. <programlisting>
  215. &v4l2-selection; sel = {
  216. .type = V4L2_BUF_TYPE_VIDEO_OUTPUT,
  217. .target = V4L2_SEL_TGT_COMPOSE_BOUNDS,
  218. };
  219. struct v4l2_rect r;
  220. ret = ioctl(fd, &VIDIOC-G-SELECTION;, &amp;sel);
  221. if (ret)
  222. exit(-1);
  223. /* setting smaller compose rectangle */
  224. r.width = sel.r.width / 2;
  225. r.height = sel.r.height / 2;
  226. r.left = sel.r.width / 4;
  227. r.top = sel.r.height / 4;
  228. sel.r = r;
  229. sel.target = V4L2_SEL_TGT_COMPOSE;
  230. sel.flags = V4L2_SEL_FLAG_LE;
  231. ret = ioctl(fd, &VIDIOC-S-SELECTION;, &amp;sel);
  232. if (ret)
  233. exit(-1);
  234. </programlisting>
  235. </example>
  236. <example>
  237. <title>Querying for scaling factors</title>
  238. <para>A video output device is assumed; change
  239. <constant>V4L2_BUF_TYPE_VIDEO_OUTPUT</constant> for other devices</para>
  240. <programlisting>
  241. &v4l2-selection; compose = {
  242. .type = V4L2_BUF_TYPE_VIDEO_OUTPUT,
  243. .target = V4L2_SEL_TGT_COMPOSE,
  244. };
  245. &v4l2-selection; crop = {
  246. .type = V4L2_BUF_TYPE_VIDEO_OUTPUT,
  247. .target = V4L2_SEL_TGT_CROP,
  248. };
  249. double hscale, vscale;
  250. ret = ioctl(fd, &VIDIOC-G-SELECTION;, &amp;compose);
  251. if (ret)
  252. exit(-1);
  253. ret = ioctl(fd, &VIDIOC-G-SELECTION;, &amp;crop);
  254. if (ret)
  255. exit(-1);
  256. /* computing scaling factors */
  257. hscale = (double)compose.r.width / crop.r.width;
  258. vscale = (double)compose.r.height / crop.r.height;
  259. </programlisting>
  260. </example>
  261. </section>
  262. </section>