
Image quality benchmark of computational bokeh

Wolf Hauser, Balthazar Neveu, Jean-Benoit Jourdain, Clément Viard, Frédéric Guichard
DxOMark Labs, 3 rue Nationale, 92100 Boulogne-Billancourt, France

Abstract

Smartphone cameras have progressed a lot during recent years and even caught up with entry-level DSLR cameras in many standard situations. One domain where the difference remained obvious was portrait photography. Now smartphone manufacturers equip their flagship models with special modes where they computationally simulate shallow depth of field. We propose a method to quantitatively evaluate the quality of such computational bokeh in a reproducible way, focusing on both the quality of the bokeh (depth of field, shape), as well as on artifacts brought by the challenge to accurately differentiate the face of a subject from the background, especially on complex transitions such as curly hairs. Depth of field simulation is a complex topic and standard metrics for out-of-focus blur do not currently exist. The proposed method is based on perceptual, systematic analysis of pictures shot in our lab. We show that the depth of field of the best mobile devices is as shallow as that of DSLRs, but also find processing artifacts that are inexistent on DSLRs. Our primary goal is to help customers comparing smartphone cameras among each other and to DSLRs. We also hope that our method will guide smartphone makers in their developments and will thereby contribute to advancing mobile photography.

Keywords: image quality evaluation, benchmark, portrait photography, depth of field, bokeh, smartphone.

Introduction

Even some professional photographers and photojournalists use smartphone cameras for documentary and street photography. In these disciplines, the quality of recent high-end smartphones is—sometimes—indistinguishable from that of DSLRs. In portrait photography, however, even novices can immediately distinguish a smartphone picture and one taken with a DSLR. Figure 1 shows such a comparison.

Figure 1. Portraits taken with a consumer DSLR (a) and a smartphone (b–d). Top: large field of view and depth of field cause the background to appear very detailed in the smartphone picture, drawing attention away from the subject. Bottom: the telephoto camera (c) improves perspective, but the background remains sharp. Computational bokeh (d) allows the smartphone to deliver a result rather close to that of the DSLR.

There are two major differences between the smartphone and the DSLR used for taking the pictures in figure 1: perspective and depth of field. We will look at these phenomena in more detail in the following subsection and recall how they are related to the difference in sensor size between the smartphone and the DSLR. We will then have a short look at how smartphone manufacturers attempt to overcome these limits by narrowing the depth of field computationally.

Given users' interest in image quality and the strong competition among manufacturers, it seems obvious that smartphone image quality evaluation should benchmark these attempts. In the rest of this paper we propose a method and laboratory setup for evaluating the quality of shallow depth of field simulations. The proposed method is used as part of our DxOMark Mobile test protocol. We start by presenting related work and go on to describing our proposal and the results we obtain.

Perspective, depth of field and why size matters

Perspective is how three-dimensional objects appear in two-dimensional images and it depends on the viewing point. In figure 1, the photographer framed the subject so that its face covers 1/3 of the image width in portrait orientation. For this, he had to stand at approximately 6 ft from the subject for the DSLR and at only 2 ft for the smartphone.

The relationship between subject distance and framing is shown in figure 2. From intersecting lines, it follows s = s′h/h′ for the subject distance, where s′ is the image distance and h and h′ are the object and image heights, respectively.



Figure 2. Focal length f and sensor height H define the subject distance s required for a certain framing, i.e. for a certain ratio between image height h′ and sensor height H.

Figure 3. The maximum acceptable circle of confusion C defines the nearest and farthest points in the scene that appear sharp on the image.

For portrait distances, the image distance is close to the focal length f and we can suppose s′ ≈ f. In our example, we wanted h′ to be 1/3 of the sensor height H. It follows that s ≈ 3hf/H. The object height is fixed and the same for both cameras. The ratio between focal length and sensor height, however, is different: 50 mm : 15 mm for the DSLR, only 4 mm : 3.6 mm for the smartphone. Photographers are used to converting all focal lengths to their equivalent in traditional 35 mm format (H = 24 mm)¹. This yields 80 mm for the DSLR and 27 mm for the smartphone—which corresponds precisely to the factor of three between the two subject distances.

¹Not only the sensor size changes between DSLRs and smartphones, but also the aspect ratio, which is 3:2 for most DSLRs and 4:3 for most smartphones. Most textbooks use the sensor diagonal for calculating the equivalent focal length. For the sake of symmetry, we decided to crop the DSLR image in our example in figure 1 so that it becomes 4:3. As a consequence, we only consider the sensor height H (i.e. the shorter sensor dimension) for both framing and computing the equivalent focal lengths.

There are several reasons why smartphones use moderate wide-angle lenses. Since they do not provide optical zoom, they have to provide a versatile field of view that fits the majority of situations. Moderate wide-angle is also a sweet spot for lens designers and allows the best trade-off between sensor size and smartphone thickness. Longer equivalent focal lengths either require f to increase (which makes the phone thicker) or H to decrease (which makes the camera capture less light).

For portrait photography, however, moderate wide-angle is not ideal: it causes the face to be a bit distorted ("big nose effect") and scales down the surroundings compared to the subject, often leading to agitated backgrounds that draw attention away from the subject.

Depth of field is the distance between the closest and farthest objects that appear sharp in the image. Supposing that the subject's eyes are in focus, we obtain the configuration shown in figure 3.

Light rays coming from points in the subject plane converge right in the sensor plane. Light rays coming from points in front of or behind the subject converge behind or in front of the sensor, respectively, and form blur spots on the image—also called circles of confusion c. For a point to appear sharp in the image, the size of its blur spot must not exceed C, the maximum acceptable circle of confusion [1]. According to figure 3, these points lie between s_F and s_N. From similar triangles, it follows that

$$\frac{s'_N - s'}{s'_N} = \frac{s' - s'_F}{s'_F} = \frac{C}{D}$$

where D is the aperture diameter. Utilizing the thin lens equation 1/s + 1/s′ = 1/f and solving for s_N and s_F, respectively, we obtain

$$s_N = \frac{sDf}{f(D - C) + sC} \quad \text{and} \quad s_F = \frac{sDf}{f(D + C) - sC}. \quad (1)$$

The circle of confusion being smaller than the aperture by several orders of magnitude, D − C ≈ D + C ≈ D and we obtain for the depth of field

$$\mathrm{DOF} = s_F - s_N = \frac{2CDfs^2}{D^2 f^2 - C^2 s^2}.$$

For portrait photography, where the subject is much closer than infinity, we can make a second approximation and suppose that C²s² is negligible against D²f². This finally yields

$$\mathrm{DOF}_p \approx \frac{2Cs^2}{Df} = \frac{2Cs^2 N}{f^2}$$

where N = f/D is the f-number of the lens.

We still have to determine C, which, depending on the camera and the viewing conditions, can be limited either by diffraction, sensor resolution, or the resolution of the human eye.

The influence of diffraction can be observed by computing the radius of the Airy disk, which is approximately r = 1.22λN [1]. Putting λ = 550 nm (green light) and N = f/1.8, we obtain r = 1.2 µm. This is on par with the pixel size of the smartphone and far below that of the DSLR. A typical smartphone camera has a resolution of 12 Mpx and can therefore resolve details down to 1/5000 of its diagonal. Given that the resolution of the human eye is approximately 1 arcmin [2], this corresponds to looking at an 8 × 12″ print from a distance of only 10 inches. We assume that the majority of people do not look at their images so closely. As a consequence, in practice, C is neither limited by diffraction nor by sensor resolution, but by the viewing conditions. According to [3], the camera industry uses C = 1/1500 of the sensor diagonal for computing the depth of field scales imprinted on lenses. This corresponds to looking at a 4 × 6″ print from a distance of approximately 16 inches. Whatever the exact value, the important point is that diffraction and sensor resolution can be neglected. And supposing that the viewing conditions are the same for both smartphone and DSLR, C is always the same fraction of the respective sensor height.

The depth of field indicates whether or not a point in the scene appears sharp in the image. But it tells little about "how blurry" the background is, as this not only depends on the depth of field, but also on the distance b between the subject and the background. And this distance typically is fixed and—unlike the subject distance—not adjusted in function of the equivalent focal length. A point at distance b behind the subject will appear as a blur spot in the image. The ratio ρ between the diameter of its circle of confusion c_b and the image height H is a measure for the blurriness of the background. According to figure 3 and by solving equation (1) for the circle of confusion, we obtain

$$\rho = \frac{c_b}{H} = \frac{f^2 b}{NH(s - f)(s + b)} \approx \frac{f^2 b}{NHs(s + b)}$$

because f ≪ s for portrait photography. For the particular case where b → ∞, this simplifies to

$$\rho_\infty = \frac{f^2}{NHs}.$$

Back to the comparison between DSLR and smartphone. Suppose that, compared to the former, the latter has an image sensor that is α times smaller and an equivalent focal length that is β times shorter. Then

$$C_\text{phone} = \frac{C_\text{DSLR}}{\alpha}, \quad s_\text{phone} = \frac{s_\text{DSLR}}{\beta}, \quad f_\text{phone} = \frac{f_\text{DSLR}}{\alpha\beta}$$

and finally

$$\mathrm{DOF}_{p,\text{phone}} = \alpha\,\frac{N_\text{phone}}{N_\text{DSLR}}\,\mathrm{DOF}_{p,\text{DSLR}} \quad (2)$$

$$\rho_{\infty,\text{phone}} = \frac{1}{\alpha\beta}\,\frac{N_\text{DSLR}}{N_\text{phone}}\,\rho_{\infty,\text{DSLR}}. \quad (3)$$

Note that β has disappeared from the depth of field comparison—equivalent focal length has no impact on the depth of field, provided we adapt the subject distance s to obtain the same framing. It does, however, have an impact on the blur spot at infinity: the shorter the equivalent focal length, the smaller the blur spots. The sensor size impacts both and is the main reason for the difference between the two systems. In figure 1 (a) and (b), both cameras use the same f-number f/1.8. But because the smartphone sensor is only 3.6 mm high, compared to 15 mm for the DSLR (α ≈ 4), the smartphone has four times the depth of field of the DSLR. Combined with the difference in equivalent focal length, its background blur spots are, compared to the image height, approximately twelve times smaller than those of the DSLR. This is why the background appears so sharp, drawing even more attention away from the subject.

Just as the equivalent focal length f̃ = 24 mm/H × f allows to compare the field of view independently of the sensor size, one can define the equivalent aperture Ñ = 24 mm/H × N to compare the depth of field independently of the sensor size. The smartphone used to shoot figure 1 (b) has f̃ = 27 mm and Ñ = f/12. Seeing these figures, any photographer familiar with the 35 mm format understands immediately that this lens is meant for landscape and not for portrait.

Bokeh originally refers to the aesthetic quality of blur in the out-of-focus parts of an image, especially out-of-focus points of light. Unlike depth of field, it has no precise definition. Aperture blades, optical vignetting, spherical and chromatic aberrations all have influence on the character of out-of-focus points of light [4]. In recent years, the term has become synonym for out-of-focus blur and commonly includes the size of out-of-focus points of light. Larger ρ means stronger bokeh.
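To make these formulas concrete, the following small Python sketch (our own illustration, not part of the test protocol) evaluates the portrait depth of field and the background blur ratio for the two example cameras of figure 1, assuming C = H/1500 for both devices:

```python
# Numeric sketch of the formulas derived above (our illustration).
# All lengths are in meters; C is taken as the same fraction of the
# sensor height for both cameras, as argued in the text.

def dof_portrait(C, s, N, f):
    """DOF_p ~ 2*C*s^2*N / f^2 (portrait-distance approximation)."""
    return 2 * C * s**2 * N / f**2

def rho(b, f, N, H, s):
    """Relative blur spot size of a point at distance b behind the subject."""
    return f**2 * b / (N * H * s * (s + b))

def rho_inf(f, N, H, s):
    """Limit of rho for a background at infinity: f^2 / (N*H*s)."""
    return f**2 / (N * H * s)

# Figure 1 examples: DSLR f = 50 mm, H = 15 mm; phone f = 4 mm, H = 3.6 mm;
# both at N = f/1.8, subject at ~6 ft (1.8 m) and ~2 ft (0.6 m) respectively.
cameras = {"DSLR": dict(f=0.050, H=0.015, s=1.8),
           "phone": dict(f=0.004, H=0.0036, s=0.6)}

for name, c in cameras.items():
    C = c["H"] / 1500  # same viewing conditions for both devices
    print(f"{name}: DOF = {dof_portrait(C, c['s'], 1.8, c['f']):.3f} m, "
          f"rho_inf = {100 * rho_inf(c['f'], 1.8, c['H'], c['s']):.2f} %")
# -> roughly four times the DOF and twelve times smaller background blur
#    spots for the phone, matching the factors derived above.
```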

Overcoming physics by image processing

Some of the latest smartphones feature a second backside camera with a longer equivalent focal length. Figure 1 (c) shows the same scene as in figure 1 (b), shot with the telephoto camera of the smartphone. As in figure 1 (a) and (b), the subject distance s was chosen so that the face fills 1/3 of the image width. While this is a significant improvement in terms of perspective, it can make the depth of field issue even worse. To squeeze a longer equivalent focal length into the same small form factor, smartphone manufacturers have to use very small sensors and apertures (β = 1.5; α = 5; N = f/2.8 for the smartphone used in figure 1 (c)). As a consequence, according to equation (2), the depth of field of the telephoto camera is twice that of the wide-angle lens! And according to equation (3), despite the longer equivalent focal length of the telephoto lens, a background at infinity is only slightly blurrier than for the wide-angle lens. Figure 4 shows how the blurriness ρ increases with b. Only for far-away backgrounds, the telephoto lens does slightly better than the wide-angle lens—and both mobile lenses are far behind the DSLR.

[Figure 4: plot of background blur spot size ρ (0–5 % of the image height) against the distance b between background and subject (0–20 m), for the DSLR, the smartphone wide-angle camera and the smartphone telephoto camera.]
Figure 4. Points on the background appear as blur spots of size ρ on the image. The DSLR obtains much stronger bokeh than both of the smartphone cameras. In particular, due to its even smaller sensor and its larger f-number, the telephoto camera performs worse than the wide-angle camera for backgrounds up to 7 m (23 ft) behind the subject.

This is where image processing comes to help. The idea is simple: blur out-of-focus areas computationally, so that they no longer draw attention away from the subject. Figure 1 (d) shows a result of such processing—and the first impression is rather convincing. There are however several challenges to this approach. Instead of a binary segmentation between subject and background—which may work on simple cases like figure 1—a depth map is required to handle the general case where the scene consists of more than two planes. The correctness and precision of this depth map are crucial for obtaining convincing, artifact-free computational bokeh. Many approaches exist for obtaining such a depth map, involving more or less sophisticated hardware and algorithms: through focus, structure from motion, dual camera (stereo vision), dual pixels and dedicated depth sensors. They all have their strengths and weaknesses and every company claims that their approach works best.

Even when the depth map is known, bokeh simulation is more challenging than just applying gaussian blur and making its radius a parameter of depth. Strictly speaking, it is impossible to simulate true optical bokeh using one single image from a single viewpoint, because even points behind the subject contribute to the image, as illustrated in figure 5. The smartphone has no information about these points. Fortunately, these points are not strictly necessary for obtaining a convincing illusion of bokeh—the observer usually will not guess that they should be there, either. The example in figure 5 might even look more natural if it showed only the pencil and a blurry dark background. It is, however, very important to keep a sharp transition between the sharp subject and the blurry background. Therefore, the blur computation in out-of-focus areas must not use pixels from in-focus areas.

Figure 5. Optical bokeh: points can contribute to bokeh, even if they are occluded by a sharp object/subject. The above example shows a pencil taken in front of a single point of light. Without optical bokeh, such a point of light is invisible (occluded by the pencil) and the smartphone has no chance to produce the same image as the DSLR.

Moreover, as can be observed in figure 5, in optical bokeh, light spots are not totally blurry—instead they appear approximately in the shape of the lens aperture [4], typically of circular shape. While this is not ideal to melt away unpleasant backgrounds [5], people are used to this particular look and associate it with professional photography and movies. If the goal is to recall this look, out-of-focus points of light should appear as disks rather than being melted away in smooth blur.

Finally, in optical bokeh, specular highlights in out-of-focus areas hit the sensor as large blur spots distributed over many pixels. In the smartphone case, however, they are concentrated on fairly few pixels, which tend to saturate. The true intensity of the highlight is therefore lost.

Although the shallow depth of field modes help to narrow the gap between smartphones and DSLRs, currently they are often plagued by processing artifacts, causing unnatural results.
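One way to satisfy the requirement that in-focus pixels stay out of the background blur is a normalized masked blur: the background is blurred using only background pixels, and the mask itself is blurred to renormalize near the silhouette. The sketch below is our own Python illustration (not any vendor's pipeline); the plain Gaussian kernel stands in for the disk-shaped kernel discussed above.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def composite_bokeh(image, subject_mask, sigma=8.0):
    """image: float array (H, W, 3); subject_mask: bool (H, W), True on the subject.
    Blurs the background only, using a normalized masked convolution so that
    in-focus pixels do not contribute to the out-of-focus blur."""
    bg = (~subject_mask).astype(np.float64)
    out = image.astype(np.float64).copy()
    for ch in range(image.shape[2]):
        num = gaussian_filter(out[..., ch] * bg, sigma)  # blur background content
        den = gaussian_filter(bg, sigma)                 # blur the mask itself
        blurred = num / np.maximum(den, 1e-6)            # renormalize near edges
        out[..., ch] = np.where(subject_mask, out[..., ch], blurred)
    return out
```

The division by the blurred mask keeps the transition between sharp subject and blurred background crisp, which the text identifies as essential for a convincing illusion.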
Related work

Smartphone users and manufacturers care a lot about image quality. Given the number of portraits people take with their smartphones, it seems obvious that both users and manufacturers are in need of a method for evaluating and comparing the shallow depth of field modes.

Indeed, most smartphone reviews include sections on these new modes [6, 7, 8]. Some of them are particularly in-depth and reveal many of the issues that we observed ourselves when playing around with the phones. The shortcoming is that these tests are hardly reproducible and their results are hardly comparable. While [6] is very detailed and points out the capacities and limits of the shallow depth of field modes of the compared smartphones, it provides limited information in that it compares only two devices. On the other hand, [8] provides reviews of all smartphones, but the portrait scenes vary so much that a fair comparison between the devices is at least difficult.

Our ambition is to propose a method that allows comparisons between all evaluated smartphones—among each other and to DSLRs. In particular, when a new device is evaluated, the new results should be comparable to all past results without re-testing the other devices.

The traditional approach to measuring image quality in a reproducible and comparable manner is utilizing test charts. The test chart consists of a well-defined pattern, typically very different from real world scenes, designed to evaluate one specific criterion, such as resolution [10, 11] or noise and color [9]. Test charts were convenient back in the days when cameras were "dumb", but as cameras get smarter and become aware of scene content, charts are less suitable. The highly non-linear image processing tends to behave differently on a test chart and in the real world. For instance, measuring noise in homogeneous areas and sharpness (as indicator for detail) on edges does not provide reliable information about real-world image quality any more.

This is why, already since the launch of our first DxOMark Mobile test protocol in 2012, we complement these standard measurements with image quality assessment performed on a lab scene, shown in figure 6 (a). The scene unites several challenging conditions (homogeneous areas, gradients, textures, color details, portraits, etc.) and is—at least—as complex as the real world. The advantage compared to real-world snapshots is that both the scene and the lighting conditions in our lab are perfectly repeatable. When a new smartphone comes out, all we have to do is shooting our lab scene with that single device. This allows instant comparison with all smartphones we have ever evaluated.

Figure 6. Lab scenes uniting challenging elements for detail and texture rendering. (a) ours, (b) [12]'s.

Figure 7. Issues in subject/background segmentation: (a) approximate edge; (b–c) blurred subject or sharp background.

The scene is flat, i.e. it has no perspective, which is convenient for comparing devices with different equivalent focal lengths. The same method is applied by [12] in their studio scene, shown in figure 6 (b). They shoot it with all camera bodies, at all ISO speeds, always with the same framing. Their web interface allows users to quickly and comfortably compare details of their studio scene in different camera body and ISO speed configurations.

The inconvenience of such complex scenes is that their quality is complex to assess. For instance, when noise is measured on a homogeneous area, one can easily compute its standard deviation or even its frequency power spectrum. But how to quantify noise on the picture of a human face? We found that the most reliable and robust "tool" for evaluating such complex concepts as grain on human skin or texture on human hair is the human visual system. The challenge with humans, however, is repeatability. We observed that when they are presented with two images from different devices and asked which they prefer, the results are not very repeatable. This is because individuals do not look at the same parts of the image and because they do not balance different criteria (color, sharpness, grain, etc.) in the same way. But when we break down the complexity and ask well-defined questions, perceptual evaluation yields highly repeatable results. For instance, to assess noise, we present a particular detail in our lab scene and ask people to evaluate the noise (and only the noise) compared to a stack of reference images. The repeatability can be further improved when this test is repeated for several details in the scene.

This "lab scene method" combines the repeatability of traditional test charts with the robustness of human perception against all kinds of image processing tricks. It better correlates with the real-world quality experience than evaluations purely based on test charts.

Proposed method

The method and laboratory setup we propose is basically an application of the "lab scene method" to the problem of computational bokeh. The similarity between our own lab scene and [12]'s suggests that there exists some consensus about what objects are pertinent for evaluating noise and detail. To the best of our knowledge, we are the first to propose a lab scene for evaluating computational bokeh. Unlike a scene for noise and detail, a bokeh scene must obviously be three-dimensional.

Evaluation criteria

The criteria we want to evaluate are based on observations by reviewers and on our own field testing.

Subject/background segmentation. Precise segmentation between subject and background is key to obtaining DSLR-like results. Current implementations suffer from depth maps at too low resolutions, leading to approximate edges (figure 7 (a)), and depth estimation errors, leading to sharp zones on the background or blurred zones on the subject (figure 7 (b) and (c)). These artifacts are today the most important difference between smartphones and DSLRs—avoiding them should be the highest priority for the smartphone industry.

One particular case where subject/background segmentation can fail is scene motion during capture. In particular, structure-from-motion based approaches are more vulnerable to scene motion than dual camera approaches. Since we consider subjects in motion a common real world use case, we want to evaluate the impact.

Equivalent aperture. In the paragraph on depth of field we showed that an equivalent aperture in the 35 mm format could be computed by taking the sensor size into account. We propose to extend this concept to the domain of computational bokeh. In this case, the equivalent aperture no longer depends on the physical characteristics of the camera, but on the image processing applied. We estimate its value by comparing the blur of certain image details to that of a full-frame DSLR at different apertures.

Blur gradient. When depth changes continuously, blur intensity should also change continuously. Basic portrait scenes may consist of only two planes more or less parallel to the sensor plane, but most scenes contain a more complex three-dimensional composition. Smooth blur gradients are therefore important for DSLR-like bokeh.
Noise consistency. When looking at the images in more detail, we observe an interesting side-effect of the bokeh simulation: the computationally blurred areas are totally free of grain. This comes as little surprise—blurring is a well-known denoising technique. Nevertheless it leads to an unnatural appearance. In a DSLR, the blur is created optically before the light rays even hit the sensor. The noise is therefore strictly the same in both in-focus and out-of-focus areas and does not guide attention to either. But a noise-free background draws attention away from a noisy subject and in this sense counteracts the intention behind bokeh simulation, which is to draw attention to the subject.

Character of the bokeh. This is the criterion that photographers first think about when discussing bokeh. Unfortunately, there is no general agreement among them on what perfect bokeh should look like. But computational bokeh seems to be simpler in this respect than optical bokeh. From what we observe, none of the smartphones simulates optical vignetting (causing the bokeh shape to vary in the field) or non-circular irises (causing bokeh of non-circular shape). Neither did we observe purple or green fringes (caused by chromatic aberrations), "soap-bubbles" (caused by over-corrected spherical aberrations) or "donuts" (caused by catadioptric lenses). While certain photographers rely on these effects to obtain a particular look, the smartphone manufacturers seem to suppose that they do not appeal to their average user. What we observe are circular shapes, the sharpness of which varies between smartphones. We want to report this sharpness in our evaluation. And it seems that it can be observed at any point of the image, because unlike for optical bokeh, the character of computational bokeh does not vary in the field.

Repeatability. Unlike optical bokeh, which is independent of the light level, computational bokeh tends to work better when there is more light. More light increases the signal-to-noise ratio in the images and facilitates the computation of the depth map. But even when the lighting does not change, we observe that many devices have rather unstable behavior. Some artifacts appear on one image and disappear on another, without any obvious reason. Sometimes the shallow depth of field mode fails altogether and the image is captured without any blur applied. In this respect, computational bokeh is much more challenging to evaluate than noise and detail, which are perfectly repeatable from one image to the other. In any case it will be necessary to base the evaluation on several images.

Evaluating subject/background segmentation

Laboratory setup. Segmenting the image into subject and background can be more or less complicated, depending on the textures of both the subject and the background. For devices with different equivalent focal lengths, the relation between subject and background of a given scene typically changes, thereby changing the difficulty of the exercise, as illustrated in figure 8.

Figure 8. Attention to perspective: the hair above the subject's left ear is blurred away in (a) and well preserved in (b). But that comparison is unfair because the background in (a) is complex and that in (b) is simply a homogeneous white surface.

To achieve a fair comparison between all devices, we must ensure that the difficulty does not depend on the focal length. We could have achieved this utilizing a fractal pattern as background—for instance a dead leaves pattern [13]. But while their power spectrum corresponds to that of natural images, spatially, they are quite different from most real-world backgrounds. Their self-similarity makes depth estimation more challenging than it is in most real-world scenarios, which is a bias that we prefer to avoid.

Our proposed setup, shown in figure 9 (a) and (b), consists of two planes—a subject and a background—the distance between which can be adjusted. First we measure the equivalent focal length at portrait distance of the smartphone under test. Then we adjust the distances between device, subject and background accordingly (figure 10). Computer controlled lighting ensures uniform illumination on both planes whatever the focal length. As subject we use a mannequin head surrounded by a complex shape to simulate a person's face, headdress and a waving hand. As background we use a print of our noise and detail lab scene, to which we have added several white straight lines. Figure 10 shows photos taken of this scene with two smartphones having different equivalent focal lengths. Note how the scene appears almost the same way in both cases.

Measure. The numerous features on both subject plane and background plane allow us to quantitatively determine the precision and reliability of the subject/background segmentation. As explained in the section Related work, we employ perceptual evaluation since it is very robust against processing artifacts. To obtain repeatable results, we have defined a detailed protocol describing exactly what features should be observed and how they should be judged. By observing various kinds of features, we attempt to obtain an unbiased measure of the real-world capacity of the device to distinguish between subject and background. Examples:
• For each of the white straight lines on the background, we give one point if it is melted away on its entire length.
• For each of the holes in the "hand" on the top right, we give one point if its content is as blurry as the rest of the background.
• On each of the spikes surrounding the face are printed small numbers from 1 to 5. The device gets one point for each number that is readable and an extra point when the number is as sharp as the rest of the subject.
Having features of different scale allows to measure the resolution and precision of the depth map. In the case of the spikes, most devices get at least one point for each of the numbers 1 to 3. But only very good devices manage to get the 4s as sharp as the subject—and currently no smartphone masters the 5s. On the other hand, any DSLR we tested obtained the maximum score. Examples are shown in figure 13.
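For illustration, the tally behind this measure could be automated once the perceptual observations are recorded; the function below is a hypothetical sketch with names and example counts of our choosing, not the published scoring.

```python
def segmentation_points(lines_melted, holes_blurry, digits_readable, digits_sharp):
    """Hypothetical tally of the perceptual observations listed above:
    one point per white line fully melted away, per 'hand' hole whose content
    matches the background blur, per readable spike digit, plus an extra point
    per digit that is as sharp as the rest of the subject."""
    return lines_melted + holes_blurry + digits_readable + digits_sharp

# Example: a device that melts 8 of the white lines, blurs the content of 3
# holes, keeps digits 1-3 readable and renders the 1s and 2s subject-sharp.
print(segmentation_points(lines_melted=8, holes_blurry=3,
                          digits_readable=3, digits_sharp=2))  # -> 16
```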
Evaluating blur quality

Laboratory setup. While a scene consisting of two planes is perfect for comparing subject/background segmentation between cameras with different focal lengths, it is not suitable for determining the depth of field and only to a limited extent for evaluating other characteristics of the bokeh. This is why our proposal includes a second scene, shown in figure 9 (c). To simplify the installation of the two scenes in our lab, they are actually both included in one setup: you can see in figure 9 (a) that the second scene is situated in the lower part of the background canvas—a part that is hidden behind the subject in the first scene.

This scene consists of a subject (mannequin head) or macro object (plastic flower) in the foreground and two planes (on the right and at the bottom) that are almost parallel to the optical axis. This time, these planes are covered with regular patterns, and they also extend in front of the subject. In the back, far from the focus plane, we place some tiny LEDs serving as points of light. This scene allows us to evaluate all the remaining criteria expressed above.

Figure 11. Out-of-focus points of light. (a) DSLR (full frame, 50 mm, f/2.8); (b) smartphone without computational bokeh; (c) good computational bokeh; (d) computational bokeh affected by processing artifacts.

Figure 9. The proposed lab setup (a) containing a first scene (b) consisting of two planes the distance between which can be adjusted, as well as a second scene (c) with continuously changing depth.

Measure. For determining the equivalent aperture, we have shot the same scene using a full-frame DSLR, at various focal lengths, for various apertures each. Given the equivalent focal length of the smartphone, we pick the appropriate stack of DSLR images and compare the blurriness produced by the smartphone to these references. The aperture of the DSLR image that comes closest defines the equivalent aperture of the computational bokeh.

The blur gradient smoothness is observed on the regular patterns at the bottom and on the line of squares on the right. Having the same pattern at all distances reveals even small discontinuities that would go unnoticed in many real-world scenes.

Noise consistency is observed between the in-focus subject and the background. Although the depth differs greatly between the two image patches we compare, they appear close to each other in the image, and have similar gray levels, so that there is no reason besides bokeh simulation for them to show different grain.
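Both of these comparisons are performed perceptually in our protocol. As a rough numeric counterpart, one could summarize blur with a sharpness proxy and grain with a high-frequency residual; the sketch below is our own illustration, and the variance-of-Laplacian proxy and patch-based grain ratio are assumptions, not the published measures.

```python
# Two numeric proxies for the observations above (our own sketch; the
# published protocol performs these comparisons perceptually). Patches are
# grayscale numpy arrays cropped from the lab scene.
import numpy as np
from scipy.ndimage import laplace, uniform_filter

def sharpness(patch):
    """Variance of the Laplacian; it decreases monotonically with stronger blur."""
    return float(np.var(laplace(patch.astype(np.float64))))

def equivalent_aperture(phone_patch, dslr_stack):
    """Pick the DSLR reference whose blur comes closest to the phone's.
    dslr_stack maps f-number -> patch of the same detail, shot at the DSLR
    focal length matching the phone's equivalent focal length."""
    target = sharpness(phone_patch)
    return min(dslr_stack, key=lambda n: abs(sharpness(dslr_stack[n]) - target))

def grain_ratio(subject_patch, background_patch):
    """Grain level (std of the high-frequency residual) of the in-focus subject
    relative to the blurred background; values well above 1 reveal the
    noise-free background typical of computational bokeh."""
    def grain(p):
        p = p.astype(np.float64)
        return float(np.std(p - uniform_filter(p, size=7)))
    return grain(subject_patch) / max(grain(background_patch), 1e-6)
```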

The bokeh shape can be observed on the LED light spots in the background. Assigning a score is difficult because of the missing general agreement on what perfect bokeh should look like. Our current approach is to demand a circular shape with rather sharp borders. Artifacts like in figure 11 (d) should be avoided because they do not resemble optical bokeh of any lens. And DSLRs do not necessarily obtain the highest score in this category because their optical bokeh is not always circular.

Evaluating repeatability

Laboratory setup. We apply some variations to both scenes, to observe the impact of the presence or absence of faces and that of moving parts in the scene. We test at two different levels of illumination, at 1000 and 50 lux. For every variation, we take five identical pictures, to observe repeatability issues that are independent of the scene content.

Measure. Repeatability is assessed perceptually by comparing the images showing the same scene.
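As a complement to the perceptual comparison, one could also quantify the spread across the five captures; the following sketch (ours, not part of the published protocol) uses the mean per-pixel standard deviation.

```python
import numpy as np

def repeatability_spread(shots):
    """shots: list of aligned grayscale images of the same scene variation.
    Returns the mean per-pixel standard deviation across the shots; a device
    whose bokeh mode fires inconsistently (or not at all) shows a large spread."""
    stack = np.stack([s.astype(np.float64) for s in shots], axis=0)
    return float(stack.std(axis=0).mean())
```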

[Figure 10: diagram of the setup distances for two example devices (fields of view 66° and 33°) with the resulting, nearly identical photos.]
Figure 10. By adjusting the distances between device, subject and background in function of the equivalent focal length (i.e. the field of view), our test scene appears the same for all devices. This allows to compare subject/background segmentation independently of the equivalent focal length. The above example shows the distances and resulting images for smartphones with equivalent focal lengths of 24 and 54 mm.

Results

The bokeh evaluation protocol we apply in our DxOMark Mobile reviews is heavily inspired by the described method. The multitude of individual measurement results it provides for each criterion are aggregated into subscores via empiric formulas, which are then aggregated into a single-value bokeh score via another empiric formula. While the individual measurements are inherently objective, the aggregation formulas may seem a bit subjective at first.

For instance, how should repeatability and equivalent aperture be balanced? Obviously we want both, so the maximum score must be given to a device that produces strong bokeh and has high repeatability. But do you prefer a device that repeatably produces weak bokeh—or a device that produces artifacts most of the time but sometimes manages to obtain really stunning results? This is definitely a question of preference and there is no universal answer. When designing our aggregation formulas, our goal is to guide smartphone manufacturers to make trade-offs that please the majority of their users. We therefore base our formulas on assumptions on what the majority of people want.

For the DxOMark Mobile protocol, we complete the lab measurements with a set of natural test scenes. They are less repeatable, but having a large diversity of image content allows to confirm the lab results.

Figure 12 shows the scores for three smartphones released in 2017. The theoretical best score, attained by a DSLR with a very good lens, is 100. The smartphones differ by the hardware they use and by the processing and tuning applied. Overall they all obtain impressive results, even though they cannot (yet) match the DSLR. In figure 13 we reproduce some details from our lab scenes to illustrate how the quality difference measured by our method manifests itself in the lab images. In figure 14, we show a real-world use case, an indoor portrait, shot with the three devices.

[Figure 12: three radar charts, one per device, plotting the subscores for subject/background segmentation, equivalent aperture, blur gradient smoothness, bokeh shape, noise consistency and repeatability on 0–100 axes.]
Figure 12. Sub- and total scores for three smartphones released in 2017. Device A: wide-angle + telephoto dual camera (DxOMark Mobile bokeh score 55). Device B: color + monochrome dual camera (score 30). Device C: single camera with dual pixels (score 45).
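The exact aggregation formulas are empiric and not disclosed here. Purely to make the trade-off above concrete, the following hypothetical sketch (weights and structure entirely ours) makes repeatability multiplicative, so that a device producing stunning but erratic results cannot reach the top:

```python
def bokeh_score(segmentation, aperture, gradient, shape, noise, repeatability,
                weights=(0.35, 0.20, 0.15, 0.15, 0.15)):
    """All subscores on a 0-100 scale; weights are hypothetical."""
    quality = sum(w * s for w, s in
                  zip(weights, (segmentation, aperture, gradient, shape, noise)))
    return quality * repeatability / 100.0  # erratic devices are scaled down

print(round(bokeh_score(62, 70, 49, 67, 65, 90)))  # -> 57 (hypothetical numbers)
```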
Conclusion

We have presented a laboratory setup and assessment method to evaluate the quality of computational bokeh. Our method does not require shooting all test images with all smartphones on the same day—the score can be determined independently for each smartphone. This flexibility allows to rank all smartphones on a common scale. We have tested all major smartphones proposing shallow depth of field modes, from 2015 to today. The scores are shown in figure 15. The progress between 2015 and 2017 is remarkable and makes us optimistic that most of the teething troubles currently connected with computational bokeh will be solved during the next years.

Our evaluation shows that precise segmentation of subject and background is key for obtaining convincing results. Interestingly, we observe no significant correlation between bokeh quality and the hardware technology employed. On the other hand, we observe obvious differences in tuning and in the manufacturers' readiness to assume risk: some apply strong bokeh aiming to reproduce the very shallow depth of field of DSLR portrait lenses. Others prefer blurring a little less, thereby hiding artifacts caused by perfectible subject/background segmentation and obtaining a more natural look.

The proposed method is designed so that a DSLR obtains the highest score, provided it has a circular aperture of f/2.8 or wider. One might however imagine situations where smartphones and their computational bokeh outperform DSLRs. Suppose a group portrait where different faces lie in slightly different planes. Using a DSLR, for obtaining a picture where all faces appear perfectly sharp, you would have to stop down, which would significantly reduce background blur. This is because depth of field and background blur are related by physical laws. Computational bokeh is not subject to these laws. A smartphone could combine a depth of field large enough to have all faces sharp and strong background blur. An evaluation method that takes such use cases into account is subject to future research.

DSLR A (45)

B (44) C (62)

Blur gradient smoothness Device B

DSLR A (60)

B (32) C (49) Device C Bokeh shape

DSLR A (82)

B (91) C (67)

Noise consistency

Figure 14. Real-world use case: an indoor portrait shot in shallow depth of field mode. Note that the comparison is tricky due to the different fields of view. Photographers usually prefer the narrower field of view of the devices A and C. While A is equipped with a dedicated telephoto lens, C applies dig- DSLR A (60) ital zoom. Device A achieves good segmentation between the subject and the background and it applies rather weak bokeh. While this limits the wow- factor, it also limits the visibility of segmentation artifacts. This device does not attempt to blur the in front of the subject. The only annoying artifact are the sharp points of light behind the subject’s waving hand. Device B sim- ulates strong bokeh, but the result suffers from segmentation errors, which B (20) C (65) are amplified by the strength of the bokeh. The upper part of the refrigera- Figure 13. Examples that illustrate the differences between a DSLR and tor is blurry, the lower part is perfectly sharp. The microwave oven becomes the smartphone devices A, B and C. The scores in figure 12 are computed sharper as it approaches the subjects head, and parts of the hair are blurred. from dozens of unitary evaluations and give a more nuanced picture than any Device C, despite having only one camera, comes very close to device A. But of these examples. the control on the refrigerator is sharp and the transition between the table and the subject’s arm is too abrupt. 60 [4] Hubert H. Nasse, Scharfentiefe¨ und Bokeh, Carl Zeiss AG, 2010. [5] FE 100mm F2.8 STF GM OSS, Sony, product page, https://www. 50 sony.com/electronics/camera-lenses/sel100f28gm, consulted 2017- 11-24. [6] Adam P. Murray, Tested: Galaxy Note 8 Live Fo- 40 cus vs iPhone 7 Plus Portrait Mode, Macworld, https://www.macworld.com/article/3221403/android/ galaxy-note-8-live-focus-vs--7-plus-portrait-mode.html, 30 2017, consulted 2017-11-24. [7] Raymond Wong, Which phone takes the best 20 portrait photos: iPhone X, Pixel 2, or Note 8?, Mashable, http://mashable.com/2017/11/09/ DxOMark Mobile bokeh score apple-iphone-x-pixel-2-xl-note-8-portrait-mode-comparison, 10 2017, consulted 2017-11-24. [8] Full camera shootout: Galaxy Note8 vs iPhone 8 Plus, 0 GSM Arena, https://www.gsmarena.com/iphone 8 plus vs galaxy 2015 2016 2017 note8-review-1673.php, 2017, consulted 2017-11-24. [9] ColorChecker Classic, X-Rite, product page, https://www.xrite. Figure 15. DxOMark Mobile bokeh scores for major smartphones offering com/categories/calibration-profiling/colorchecker-classic, con- shallow depth of field modes, in chronological order. The progress made sulted 2017-11-24. in only two years is remarkable and suggests that computational bokeh will [10] USAF 1951 Resolution Target, SilverFast, product page, further improve during the coming years. http://www.silverfast.com/show/resolution-target/en.html, con- sulted 2017-11-24. References [11] ISO 12233, Photography – Electronic still picture – Reso- [1] Ralph E. Jacobson, Sidney F. Ray, Geoffrey G. Attridge and Norman lution and spatial frequency responses, 2017. R. Axford, The Manual of Photography, 9th edition, Focal Press [12] Kelcey Smith, Studio Test Scene, DPReview, https://www.dpreview. (Elsevier), 2000. com/articles/2601653565/studio-test-scene, 2013, consulted 2017- [2] James L. Mannos, David J. Sakrison, “The Effects of a Visual Fi- 11-24. delity Criterion on the Encoding of Images”, IEEE Transactions on [13] Ann B. Lee, David Mumford, Jinggang Huang, “Occlusion Mod- Information Theory, vol. 20(4), 1974. 
els for Natural Images: A Statistical Study of a Scale-Invariant [3] “—An Insider’s Look Behind the Scenes”, Camera Dead Leaves Model”, International Journal of Computer Vision, Lens News, Carl Zeiss AG, 1997. vol. (1/2), 35–59, 2001.