Image capture and processing challenges--and solutions--in portable designs--Part IV
Here is the final segment of a four-part article series looking at the trends and design challenges of image acquisition and processing on cell phones and other hand-held platforms. Part 4 is a look at consumer features.
By Giles Humpston, Tessera
Mobile Handset DesignLine
(12/01/08, 12:06:00 AM EST)
Part I
Part II
Part III

The number of images and length of video footage captured by consumers has increased exponentially since the advent of digital photography. Almost all cell phones have at least one camera--a high-resolution device pointing outwards for photography and a lower-resolution device aimed at the user for video conferencing. Camera phones are compact, easy to use and usually readily available through being permanently turned on and quite literally "to-hand."

The adoption of wafer-level packaging and new manufacturing techniques has reduced the price of camera modules to the point where they are now considered a standard item on most phones, like a color screen. Innovation in die packaging and optics has reduced the height of camera modules so they can be incorporated in the thinnest of cell phones. Software-enhanced optics improves the picture quality of these compact and low-cost camera modules to the point where they can compete with the digital still cameras of comparable resolution.

The vast majority of consumers are not natural or trained photographers. They do not understand the principles of exposure and through failure to read manuals, frequently don't know how to adjust camera settings. Consequently the majority of digital images are (or should be) discarded. This is an unsatisfactory situation for all parties: the camera designer who has invested heavily to deliver good optical performance, the network operator looking to boost revenue from data transmission, and the user due to mediocre pictures.

A look through most photograph albums will confirm that people are the subject matter of the majority of photographs. Even when capturing a great scenic view it is traditional to position a family member or friend to one side of the frame. Because our subconscious latches onto faces, the one region of a photograph that must be exact is the principal face. Given that the average user is unable or unwilling to change the camera settings for each photograph, software that can locate faces and ensure they are without optical aberration can greatly enhance the perception that a camera takes really great photographs. Having a face correctly exposed, in focus and properly color balanced can make the difference between user satisfaction and the photograph being discarded. The goal is to endow camera phones with features that enhance the picture-taking experience and gratify users with their photographic skills. The tool available to accomplish this is software using the process of numerical image enhancement.

Numerical Image Enhancement
Features made possible by numerical image enhancement--or "smart imaging"--can be broadly grouped into two categories: those that improve image quality and that are often transparent to the user; and those that gratify the user by providing an experience.

The camera lens designer is often faced with the challenge of managing chromatic distortion. The trade-off is between performance and cost, as more expensive lens materials generally possessing better optical properties. An example is "purple fringing," a distortion that occurs in areas of high contrast shot with inexpensive lenses. Purple fringes can show up along a roofline set against a bright sky, or around bare tree branches. An example is shown in Figure 9. Purple fringe generally only becomes visible when a picture is enlarged on a video monitor, so will not be apparent when the customer tests the camera phone in the shop prior to purchase. Fortunately, chromatic aberration is constant for a given optical design. Therefore, rather than tolerate a higher-cost lens solution, the camera module manufacturer can use an inexpensive lens train and apply numerical image enhancement to correct the appearance of the images with software.


Another example of numerical image enhancement to improve image quality is red-eye reduction. Red-eye is a common complaint of low-light photography. For people of Asian descent the effect often manifests itself as golden-eye. The effect has several causes, including, most commonly, light from a flash source being reflected by the retina back into the camera. The problem is most severe when the flash source is located close to the optical axis of the camera, as it is on a camera phone. While the trend today is toward better low-light performance without flash, customers expect high-end camera phones to have flash and continue to have the need for red-eye and golden-eye removal. There is also the unwritten presumption that having paid additional money for the flash option, the customer is more dissatisfied by a good quality picture spoiled by red-eye than a mediocre picture without it. While various image enhancements like red-eye removal can be done on pictures after downloading them to a computer, this fix requires time, a software package and some computing knowledge to achieve a good end result. Because most consumers simply want pictures without red- or golden-eye, the preferred approach is to embed the necessary algorithms on the cell phone and correct the affected photographs without any user intervention or even knowledge that it has been done (see Figure 10).


Face-based Imaging
Because faces are so important to the perception of photographic quality and whether or not a consumer retains or discards a photograph, many numerical image enhancement solutions are based around face-based imaging.

The first challenge of face-based imaging is to identify the faces in the scene. Mathematically this is not a trivial exercise. Face identification is challenging because of the diversity of face types, and is further complicated when photograph subjects wear glasses, hats, earrings and other accessories. The solution commonly adopted is one of statistics. One commercial software program has approximately 200 rules of what constitutes a face, looking for artifacts like hairlines and eyes-nose and ears-mouth triangulations. The software decides it has found a face when approximately 10% of the rules flag as valid. Because face tracking is done in real time, once the software locks on to a face it can continue to track its location in the image until the shutter is pressed.

Most camera phones have limited dynamic range, which restricts their ability to capture detail in both light and dark areas at the same time. Pictures of people standing with their back to a window often results in a white background with the person reduced to a dark silhouette. While it is possible to design and manufacture CMOS imagers with wide dynamic range they are relatively expensive components. Software solutions can go a long way toward correcting the imbalance so the user is not disappointed by even a badly composed photograph (corrected by the camera). By equipping the camera with face-based imaging, the prominent face in the field of view can be identified and the exposure, focus and color balance all optimized for that region and the photograph rendered acceptable, as shown in Figure 11.


Once faces have been identified, further interesting camera features become possible. Two obvious examples are smile and blink detection (see Figure 12). The ability to acquire an image only when the faces are smiling and not blinking greatly improves the ratio of good to mediocre photographs and boosts the user's self-esteem as a photographer and satisfaction with the camera phone in general. A tantalizing possibility that arises from face detection is face recognition. In the future it may be possible to automatically tag photographs with information about the people they contain. Photo albums could then be electronically cataloged by content, rather than just date as at present.


One of the consequential benefits of identifying faces in photographs is that a face provides orientation data, offering a simple solution for auto-rotation of handsets without requiring additional hardware. Electro-mechanical devices to determine handset orientation with respect to gravity are not particularly cheap, small or low-power devices, so substitution by an existing camera module plus some software is highly desirable.

Integration
One of the challenges presented by software-enhanced optics and numerical image enhancement is where to put the algorithms on the cell phone. To a certain extent it is decided by whether the algorithm needs to operate on the raw imager data or part of the image data file. Transformations that are applied to raw image data are best embedded on the CMOS imager, particularly if they require less than 100k gates. However even this distinction is not clear-cut as it is usually possible to partition the algorithm so part runs on raw data and part on the image data file. For the software that is not embedded on the imager chip it could be placed either in an image sensor processor, a co-processor or even the baseband processor.

Particularly for the latter two sites there is the further choice of firmware or software. Generally the more advanced the handset the greater is the inherent computational ability, and hence the algorithms can be located further from the camera module. With handset manufacturers looking to find additional revenue streams, the possibility of selling a base model handset that can be upgraded in performance by purchasing the appropriate software is another possibility.

The most efficient approach to software integration is to obtain all of the algorithms required for the software-enhanced lenses and consumer features from a single source. This is because many of the algorithms require common inputs to function. For example most software that does image manipulation performs a background calculation of grey scale range and color balance. Obtaining an integrated software solution avoids duplication of resources since a single grey scale range calculation can be made available to all of the algorithms that need it. Fundamental metrics common to all algorithms can be derived once and reused among the numerical image enhancement and software-enhanced optics solutions as necessary for each picture. The software can be further layered with higher-level features built on lower platforms. The result is more robust, compact, faster and more easily upgraded software, all of which reduces the cost of ownership.

Tessera is one of very few companies able to provide the full integrated suite of technology required for low cost, compact and feature-rich camera modules for cell phones, including wafer-level packaging of imagers, wafer-level optics, wafer-level camera solutions, software enhanced optics and numerical image enhancement.

About the Author
Giles Humpston, Ph.D., serves as Director, Research and Development of Tessera. Dr. Humpston has spent his entire professional career working in the field of semiconductor packaging, initially for military applications and more recently for high volume consumer products. He is a metallurgist by profession and has a doctorate in alloy phase equilibria. Dr. Humpston is a cited inventor on more than 75 patents and has co-authored several text books on metallic joining processes. His work and technical publications have been recognized with five international awards. Dr. Humpston's current interests are packaging of solid state camera modules and product miniaturization through wafer level technologies. He can be reached at: ghumpston@tessera.com

For more on the subject:

Tips and Tricks: The critical nature of cell phone camera packaging
What you need to know about imaging solutions for camera phones
Cameras in handsets evolving from novelty to DSC performance, despite constraints
Image pipeline: Fine-tuning digital camera processing blocks
Get enlightened about camera phone flash units--compare Xenon to high current LEDs