Intel® Perceptual Computing SDK 2013 GOLD Release Notes

21
Intel ® Perceptual Computing SDK 2013 GOLD Release Notes (Version 13842) Overview The Intel® Perceptual Computing SDK is a library of pattern detection and recognition algorithm implementations exposed through standardized interfaces. The library aims at lowering barriers to using these algorithms and shifting the application developers’ focus from coding the algorithm details to innovating on the usage of these algorithms for next generation human computer experience. Features – Change Logs The SDK sets the environment variable PCSDK_DIR to the installation location. By default, PCSDK_DIR points to C:\Program Files (x86)\Intel\PCSDK. The SDK provides the following documentation under $(PCSDK_DIR)\doc: Name Reference Manual Description CHM/sdkhelp.chm The CHM version comprehensive documentation about developer guide, reference manuals, samples and tutorials. CHM/sdkmanuals.chm The CHM version reference manuals. HTML/index.html The HTML version comprehensive documentation about developer guide, reference manuals, samples and tutorials. PDF/sdkdevguide.pdf The PDF version developer guide. PDF/sdkmanuals.pdf The PDF version reference manuals. PDF/sdkprivacy.pdf The PDF version privacy notification guide. PDF/sdksamples.pdf The PDF version sample guide. PDF/sdktutorial-cpp.pdf The PDF version C++ tutorial. PDF/sdktutorial-csharp.pdf The PDF version C# tutorial. PDF/sdktutorial-open.pdf The PDF version openFrameworks* tutorial.

Transcript of Intel® Perceptual Computing SDK 2013 GOLD Release Notes

Page 1: Intel® Perceptual Computing SDK 2013 GOLD Release Notes

Intel® Perceptual Computing SDK 2013 GOLD

Release Notes (Version 13842)

Overview

The Intel® Perceptual Computing SDK is a library of pattern detection and recognition

algorithm implementations exposed through standardized interfaces. The library aims at

lowering barriers to using these algorithms and shifting the application developers’ focus

from coding the algorithm details to innovating on the usage of these algorithms for next

generation human computer experience.

Features – Change Logs

The SDK sets the environment variable PCSDK_DIR to the installation location. By default,

PCSDK_DIR points to C:\Program Files (x86)\Intel\PCSDK.

The SDK provides the following documentation under $(PCSDK_DIR)\doc:

Name Reference Manual Description

CHM/sdkhelp.chm The CHM version comprehensive

documentation about developer guide,

reference manuals, samples and tutorials.

CHM/sdkmanuals.chm The CHM version reference manuals.

HTML/index.html The HTML version comprehensive

documentation about developer guide,

reference manuals, samples and tutorials.

PDF/sdkdevguide.pdf The PDF version developer guide.

PDF/sdkmanuals.pdf The PDF version reference manuals.

PDF/sdkprivacy.pdf The PDF version privacy notification guide.

PDF/sdksamples.pdf The PDF version sample guide.

PDF/sdktutorial-cpp.pdf The PDF version C++ tutorial.

PDF/sdktutorial-csharp.pdf The PDF version C# tutorial.

PDF/sdktutorial-open.pdf The PDF version openFrameworks* tutorial.

Page 2: Intel® Perceptual Computing SDK 2013 GOLD Release Notes

PDF/sdktutorial-processing.pdf The PDF version Processing* tutorial.

PDF/sdktutorial-java.pdf The PDF version Java* tutorial.

PDF/sdktutorial-unity.pdf The PDF version Unity* tutorial.

PDF/sdktutorial-havok.pdf The PDF version Havok* tutorial.

Release 13842 (GOLD R8):

This release provides the following major changes:

o Pulse Estimator Preview module

o Emotient* emotion detection Preview module

o Support for Nuance* Genie* Command And Control, Dictation, Text To Speech

(This is supported only for Windows* 8)

o Additional projection interface Color to depth projection

Release 12492 (GOLD R7):

This release provides the following major changes:

o Capture Viewer enhancements for 3D

Release 12029 (GOLD R6):

This release provides the following major changes:

o Tracing capability is enabled and the trace information can be viewed via sdk_info

o C++ and C# sample raw_streams which unifies raw stream playback and

projection visualization

Release 10898 (GOLD R5):

This release provides the following major changes:

o A DepthBall demo application has been added to showing development with

Microsoft XNA and 3D data. This needs MS .NET FW 4.0 installed on the machine.

o Enabled UtilCaptureFile for real-time recording.

Release 9870 (GOLD R4):

This release provides the following major changes:

o Changes to address the Command & Control interoperability issue with Dragon

Page 3: Intel® Perceptual Computing SDK 2013 GOLD Release Notes

Assistant application; Dictation will be fixed in later release.

o Added documentation which captures Best Known methods for Voice Recognition

– to tune audio volume and confidence thresholds for improved recognition.

o Camera Calibration tool: The Intel® Perceptual Computing SDK 2013 R3 (version

9302) shows a misalignment in the UV map and color image with earlier versions

of the Creative camera developer kits. The camera calibration tool is a part of R4

and is a short-term solution to compensate certain camera offset mapping depth

to color coordinates. This is a developer tool only. For end users, the cameras are

calibrated during the manufacturing process.

Release 9302 (GOLD R3):

o This release provides the following major features:

o Background Segmentation/Removal module

o Support for multiple languages in Speech

deDE German (Germany) enGB English (Great Britain) enUS English (United States) frFR French (France) itIT Italian (Italy) jaJP Japanese (Japan) ptBR Portuguese (Brazil) esUS Spanish (United States)

zhCN Chinese (PRC)

Release 8779 (GOLD R2):

o This release provides the following major features:

o Support Java application development.

o Enable projection support in framework porting libraries.

o The voice_recognition and voice_synthesis samples are rewritten to be GUI

applications for UNICODE input/output.

o Rewrote the gesture_viewer sample to be GUI based and include the simple

(UtilPipeline-based) and advanced pipeline (UtilCapture-based). Removed the

gesture_viewere_simple sample.

o Rewrote the gesture_viewer_simple.cs (C#) sample to be GUI based and renamed

it to be gesture_viewer.cs.

o Added a face_analysis.cs sample (C#) to demonstrate face detection and

Page 4: Intel® Perceptual Computing SDK 2013 GOLD Release Notes

landmark tracking.

o Added a tutorial to support using the SDK under Havok Vision* SDK.

o The face attribution detection feature is deprecated. Applications built with the SDK

GOLD version will be still functional in SDK GOLD R2.

o Major revisions to documentation including C# function definitions and samples

Release 7383 (Gold):

o This release provides the following major features:

o Native Microsoft* Visual Studio* 2008, 2010 and 2012 project files for samples.

o Two usage demo applications:

The AugmentedFarm sample application displays an augmented book, using the

object tracking technology.

The RemoteControl sample application demonstrates driving non-SDK

applications using SDK enabled finger, gesture and voice events.

o With the GOLD license, the developers can create commercial SDK applications.

Please refer to the SDK developer guide $(PCSDK_DIR)/doc/PDF/sdkdevguide.pdf

for application deployment recommendations and requirements, the redistributable

files $(PCSDK_DIR)/redist.txt, and the license terms $(PCSDK_DIR)/Intel PCSDK

EULA.rtf.

Release 6765 (Beta3):

This release provides the following major features:

o The Creative Senz 3D* I/O module exposes the PXCProjection interface for

mapping RGB, depth and world coordinates. See sdkmanual-core.pdf for how to

use the interface. The camera_uvmap sample illustrates the usage.

o The finger tracking module exposes a new pose BIG5 and three new gestures

CIRCLE, SWIPE_UP, and SWIPE_DOWN.

o The Unity and Processing framework ports now support voice recognition and

dictation. See sdkmanual-framework.pdf for details.

o The finger tracking module and the face tracking module use Intel® AVX instructions

set for performance optimization. As a result, the SDK requires Microsoft Windows 7 SP1 or later.

o The SDK provides an experimental runtime installer for end-user system installation.

Page 5: Intel® Perceptual Computing SDK 2013 GOLD Release Notes

The installer is a silent installer designed to be integrated as part of the application

package. Please report any issues or request additional features.

o The SDK includes two usability sample applications under the demo directory:

The GlassBalls sample application shows how to use shadow hands to position

and interact with physical objects.

The HeadCoupledPerspective sample application shows how to use face

detection to provide a different view perspective in 3D object visualization.

Release 5831 (Beta2):

This release provides the following major features:

o Nuance* speech recognition and speech synthesis. Download the required voice

add-on package (non-redistributable and for development only) from

http://www.intel.com/software/perceptual. The following samples demonstrate the

usages:

voice_recognition Command control and dictation in C++

voice_recognition.cs Command control and dictation in C#

voice_synthesis Speech synthesis in C++

o Depthmap and the Creative Senz 3D* camera I/O module. See the depth_smoothing

sample for comparison of raw and smoothed depthmap streams.

o Improved finger tracking and gesture recognition for tracking stability and

pose/gesture recognition accuracy.

o Improved framework and game engine support for multithreaded applications.

o Improved privacy notification tools for better visualization of applications that access

to the I/O devices.

Release 5057 (Beta1):

This release provides the following major features:

o Color, depth and audio stream capturing; file recording and playback.

o Finger tracking and pose/gesture recognition.

o Face tracking and facial landmark tracking.

o 2D/3D markerless object tracking.

o This release provides the following experimental features for feedbacks:

Privacy monitoring utility that visualizes certain data access related to personal

identifiable information.

Page 6: Intel® Perceptual Computing SDK 2013 GOLD Release Notes

Depth stream filtering capability in the depth_smoothing sample.

System Requirements

Hardware

2nd and 3rd generation IA-32 or Intel® 64 Intel® Core™ architecture processors (with Intel® AVX support).

1GB free hard disk space.

The Creative Senz 3D* camera (firmware version 2028 or later.)

The Creative Senz 3D* Interactive Gesture camera KIT is meant for application developers to use

with the Perceptual Computing SDK 2013. The Creative Senz 3D* Interactive Gesture camera KIT may

have known defects and errata which will be published along the product.

USB-2 or USB-3 port.

Software

Microsoft Windows 7 (with SP1) or Microsoft Windows 8.

Microsoft Visual Studio C++ 2008, 2010 or 2012 with Service Pack 1 or later versions.

Microsoft .NET* 4.0 framework for C# development.

SoftKinetic* DepthSense* SDK runtime 1.0.4 or later.

Unity PRO 3.5.1f2 or later for Unity 3D development.

Processing 1.5.1 or 2.0 beta7 for Processing framework development.

Java JDK 1.7.0_11 for Java development.

openFrameworks* v0071 or later for openFrameworks development.

Nuance* Dragon Assistant* Core Engine 1.1.7 or later & Nuance Language Pack 1.1.1 or later.

For development purpose, these Nuance components are a part of the Intel® Perceptual

Computing SDK developer package. There is no redistribution rights granted for these.

These Nuance components are available only on qualified platforms. The application developer can negotiate directly with Nuance for redistribution rights on other platforms.

Havok* Vision* SDK v2012.2.1 or later for Havok development.

Page 7: Intel® Perceptual Computing SDK 2013 GOLD Release Notes

Installation

Remove any previous SDK packages. Cleanup destination directory if needed.

Run the SDK web-based or off-line installer also from http://www.intel.com/software/perceptual.

Reboot your computer. This step is critical to ensure environmental parameters

propagate to all applications.

API Change Logs

Change with Release 12492 (GOLD R7):

Added pxcpulseestimator.h

Added pxcemotion.h

Change with Release 12492 (GOLD R7):

None

Change with Release 12029 (GOLD R6):

None

Change with Release 10898 (GOLD R5):

Added the following device properties. See the SDK reference manuals for details. o DEVICE_PROPERTY_DEPTH_UNIT

o DEVICE_PROPERTY_CAMERA_MODEL

Added the following functions to UtilCapture and UtilPipeline for directly accessing

color formats: o UtilCapture::QueryImageByFormat

o UtilPipeline::QueryImageByFormat

Added property info and deprecated property imageInfo in C# PXCMImage. The

property imageInfo is not CLS compliant.

Added property info and deprecated property audioInfo in C# PXCMAudio. The

property audioInfo is not CLS compliant.

Added Util[M]Capture::SetFilter(PXC[M]Capture::DeviceInfo) for locating a

specific device.

Added the grammar field in PXC[M]VoiceRecognition::Recognition to distinguish

between different grammars in the command and control mode.

Added the confidence field in PXC[M]VoiceRecognition::ProfileInfo to set

confidence threshold for voice recognition. The argument threshold in the

PXC[M]VoiceRecognition::SubscribeRecognition function is deprecated.

Change with Release 9870 (GOLD R4):

None

Page 8: Intel® Perceptual Computing SDK 2013 GOLD Release Notes

Change with Release 9302 (GOLD R3):

A new API for Background Removal pxcsegmentation.h has been added. See

reference manual for details

Change with Release 8779 (GOLD R2):

The face attributes detection API PXCFaceAnalysis::Attribute is deprecated in C++

and C#. Applications compiled with Gold release are still compatible with Gold R2.

The voice parameter is deprecated in the PXCVoiceSynthesis::ProfileInfo interface.

Currently there is only a single voice (some male and some female depending on the

language setting) for each language.

The following C# functions are added to help access the ImageData buffers. The

PXCMImage.QueryBitmap function is deprecated (but still available).

o PXCMImage.ImageData.ToByteArray

o PXCMImage.ImageData.ToShortArray

o PXCMImage.ImageData.ToFloatArray

o PXCMImage.ImageData.ToBitmap

o PXCMImage.ImageData.ToWriteableBitmap

Added ProcessAudioEOS() in the C++/C# voice interface to signal end of stream,

replacing the C# function ProcessAudioAsync(), which is deprecated but still available.

The QueryLandmarkData function in Processing relaxes the requirement that the array

structure be fully initialized. The red line below is now optional.

PXCMFaceAnalysis.Landmark.LandmarkData[] data=new

PXCMFaceAnalysis.Landmark.LandmarkData[6];

// for (int i=0;i<data.length;i++) data[i]=new

PXCMFaceLandmark.Landmark.LandmarkData();

pp.QueryLandmarkData(face_id,PXCMFaceAnalysis.Landmark.LABEL_6POINTS,da

ta);

The following functions are added to the framework support:

o ProjectImageToRealWorld

o ProjectRealWorldToImage

o MapDepthToColorCoordinates

o SetVoiceLanguage

Page 9: Intel® Perceptual Computing SDK 2013 GOLD Release Notes

The following enhancements are made to access the 60fps QVGA depth stream:

o The C++ UtilCapture::SetFilter or C# UtilMCapture.SetFilter

functions can now set filters to specify size and frame rates.

o Unity, Processing and Java programs can use the mode DEPTH_QVGA_60FPS to

request the 60fps depth stream.

The Unity API adds QueryRGB(Int32[]) to retrieve raw RGB data in an integer array.

The PXCMCapture.AudioStream.ReadStreamAsync function protocol is fixed as follows:

pxcmStatus ReadStreamAsync(out PXCMAudio audio, out PXCMScheduler.SyncPoint sp);

The previously definition mistakenly used “ref” on the audio parameter.

Change with Release 7383 (Gold):

None

Change with Release 6765 (Beta3):

The new PXCProjection interface exposes functions to map RGB, depth and world

coordinates. The Creative Senz 3D* I/O module exposes such interface. See the sdkmanual-core.pdf for details.

The SDK session implements the PXCMetaData interface to provide centralized

configuration storage. The application can derive the PXCMetaData instance from a

PXCSession instance.

The PXCMetadata interface exposes additional functions, AttachSerializable and

CreateSerializable to save or recreate serialization instances such as the

PXCProjection interface.

The module instance creation function CreateInstance in the PXCSessionService::

DLLExportTable structure is extended to take more input parameters for future

extension. The old interface is deprecated but still supported in this release.

The UtilPipeline utility classes added the following functions to start/stop modular

operations: PauseFaceLocation, PauseFaceLandmark, PauseGesture,

PauseVoiceRecognition.

The SetVoiceCommands function in the UtilPipeline utility classes is redesigned to take

a list of voice commands as input. The application can use this function during

recognition to change the active grammar.

The SetVoiceDictation function is added in the UtilPipeline utility classes to set the

dictation mode.

The Unity and Processing framework functions are aligned on behaviors for better

scalability. See the sdkmanual-framework.pdf for details.

Page 10: Intel® Perceptual Computing SDK 2013 GOLD Release Notes

Change with Release 5831 (Beta2):

Introduce the new PXCProjection interface which exposes the following methods

PROPERTY_DEPTH_CONFIDENCE_

THRESHOLD Set the depthmap flooring threshold.

PROPERTY_DEPTH_SMOOTHING Set the depthmap smoothing mode.

See sdkmanual-core.pdf for details.

The PXCCapture::VideoStream::DataDesc::DeviceCap structure is deprecated and

redefined as PXCCapture::Device::DeviceCap. See sdkmanual-core.pdf for details.

The application no longer needs to derive from the PXCBaseImpl template to implement

any callback functions. The application can implement the callback functions by directly

deriving from the corresponding callback interfaces. This change does not impact backward compatibility. See sdkmanual-core.pdf for details.

The PXCGesture interface exposes a parameter to configure the activation distance in

the PXCGesture::ProfileInfo structure. The default value is 40 centimeters. Objects

within this distance to the camera trigger hand tracking. It is however not recommended

that the application changes the default value, unless absolutely necessary. Different activation distances in different applications confuse users. See sdkmanual-gesture.pdf

for details. The PXCSmartPtr template exposes a member function IsValid for testing whether the

internal pointer is valid or not. See sdkmanual-core.pdf for details.

The depthmap and vertices visualization is improved in the PXCImage::AcquireAccess

function not to use any confidence values. This change reduces flickering. The depthmap and vertices are now visualized as gray images. See sdkmanual-core.pdf for details.

The UtilCapture utility class expands the SetFilter set of functions to preset device

properties. The function sets the device properties before delivering any device

properties to the algorithm modules. See sdkmanual-core.pdf for details.

The UtilCapture utility class expands the LocateStreams set of functions to search for

an I/O device that can serve both audio and video module data streaming needs. See

sdkmanual-core.pdf for details.

The UtilCaptureFile utility class now supports playing back WAV files. See

sdkmanual-core.pdf for details.

The UtilPipeline utility class is completely rewritten to be modular for supporting

more modules. The current implementation consists of UtilPipelineRaw,

UtilPipelineFace, UtilPipelineGesture, and UtilPipelineVoice that cover raw

stream capture, face detection and location, finger tracking, and voice recognition. The

UtilPipeline utility class exposes all member functions in above modular classes. See

sdkmanual-core.pdf for details.

The PXCUPipeline set of functions and all framework and game engine wrappers are

rewritten to be thread-safe. See sdkmanual-framework.pdf for details.

Changes with Release 5057 (Beta1):

This is the first publicly available release of the Intel® Perceptual Computing SDK 2013

Beta.

Page 11: Intel® Perceptual Computing SDK 2013 GOLD Release Notes

Known Issues

This section captures the currently known issues, which are targeted to be fixed in

subsequent GOLD update releases.

The Creative Senz 3D* camera, camera driver, and the I/O module

Issue Description Release

Lower depth and/or color frame rate may be observed when running some Creative cameras > 8 hours

12492 (Gold 7)

A saturated first frame (black frame) may be randomly encountered when recording

with gesture viewer (seen only on a few cameras)

9302

(Gold 3)

The camera’s automatic gain leveling can’t be disabled. 8779 (Gold 2)

The camera’s white balance settings are limited to only three settings. 8779 (Gold 2)

The camera’s balance auto-leveling settings persist after application close. 8779 (Gold 2)

White balance auto-leveling settings persist after the application closes. 8779 (Gold 2)

One application setting the RGB resolution for the camera may prevent another application simultaneously accessing the RGB with a different resolution

8779 (Gold 2)

The MapColorCoordinatesToDepth function of the PXCProjection interface is not

implemented.

6765 (Beta3)

The camera may not be able to stream data at the same time as any integrated

camera, if they are on the same USB bus, due to the USB bandwidth issue.

5057

(Beta1)

The camera and the camera driver do not yet support system suspension. The driver may occasionally hang or crash. The camera laser may be kept on during system suspension. To work around, set the computer in the high-performance mode and use only the AC mode.

5057 (Beta1)

The camera driver may not get any valid color or depth samples if an SDK application tries to access to the camera, when the camera has already been exclusively occupied by any Microsoft Media Foundation or DirectShow* capturing filters.

5057 (Beta1)

There is a slight delay every 10-12 frames in retrieving color and depth streams, due to buffering in the USB software stack. The RGB and depth streams are not

gen-locked. A future version of the camera will resolve this issue. The color and depth pictures are generally in sync within 3 frames.

5057 (Beta1)

The I/O capturing module cannot serve more than a single user on a multiple user system. Multiple accesses from applications of the same user are allowed.

5057 (Beta1)

If the developer enables C++ Exceptions in the Microsoft Visual Studio debugging 5057

Page 12: Intel® Perceptual Computing SDK 2013 GOLD Release Notes

mode, the application may receive first chance exceptions from the I/O capturing

module during the module startup or shutdown.

Workaround: disable the C++ exception (DebugExceptionsC++ Exceptions under

Microsoft Visual Studio 2010).

(Beta1)

The applications may take some time between 5 and 30 seconds to resume after the computer is resumed from the sleep/hibernate state. This timing is platform specific.

5057 (Beta1)

The camera is not compliant with the USB 2.0 High Speed Certification. A new revision of the camera is required to solve this issue.

5057 (Beta1)

The camera does not support the snapshot mode for the depth stream. A future version of the camera will resolve this issue.

5057 (Beta1)

When the camera is detected and then plugged out and back in within a period of less

than 5 seconds, the detection may fail to happen. To work around, wait for more than 5 seconds before plugging the camera back in.

5057

(Beta1)

Video recording is not working with Amcap* when recording in uncompressed mode. When starting the recording, the preview window is black. The speed of the recorded movie is four times slower. When using Amcap but recording in the .wmv mode, the preview windows is still black, but the recorded movie can be played back correctly. The same issue is not detected with other webcam recording tools.

5057 (Beta1)

The reflectivity and opacity of some materials as seen by the 3D camera can be different than seen in the visual spectrum. To work around, do not visualize depth map or confidence map.

5057 (Beta1)

Other I/O modules

Issue Description Release

The Microsoft Media Foundation* camera module does not support dynamic profile change. The application should avoid calling the SetProfile function more than once.

5057 (Beta1)

Gesture module

Issue Description Release

Possible increase in memory consumption when moving hand and fingers extremely close to the camera for extended periods of time.

8779 (Gold 2)

Incorrect geonodes reported for peace pose. The peace pose is correctly requiring that the index and middle finger be present in order to be identified but the geonodes

being reported as active during a peace pose are always the thumb and some other

finger (often the index finger).

8779 (Gold 2)

Finger tracking module

Issue Description Release

The alert LABEL_FOV_BLOCKED will not work when depthmap data is filtered. The SDK

by default filters depthmap data for better quality. To work around, set the device

5831

Page 13: Intel® Perceptual Computing SDK 2013 GOLD Release Notes

property PROPERTY_DEPTH_SMOOTHING to false to disable depthmap data smoothing. (Beta2)

The finger tracking module does not distinguish left and right hands. Use LABEL_BODY_HAND_PRIMARY/LABEL_BODY_HAND_SECONDARY for identifying the

first/second tracked hand, and the side field of the data structure to determine the

sidedness of the hand.

5057

(Beta1)

The finger tracking module does not label fingers accurately (i.e. thumb, index, middle, ring and pinky), unless all fingers are visible at least once to the camera.

5057 (Beta1)

The finger tracking module treats two merged fingers as one and tracks the middle of the two fingers.

5057 (Beta1)

The finger tracking module will confuse hand(s) and head if they are closer than 8 inches. To work around, design your application scenarios so that hands are naturally

away from one’s head at least 8 inches.

5057 (Beta1)

The CPU utilization may peak at 50% on 2nd generation Intel® Core™ processors. The average CPU utilization should be below 10%.

5057 (Beta1)

Face tracking module

Issue Description Release

When face is detected, and the face is moved up slowly (or the camera angle down) until the top of the face comes into contact with the top edge - the detected region becomes highly unstable. The detection rectangle flickers, and re-acquires about

every frame. This happens to a lesser degree against the edges/bottom of the detection area.

9870 (Gold 4)

VIEW_ANGLE_FRONTAL_ROLL option not working for face detector – was only an experimental feature. Plans to deprecate it. Contact SDK team via developer Forum in case this is needed for your app.

9870 (Gold 4)

Landmark detection too jittery to be used for gaming 8779 (Gold 2)

Facial detection very jittery at times even on still faces 8779 (Gold 2)

Face detection drops fairly frequently even when user is just sitting normally 8779 (Gold 2)

Face landmark detection noise from frame to frame is higher in some cases 8779 (Gold 2)

Head tracking too inaccurate for game use 8779

(Gold 2)

Head-coupled display sample is too jittery and unstable sometimes 8779 (Gold 2)

Page 14: Intel® Perceptual Computing SDK 2013 GOLD Release Notes

Face recognition component not adequate for face login 8779 (Gold 2)

Facial landmark detection rate drops when head is tilted more than ~10% left/right 8779 (Gold 2)

Facial recognition and detection rates drop when user is wearing glasses 8779 (Gold 2)

Landmark detection in the face tracking module does not support pose estimation. The function QueryPoseData always return PXC_STATUS_ITEM_UNAVAILABLE.

5831 (Beta2)

The face tracking module will lose tracking if the object moves too fast in front of the camera. The algorithm is designed for still or slow-motion image sequences.

5057 (Beta1)

Face size does not update as subject moves close to and further from camera when using the face_detection.exe sample. This effect is more evident when streaming

with higher resolutions.

6765 (Beta3)

Face analysis may perform very slowly at higher resolutions (1920x1080) (~10FPS) 6765 (Beta3)

Landmark detection doesn’t respond well to open mouths. With open mouth, you may see landmark detector switch mouth corner nodes to nose.

6765 (Beta3)

Motion/jitter correction threshold may be set too high when streaming high resolution. 6765 (Beta3)

If a face to be detected is tilted upward, the face_detection sample may not identify

the face to be the same person and rapidly update face identifier.

6765 (Beta3)

Nuance* voice recognition/synthesis modules

Issue Description Release

Each language model for dictation needs >500M memory. As the number of simultaneous dictation apps increase, the number of instances of the language model in memory too increases, thereby increasing the memory footprint.

12029 (Gold 6)

Slow voice dictation recognition may be observed when running for the first time after fresh install

10898 (Gold 5)

Voice dictation quality for other languages is not as good as US English 9302

(Gold 3)

If recording volume/level set too high or too low then voice recognition rates may

significantly drop. Refer to the Voice Best Known Methods in the documentation to tune tis for improved recognition.

8779 (Gold 2)

The voice recognition module supports multiword grammars when working in the

command and control mode. The application must set the endOfSentence

configuration parameter longer than the expected silence duration between words.

5831

(Beta2)

The voice recognition module recognizes short sentence up to 30 seconds when working in the dictation mode. To work around, always dictate sentences shorter than 30 seconds.

5831 (Beta2)

Page 15: Intel® Perceptual Computing SDK 2013 GOLD Release Notes

The voice synthesis module supports only a female or male voice output (varies by

language). The configuration parameter to choose different voices is ignored. 5831 (Beta2)

Total Immersion* object tracking module

Issue Description Release

Open D'Fusion Studio Select Menu -> ? -> Help

You will get error message stating help pdf file can't be found. But, the file in included in the packaging.

13842

(Gold 8)

The D’fusion studio tool switches between the classic and lite mode depending

on how it is started. The tool appears to start in lite mode by default but if a

new session is selected, the tool switches to classic mode. There is no flag in

the tool to switch between the modes. The outcome may change if the targets

are trained in the wrong mode.

12029

(Gold 6)

The Total Immersion Studio application may show motion glitches when

visualizing rotating objects in front of the Creative Camera. 10898

(Gold 5)

The object tracking module does not track very well with reflective materials or under

poor lighting conditions.

7383

(Gold)

Pulse Estimator:

Issue Description Release

The Algorithm imposes the following restrictions:

Keep as still as possible, and face the camera fully. Excess motion can disrupt estimation.

Keep a light source in front of you. Bright lights behind you may interfere with detection.

If possible, disable your camera's automatic white balance and exposure settings.

Experiment with profile settings until you find a combination that works well for you.

13842

(Gold 8)

Utilities and tools:

Issue Description Release

During recording, the UtilCaptureFile utility caches all images in memory until they

are successfully written to the disk. In some cases, the application may see constant memory increase if the disk I/O speed cannot catch up with the image recording speed. To work around, record only a small number of frames or use faster SSD disks.

9870 (GOLD R5)

The sdk_info (camera_info in beta2) cannot reliably retrieve information while other

applications are accessing the camera. Run this tool without any other applications

accessing the camera.

5057

(Beta1)

The sdk_info (camera_info in beta2) may not retrieve the TOF controller version and

the USB backend version reliably in some cases. In such case, the TOF controller version shows 65535. Use the rescan button to rescan the device to retrieve the right values.

5057 (Beta1)

Language ports, frameworks and game engines

Page 16: Intel® Perceptual Computing SDK 2013 GOLD Release Notes

Issue Description Release

Java process is still running in the background if the user clicks the Processing IDE stop

button (as opposite to closing the Processing rendering window.) This is a known Processing issue but not a SDK defect. The user should close the Processing rendering window.

6765

(Beta3)

Installer/Uninstaller

Description Release

DepthSenseSDK doesn't uninstall automatically – need manual removal from Add/Remove programs

12492 (Gold 7)

Random failures of language pack installation. This seems to happen sometimes when

all languages are selected for install. A reinstall works fine. Issue is being investigated. 9870

(Gold 4)

Installer may fail with the following error. Module Name DepthSense SDK 1.2.1.840 Win64-VS2010-Release redist signed.exe. Application returned Error Code 2. This is a Depth Sense SDK driver installer issue and is being looked into by SoftKinetic.

9302 (Gold 3)

Installation of SDK package without DA language pack can be accomplished by

following these steps: Deselect the DA Core will cause language packs to be deselected. Selecting DA Core again will cause it to be selected alone.

8779

(Gold 2)

Installation may fail if common registry keys are locked. A system reboot may be required.

8779 (Gold 2)

The web-based installer does not support proxy authentication for all existing

methods. If the installer doesn’t work for your proxy settings by default, you can try

the Internet Explorer settings by adding the “--system-engine-download” switch

(without quotes) to the installer command line, or use the offline installer.

7383

(Gold)

Text on splash screen could be cut off for Medium / Larger font system settings 7883 (Gold)

The installer does not remove registry keys with settings created by Unity demos 7883

(Gold)

The uninstaller does not remove user created files and registry keys. Remove them manually if needed.

5057 (Beta1)

Samples / Demos

Issue Description Release

These sample/demo apps are not compile-able under VS 2012 Express: capture_viewer, Remote Control.

These samples require the Microsoft MFC. 9870 (Gold 4)

RemoteContol demo does nothing when using any sensor (e.g.: voice and gesture) for

executing any action on "explorer" group (tried windows navigation and key press) on Ultrabook™ system with Win8 x64. The same steps on notebook with Win7 x64 works as expected. If any_active_application group option or non group application name like "My Computer" (explorer) is chosen, it works. Fails only group named "explorer"

9870 (Gold

4)

Page 17: Intel® Perceptual Computing SDK 2013 GOLD Release Notes

Head Coupled Perspective demo shows solid color instead of picture when the head is at the top

9870 (Gold 4)

The following message may appear when running a console application: “WARNING: Invalid Operation Exception: The main event loop is not running.” This message is harmless.

8779 (Gold 2)

Simple module vs2008 solution generates simple_module.exe instead of simple_module_d.exe when compiling in debug mode.

8779 (Gold 2)

The GlassBalls and AugmentedFarm demo applications may exhibit high CPU

utilization up to 75% under the DC mode.

7383 (Gold)

The AugmentedFarm demo requires Unity Pro 4.x or higher 7383 (Gold)

The HeadCoupledDisplay demo application may momentarily run slowly or pause on

some systems when moving the head too quickly

6765 (Beta3)

SDK Core

Issue Description Release

Illegal instruction issue may be observed on CPU which do not support Intel® Advanced Vector Extensions 13842 (Gold 8)

PerC SDK Application may see a memory leak if it creates and deletes PXCSession object many times during

execution, the leak is about <10Kb per session 12029 (Gold 6)

The following issues are fixed since 13842 (Gold R8)

The Creative Senz 3D* camera, camera driver, and the I/O module

Issue Description Release

A Timeout exception may be seen if Creative camera left plugged in for extended periods of time

13842 (Gold 8)

GUI samples Menu->Device does not work if multiple depthsense cameras are

connected -- only the first camera in list streams 13842

(Gold 8)

Older cameras show an offset in the alignment which affects the uvmapping and

projection mapping. This can be alleviated by adjusting the offset using the calibration tool in this release. However, If we move color principal point, some area of the color frame could lack depth data.

9870

(Gold 4)

When converting RGB24 to RGB32, the A value always hardcoded to 128 9302 (Gold 3)

If accessed through Microsoft Media Foundation*, the camera exposes two color streams. The second stream is not playable. (It does not deliver any data sample.) The SDK temporarily disables the second stream in the SDK Media Foundation I/O module

7383 (Gold)

Page 18: Intel® Perceptual Computing SDK 2013 GOLD Release Notes

and in the capture_viewer tool. Recommended to always access the camera through

the SDK capture module.

Setting device property DEVICE_PROPERTY_COLOR_GAMMA always returns

PXC_STATUS_DEVICE_FAILED.

5831

(Beta2)

When using a laptop in balanced power mode running on battery, the detection of the camera on the USB port may fail or only detect partial features. To work around, set the power option to the performance mode.

5057 (Beta1)

The camera depth stream may interfere with the color stream in low-light conditions, when the application retrieves both color and depth streams. There will be horizontal strips in the color images. To work around, use the camera in normal-light conditions, or use only the color or depth stream.

5057 (Beta1)

The camera LED indicator is partially working. A future version of the camera will solve

this issue.

5057

(Beta1)

The camera only supports audio sampling at 44.1KHz. Other sample rate may not work properly. To work around, always use the 44.1KHz sampling rate.

5057 (Beta1)

The RGB device properties such as brightness, hue and exposure are read only. Future improvement will enable setting these parameters.

5057 (Beta1)

32-bit applications and 64-bit applications cannot stream color/depth data at the same time. The camera data can be shared among 32-bit applications or 64-bit applications.

5057 (Beta1)

Finger tracking module

Issue Description Release

Incorrect geonode coordinates (duplicates in the coordinates fixed) 9302 (Gold 3)

When two hands are overlapping, the finger tracking module may incorrectly calculate the palm center position to be close to the forearm position. To reduce impact, design

use cases where two hands never overlap.

5057 (Beta1)

The position of the grabbing points (LABEL_HAND_UPPER, LABEL_HAND_MIDDLE,

LABEL_HAND_LOWER) may vary frame by frame.

5057 (Beta1)

If the Creative Senz 3D* camera is positioned on a table, the finger tracking module

may mistake the table or any additional unrelated objects as valid hands. To avoid this, always put the camera on top of the laptop lid. This is the only designed usage position.

5057 (Beta1)

The pose LABEL_POSE_THUMB_UP is not reliably recognized if performed outside the

camera’s tracking activation zone, about 40cm away from the camera. To work around, first move inside the activation zone so that the SDK module starts to track the fingers, then perform the thumb up pose.

5057 (Beta1)

Face Analysis

While setting face feature profiles on a different thread, the app may run into a race condition and experience

a crash 9302

Page 19: Intel® Perceptual Computing SDK 2013 GOLD Release Notes

(Gold 3)

Background Segmentation/Removal Module:

Older cameras show an offset in the alignment which affects the uvmapping and projection mapping. This

affects the quality of the segmented image. This will result in the segmented object being incorrectly offset

from the original image when displayed. This issue can be addressed by using the Calibration tool in the R4

release of the SDK.

However, If we move color principal point, some area of the color frame could lack depth

data.

9870 (Gold 4)

Nuance Voice Recognition/Synthesis module:

Issue Description Release

When using two instances of voice recognition sample in dictation mode, recognition results may not be identical

13842

(Gold 8)

Playback of long recorded files in 32bit app may freeze. The workaround is to record

by specifying the number of frames in the sample app. The playback works fine with 64 bit version of apps.

12029

(Gold 6)

Voice recognition[cs] samples in dictation mode do not report fails if someone else uses dictation (module busy).

12029 (Gold 6)

The Nuance core engine works in single instance of Dictation. If the application creates

multiple instances of the voice recognition (in Dictation) or voice synthesis module, only the first instance will function. Voice recognition and voice synthesis can function at the same time.

12029

(Gold 6)

Voice Recognition application may crash when there are multiple pipelines or multiple sessions of voice recognition

9302 (Gold 3)

The voice_recognition.cs sample may fail to return to a functional state after a few

times of resuming from S3/S4 power states. 6765 (Beta3)

The recognition rate of the voice recognition module may drop on systems in full load. To achieve the best recognition results, always use voice recognition when the CPU utilization is below 50%.

5831

(Beta2)

The voice recognition or synthesis module may hang an application if the user presses CTRL-C (CTRL-BREAK) to exit the application. To work around, do not interrupt the

Nuance module process.

5831 (Beta2)

The voice_recognition module may pop up a DOS window during exit. 6765 (Beta3)

Starting and stopping the voice recognition module may take up to 30 seconds. 5831

(Beta2)

The recognition latency may increase if the application delivers a large audio buffer at input. For responsiveness, it is recommended that the application streams audio data with an interval of 24-40 milliseconds.

5831 (Beta2)

The voice recognition module leaks memory. For about an hour, the memory increases from 11 MB to 40 MB.

5831 (Beta2)

Page 20: Intel® Perceptual Computing SDK 2013 GOLD Release Notes

Total Immersion* Object Tracking Module:

Issue Description Release

Possible unhandled exception may occur during a user callback when running DFusionStudioCV_IntelPCSDK.exe. This seems to occur when using an integrated

webcam instead of the Creative Senz 3D* Camera.

6765 (Beta3)

Utilities and Tools:

Issue Description Release

The trace feature in sdk_info is functional in Gold R6. The tool now shows preview of tracing controls and tracing information can be viewed as well.

12029 (Gold 6)

The Privacy Notification Tool (PNT) does not display info about applications that were launched prior to the launch of the PNT.

7383 (Gold)

The PNT may not correctly display the displaying activities in the capture_viewer tool. 6765 (Beta3)

The UtilCaptureFile recording utility implementation currently has a 2GB limitation

on 32-bit systems.

5057 (Beta1)

The source under sample/common is prebuilt in libpxcutils.lib against Microsoft

Visual Studio 2008 and Microsoft Visual Studio 2010 projects. For other compiler versions, the developer will have to rebuild them against the source.

5057 (Beta1)

The PNT must be launched with run as administrator privilege so it can insert itself in

the taskbar tray.

5057 (Beta1)

Language ports, frameworks and game engines

Issue Description Release

The precompiled library ipppxcu_l.lib (used to support Unity, Processing and other frameworks) lacked the

Intel® AVX2 code path, which resulted in crash on Intel® Third Generation Intel processors. The fix added

the Intel® AVX2 code path.

9302

(Gold 3)

Java process is still running in the background if the user closes the Processing IDE window (as opposite to closing the Processing rendering window.) This is a known issue but not a SDK defect. The user should close the Processing rendering window

then the Processing IDE window.

5057 (Beta1)

The Unity sample function PXCUPipeline.Init is hard-coded to initialize only once.

This is necessary as the Unity engine calls the function multiple times during

initialization. To change to a different configuration, restart Unity.

5057 (Beta1)

Installer/Uninstaller:

Description Release

The SDK installer does not correctly detect Microsoft Windows XP 64-bit as an unsupported OS. During installation, some SDK components will refuse to install. To

6765 (Beta3)

Page 21: Intel® Perceptual Computing SDK 2013 GOLD Release Notes

work around, do not install SDK on Microsoft Windows XP.

The Intel64 runtime may not be installed by developer package if newer version of

IA32 runtime exists on system.

6765

(Beta3)

Uninstallation of developer package may not work if newer runtime is installed on top of it.

6765 (Beta3)

In certain cases, the SDK installer/uninstaller may not be able to cleanly exit the

privacy notification utility, which runs as a taskbar service. The installation or uninstallation process will terminate. To work around, manually exit the privacy notification utility before installing a new SDK version or uninstalling an existing SDK version.

5831

(Beta2)

To fully uninstall the Nuance Dragon Assistant Core Recognition Service and Dragon Assistant Language Data modules, you must select each one ands right click uninstall

in the Program and Features application in the control panel or you can just double click the intel_pc_sdk_voice_ia32_11.exe installer

5831 (Beta2)

Samples and Demos:

Issue Description Release

camera_viewer sample description is documented in CHM and PDF, is yet

needs to be added in HTML documentation. 13842 (Gold 8)

Compile error may be seen in gesture_viewer and other samples under VS2012, need to reference windows.h

instead of afxres.h for addressing this. PCSDK (Gold 6)

Capture Viewer app hangs while streaming from more than one camera. 9302 (Gold 3)

The installer package missed including prebuilt executable for the AugmentedFarm

demo and the GlassBalls demo. To work around, run the demos in the Unity Pro

editor 3.5.1 (the GlassBalls demo) and 4.1 (the AugmentedFarm demo.)

7383 (Gold)

The audio_recorder.exe sample may crash when unplugging USB camera

w/microphone during live streaming.

6765 (Beta3)

The depth_smoothing.exe sample may crash when unplugging USB camera

w/microphone during live streaming.

6765 (Beta3)

Possible memory leaks when face and landmark detection samples. The rate may vary from 5MB-15MB per hour of working set memory.

6765 (Beta3)

The gesture_viewer.exe sample may detect hand points on the face particularly if the

face is too close hands.

6765

(Beta3)