String encoding and unicode issues¶

This section mostly concerns Python 2.

When the documentation says that a str is accepted, an unicode is also always accepted on Python 2. What is more, when the documentation says that a str is returned or passed to a callback, on Python 2 it is actually a unicode (you mostly don’t need to care about that though, because most string operations on Python 2 allow mixing str and unicode).

When passing a bytes object (or, equivalently, a str object on Python 2) to a SDK function that says that it accepts a str, the bytes will be interpreted as being UTF-8 encoded. Beware: If the string has invalid UTF-8 (e.g. Latin-1/ISO-8859-1, as it may occur in HTTP headers), the function to which it was passed may fail either partially or fully. Such failures are guaranteed to neither throw exceptions nor violate any invariants of the involved objects, but some or all of the information passed in that function call may be lost (E.g. a single invalid HTTP header passed to oneagent.sdk.SDK.trace_incoming_web_request() may cause an null-tracer to be returned – but it is also allowed to e.g. truncate that HTTP header and discard all that follow; the exact failure mode is undefined and you should take care to not pass invalid strings). Also, the diagnostic callback (oneagent.sdk.SDK.set_diagnostic_callback()) may be invoked (but is not guaranteed to).

String encoding and unicode issues¶

Previous topic

Next topic