Other formats

This page describes our review of existing formats and tries to explain, why we are not using one of them. It works as summarization of benefits and drawbacks of each of them, together with our subjective remarks.

Intrusion Detection Message Exchange Format is format created exactly for exchange of information about security events between detection probes.

It is based on and very tightly coupled with XML and its structure makes heavy use of its paradigms.

It has rigid structure and some fields are dynamic (it is for example able to represent more sources or destinations of the attack). Structure is also very deep and wordy, for example to address attack source IP address, we have to use locator “Alert.Source.Node.Address.address”. Some fields are also recursive, so depth of its structure is not limited and can be arbitrary.

IDMEF has limited means to allow extensibility:

by global key/value pairs, which are however tied to message, not to specific source, target or other key
by own XML namespace in specific place of the structure (which brings in need for full fledged XML parser and higher complexity in IDMEF tools)

Specification often mentions subclassing and aggregation, however these are reserved for specification authors and (possibly) for some next official versions of IDMEF.

Format is often redundant – timestamps are represented both in machine readable format and human representation, which creates ambiguity in case of error (which representation should be authoritative?).

Also it is sometimes inconsistent – it supports long list of historic (or even obsolete) network protocols in one field, however URL and SNMP are classes on its own.

Incident type is free text, but there exist specific classes for buffer overflow, correlation, tool – but for nothing else.

Format can be validated against the thoroughly defined schema (which is a good thing), however schema itself is not enough – some cases must be validated specifically (for example timestamps, IP addresses).

IDMEF is in fact the basis for our design, it is format which is able to describe the widest number of incident types. The basic problems are its verbosity, depth and attempts to describe “everything”. According to our experiences, several means to group and structure various types of information are never used in real life scenarios. Its drawback is also the need for complex libraries – lightweight detection probes are usually not able to generate IDMEF messages directly and need some kind of intermediate format and translator.

Other formats

IDMEF

X-ARF

IODEF

Warden

AbuseHelper