Esper FAQ

General

What is Complex Event Processing (CEP)?
Where does Complex Event Processing fit in the 4D model (Detect-Derive-Decide-Do)
Can you compare Esper to stream processing platforms please?
How does Esper compare to other CEP products?
How does Esper scale?
Is Esper memory efficient?
What part of Esper is in-memory computing?
What latency can be achieved and is it "real-time"?
What are the advantages of Esper's Event Processing Language (EPL)?
What business areas/problems is Esper best suited for?
Please clarify some common misconceptions?
What might be some misuses for it?
What is the intended audience and what is their interface?
Who uses Esper?
What is the concept or philosophy behind the design?
How has this been tested? What guarantees do I have that the next release works just as well?
How do you issue bug fixes or patches? How are problems tracked?
What operating systems has it been tested on?
It claims to be fast...how does it do that? Has this claim been tested?
Do you have any benchmarks available for Esper?
Can Esper handle a large number of statements?

How does it work?

How does Esper work? How does Esper allow you to search and match patterns on temporal events?
What algorithms does Esper use? Is it based on research?
What is the difference between Esper and an in-memory database?
How does the runtime discern which data to retain? Is it based solely on the statements registered with the runtime at the time the event comes in?
What or how many events does Esper keep in memory? Does Esper keep matching events of a statement in memory?
What happens on runtime start? I assume that if I have a time based statement, since there's no history within the runtime, there's no way to get any events to fire until the time and events have been consumed?
Can statements be added to the system only on runtime start, or can they be added dynamically? Will those statements work with any internally stored historical data when they're started?
When working with composite streams, i.e. when using the 'insert into' mechanism, does the entity being inserted into have to be a registered object in the system or are those created simply by registering the statement?
Could you explain the concept of data windows for a database programmer?
What is the difference between "select * from MyEvent" and "select* from MyEvent#length(1)" and "select * from MyEvent#keepall()" ?
What happens if I send events to which no statement is created yet? Can a time-to-live be specified to retry matching an event?
How can the timestamp for an event be explicitly controlled? What to do with event occurrence time, time synchronization or event timestamps?
Does the runtime make a copy of events?

Integration

What additional components does Esper require to run?
Can I run it with multiple threads? What, if anything, is multithread-safe?
What is the footprint of Esper in a typical installation, i.e. what is the RAM, disk and CPU usage?
If one overloads Esper with events through a queue, will Esper queue events internally until it processes them or will events stay in the queue until Esper process them?
Is there a way to send a bunch of events into Esper and get back a notification when it is done with processing all events?
What is the policy you recommend on UpdateListener (or Observer) if UpdateListener does long processing? For example, why don't you create a thread or attach a queue for output in some examples?
Have you tested Esper in an OSGI container?
What ClassLoader does it use? How do I get the class loading right in OSGI, Apache Axis or other containers?
Can Esper run on small devices?
How do I test EPL? Is there integration with a testing framework?

General

What is Complex Event Processing (CEP)?

Complex event processing, or CEP, is event processing that combines data from multiple sources to infer events or patterns that suggest more complicated circumstances. The goal of complex event processing is to identify meaningful events (such as opportunities or threats) and respond to them as quickly as possible (source: Wikipedia).

CEP aims at detecting situations. CEP is not a general-purpose application code container or a distributed processing platform. CEP helps in detecting situations by providing a declarative language (event processing language, EPL) or other abstractions to make situation detection easier and faster.

Typically, CEP is state-ful analysis since in order to detect situations there is a need to remember certain things. For example for simple counting we need to remember the current count and for pattern matching we need to remember partially-matched patterns. The current count and partially-matched patterns are for this example the state to keep.

Typically, situations must be detected as soon as they occur. Detection latency is the time between an event arriving and a situation being detected. CEP aims for detection latencies that are clearly below 1 second. Detection latencies between 1 nano-second to 1 millisecond are often desired and are achievable with CEP.

Typically, CEP deals with large amounts of small data points (events) arriving. Many of those events may cause state to change. Therefore state can be frequently updated and fast-changing.

Typically, CEP analysis means that the state is not the events themselves but the information derived from the events. For example, when simply counting, the derived information is the count and the event itself contributing to the count is not remembered. For example, when pattern matching, that fact that an event arrived is state however the events contributing to the pattern match may not need to be remembered.

Typically, CEP is interested in time passing which can be event-time or other time. State can change and expire based on time passing.

Where does Complex Event Processing fit in the 4D model (Detect-Derive-Decide-Do)

The purpose of CEP is analyzing events and finding situations of interest. CEP detects and derives information, so you can become aware of a situation immediately and react in the best possible way.

An example situation to be detected is: A suspicious account is derived whenever there are at least three large cash deposits in the last 15 days.

The "Detect" is about the raw event, for example a cash deposit event.
The "Derive" is about the situation, i.e. "did something happen?", for example there is a suspicious account.
The "Decide" is about the decision of what to do, for example the decision to determine a risk score or determine another course of action
The "Do" is the action, for example an action that opens an investigation

The "Detect" and "Derive" are the responsibility of CEP. CEP is time and event-driven and continuous in nature. It deals with a stream (historical or currently-arriving) of pre-defined but open-ended events, different event types, along with associated event data and may have more than one input source.

The "Decide" is sometimes handled by decision management tools or rules engines, as their strength lies in decision tables and fact based analysis. Decision management tools are generally request-driven seeking conclusion to a current business decision by running a fact analysis with one execution.

The "Do" is sometimes handled by business process or workflow tools.

Â	Complex Event Processing	Stream Processing
Also Known As	CEP, event stream processing, event series analysis	Real-time computation, stream computing
Â	Â	Â
Example Providers	Esper	(in no particular order, just to mention a few, there are significant others) Storm Spark Streaming Samza Flink
Â	Â	Â
Type	Business Intelligence and Decision-Making	Container Technology. Containers are mutually exclusive to each other; i.e. running Storm within Spark Streaming or J2EE does not make sense.
Â	Â	Â
Framework/Platform versus Language/Compiler/Runtime	Esper is a language, a compiler and a runtime (similar to Scala) that operates on top of the JVM. Esper, EsperHA and some parts of Enterprise Edition are components that can run as part of any JVM-based software stack. Enterprise Edition provides both a masterless horizontal scale-out platform as well as a more server-centric architecture.	Framework, platform or system.
Â	Â	Â
Pattern matching and detection, filtering, transformation, aggregation, event hierarchies, detecting relationships (such as causality, membership or timing) between events, managing event lifecycle	Central to Esper and CEP	Not central to stream processing
Â	Â	Â
Transporting events between processes and hosts	Not central to Esper and CEP in general. Enterprise Edition however addresses this requirement.	Central to stream processing
Â	Â	Â
Distribution and fault-tolerance	A central concern of Enterprise Edition.	A central concern
Â	Â	Â
Embeddable runtime	Esper, EsperHA and Enterprise Edition components are embeddable into any JVM process regardless of JVM language. They are supported by EsperTech when used with Storm, Samza, Spark, Flink and Akka.	Not generally embeddable (with exceptions), may require specific JVM launch and OS
Â	Â	Â
Schemas	Larger variety of schema types. Less likely to be unstructured data.	Likely more relational and flat but not necessarily, sometimes unstructured data.
Â	Â	Â
Sharing events among many use cases	For CEP, sharing events across many statements or many patterns is a central problem.	Topologies typically have few hand-programmed operators. Tends to place a higher emphasis on high data volumes with relatively fewer statements.
Â	Â	Â
Continuous Queries (Statements)	Express stream analysis in event processing language (EPL); Compile using the compiler; Deploy into a runtime; (all at runtime); no need to restart the server or container	Code your own operators, subclass framework classes, package, configure servers, deploy, stop the process or job, redeploy; No means to add continuous queries on-the-fly
Â	Â	Â
Target	Analysis	Extract-Transform-Load (ETL), Distributed Remote Procedure Call (DRPC), Integrating systems, Simple aggregation

Element	Class %	Method %	Line %
com.espertech.esper.common	93%	75%	80%
com.espertech.esper.compiler	97%	58%	78%
com.espertech.esper.runtime	67%	61%	69%

General

What is Complex Event Processing (CEP)?

Where does Complex Event Processing fit in the 4D model (Detect-Derive-Decide-Do)

Can you compare Esper to stream processing platforms please?

How does Esper compare to other CEP products?

How does Esper scale?

Is Esper memory efficient?

What part of Esper is in-memory computing?

What latency can be achieved and is it "real-time"?

What are the advantages of Esper's Event Processing Language (EPL)?

What business areas/problems is Esper best suited for?

Please clarify some common misconceptions?

What might be some misuses for it?

What is the intended audience and what is their interface?

Who uses Esper?

What is the concept or philosophy behind the design?

How has this been tested? What guarantees do I have that the next release works just as well?

How do you issue bug fixes or patches? How are problems tracked?

What operating systems has it been tested on?

It claims to be fast...how does it do that? Has this claim been tested?

Do you have any benchmarks available for Esper?

Can Esper handle a large number of statements?

How does it work?

How does Esper work? How does Esper allow you to search and match patterns on temporal events?

What algorithms does Esper use? Is it based on research?

What is the difference between Esper and an in-memory database?

How does the runtime discern which data to retain? Is it based solely on the statements registered with the runtime at the time the event comes in?

What or how many events does Esper keep in memory? Does Esper keep matching events of a statement in memory?

What happens on runtime start? I assume that if I have a time based statement, since there's no history within the runtime, there's no way to get any events to fire until the time and events have been consumed?

Can statements be added to the system only on runtime start, or can they be added dynamically? Will those statements work with any internally stored historical data when they're started?

When working with composite streams, i.e. when using the 'insert into' mechanism, does the entity being inserted into have to be a registered object in the system or are those created simply by registering the statement?

Could you explain the concept of data windows for a database programmer?

What is the difference between "select * from MyEvent" and "select* from MyEvent#length(1)" and "select * from MyEvent#keepall()" ?

What happens if I send events to which no statement is created yet? Can a time-to-live be specified to retry matching an event?

How can the timestamp for an event be explicitly controlled? What to do with event occurrence time, time synchronization or event timestamps?

Does the runtime make a copy of events?

Integration

What additional components does Esper require to run?

Can I run it with multiple threads? What, if anything, is multithread-safe?

What is the footprint of Esper in a typical installation, i.e. what is theRAM, disk and CPU usage?

If one overloads Esper with events through a queue, will Esper queue events internally until it processes them or will events stay in the queue until Esper process them?

Is there a way to send a bunch of events into Esper and get back a notification whenit is done with processing all events?

What is the policy you recommend on UpdateListener (or Observer) if UpdateListener does long processing? For example, why don't you create a thread or attach a queue for output in some examples?

Have you tested Esper in an OSGI container?

What ClassLoader does it use? How do I get the class loading right in OSGI, ApacheAxis or other containers?

Can Esper run on small devices?

How do I test EPL? Is there integration with a testing framework?