Esper Reference

The example below aggregates the price of each OrderEvent in the last 10 seconds, computing a total price:

{1, 2, 3}.aggregate(0, (result, value) => result + value)  // Returns 6

// Initialization value is zero.
// Aggregate by adding up the price.
select window(*).aggregate(0, (result, order) => result + order.price) as totalPrice
from OrderEvent#time(10)

In the statement above, the initialization value is zero, result holds the last aggregated value, and order denotes the current element, whose price property the expression adds to the result.

This example aggregation builds a comma-separated list of all asset ids of all items:

select items.aggregate('', 
  (result, item) => result || (case when result='' then '' else ',' end) || item.assetId) as assets			
from LocationReport

In the above statement, the empty string '' represents the initialization value. The name result is used for the last aggregated value and the name item is used to denote the element.

The type of the value returned by the initialization expression must match the type of the value returned by the accumulator lambda expression.

If the input is null the method returns null. If the input is empty the method returns the initialization value.
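The accumulation rule described above is a left fold. As a rough plain-Java analogue (not Esper code; the class and method names are hypothetical and chosen only for illustration), the null-input and empty-input behavior can be sketched as:

```java
import java.util.List;
import java.util.function.BiFunction;

final class AggregateSketch {
    // Left fold over the input. Mirrors the documented rules:
    // null input yields null, empty input yields the initialization value.
    static <T, R> R aggregate(List<T> input, R seed, BiFunction<R, T, R> accumulator) {
        if (input == null) {
            return null;
        }
        R result = seed;
        for (T element : input) {
            // "result" is the last aggregated value, "element" the current item
            result = accumulator.apply(result, element);
        }
        return result;
    }

    public static void main(String[] args) {
        // Sum of scalar values, like {1, 2, 3}.aggregate(0, ...)
        System.out.println(aggregate(List.of(1, 2, 3), 0, (result, value) -> result + value));
        // Comma-separated list, like the assets example
        System.out.println(aggregate(List.of("A1", "B2"), "",
                (result, id) -> result + (result.isEmpty() ? "" : ",") + id));
    }
}
```

The seed and the accumulator result share the type parameter R, which corresponds to the requirement that the initialization expression and the accumulator lambda return matching types.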

11.6.2. AllOf

The allof enumeration method determines whether all elements satisfy the predicate condition.

The method takes a single parameter: The predicate lambda expression that must yield a Boolean result. The enumeration method applies the lambda expression to each element and if the expression returns true for all elements, the method returns true.

The statement below returns true when all items are within 1000 unit distance of center:

{1, 2, 3}.allOf(v => v > 0)    // Returns true as all values are > 0
{1, 2, 3}.allOf(v => v > 1)    // Returns false

select items.allof(i => distance(i.location.x, i.location.y, 0, 0) < 1000) as centered			
from LocationReport

If the input is null the method returns null. If the input is empty the method returns true.

11.6.3. AnyOf

The anyof enumeration method determines whether any element satisfies the predicate condition.

The only parameter is the predicate lambda expression that must yield a Boolean result. The enumeration method applies the lambda expression to each element and if the expression returns true for at least one element, the method returns true.

The statement below returns true when any of the items are within 10 unit distance of center:

{1, 2, 3}.anyOf(v => v > 0)    // Returns true
{1, 2, 3}.anyOf(v => v > 1)    // Returns true
{1, 2, 3}.anyOf(v => v > 3)    // Returns false

select items.anyof(i => distance(i.location.x, i.location.y, 0, 0) < 10) as centered			
from LocationReport

If the input is null the method returns null. If the input is empty the method returns false.
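The different empty-input results of allOf (true) and anyOf (false) follow naturally from a loop that stops at the first counter-example or the first match. The following plain-Java sketch illustrates this; it is an analogue with hypothetical names, not the Esper implementation:

```java
import java.util.List;
import java.util.function.Predicate;

final class PredicateSketch {
    // True unless some element fails the predicate; an empty input has no failing element.
    static <T> Boolean allOf(List<T> input, Predicate<T> predicate) {
        if (input == null) {
            return null;
        }
        for (T element : input) {
            if (!predicate.test(element)) {
                return false;
            }
        }
        return true;
    }

    // False unless some element passes the predicate; an empty input has no passing element.
    static <T> Boolean anyOf(List<T> input, Predicate<T> predicate) {
        if (input == null) {
            return null;
        }
        for (T element : input) {
            if (predicate.test(element)) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        System.out.println(allOf(List.of(1, 2, 3), v -> v > 1));
        System.out.println(anyOf(List.of(1, 2, 3), v -> v > 1));
    }
}
```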

11.6.4. Average

The average enumeration method computes the average of scalar values. If passing a projection lambda expression the method computes the average obtained by invoking the projection lambda expression on each element.

The method takes a projection lambda expression yielding a numeric value as a parameter. It applies the lambda expression to each element and computes the average of the result, returning a Double value. A BigDecimal is returned for expressions returning BigInteger or BigDecimal.

The statement as shown next computes the average distance from center among all items in the location report event:

{1, 2, 3}.average()    // Returns 2

select items.average(i => distance(i.location.x, i.location.y, 0, 0)) as avgdistance
from LocationReport

If the input is null the method returns null. If the input is empty the method returns double zero or BigDecimal zero. For BigDecimal precision and rounding, please see Section 16.5.6.5, “Math Context”.

11.6.5. CountOf

The countof enumeration method returns the number of elements, or the number of elements that satisfy a condition.

The enumeration method has two versions: The first version takes no parameters and computes the number of elements. The second version takes a predicate lambda expression that must yield Boolean true or false, and computes the number of elements that satisfy the condition.

The next sample statement counts the number of items:

{1, 2, 3}.countOf()    // Returns 3
{1, 2, 3}.countOf(v => v < 2)    // Returns 1

select items.countOf() as cnt from LocationReport

This example statement counts the number of items that have a distance to center that is less than 20 units:

select items.countOf(i => distance(i.location.x, i.location.y, 0, 0) < 20) as cntcenter
from LocationReport

If the input is null the method returns null. If the input is empty the method returns integer zero.

11.6.6. DistinctOf

The distinctOf enumeration method returns distinct elements.

The enumeration method can take a single key-selector lambda expression as parameter and returns distinct elements according to the key yielded by the expression. For same-value keys, distinctOf returns the first element for that key.

This example returns items distinct by item id returning the first item for each distinct item id:

{2, 3, 2, 1}.distinctOf()   // Returns {2, 3, 1}

select items.distinctOf(i => itemId) as itemsNearFirst
from LocationReport

The key-selector lambda expression, when provided, must return a comparable type: Any primitive or boxed or Comparable type is permitted.

If the input is null the method returns null. If the input is empty the method returns an empty collection.
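The "first element per key" rule can be pictured as an insertion-ordered map that never overwrites an existing key. A minimal plain-Java sketch under that assumption (hypothetical names, not Esper's implementation):

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Function;

final class DistinctOfSketch {
    // Keeps the first element seen for each key, in encounter order.
    static <T, K> List<T> distinctOf(List<T> input, Function<T, K> keySelector) {
        if (input == null) {
            return null;
        }
        Map<K, T> firstPerKey = new LinkedHashMap<>();
        for (T element : input) {
            K key = keySelector.apply(element);
            if (!firstPerKey.containsKey(key)) {
                firstPerKey.put(key, element);
            }
        }
        return new ArrayList<>(firstPerKey.values());
    }

    public static void main(String[] args) {
        // Scalar case, like {2, 3, 2, 1}.distinctOf(): the key is the value itself
        System.out.println(distinctOf(List.of(2, 3, 2, 1), v -> v));
        // Keyed case: distinct by first character, keeping the first element per key
        System.out.println(distinctOf(List.of("a1", "b1", "a2"), s -> s.charAt(0)));
    }
}
```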

11.6.7. Except

The except enumeration method forms a set difference of the input elements with the elements that the parameter expression yields.

The enumeration method takes a single parameter that must itself return a collection of events, objects or scalar values. The method returns the elements of the first collection that do not appear in the second collection.

The following statement compares the items of the last location report against all items in the previous 10 location reports, and reports for each combination only those items in the current item report that are not also in the location report compared to:

{1, 2, 3}.except({1})   // Returns {2, 3}

select za.items.except(zb.items) as itemsCompared 
from LocationReport as za unidirectional, LocationReport#length(10) as zb

If the input is null the method returns null. For scalar values and objects equals-semantics apply.

11.6.8. FirstOf

The firstOf enumeration method returns the first element or the first element that satisfies a condition.

The method has two versions: The first version takes no parameters and returns the first element. The second version takes a predicate lambda expression yielding true or false. It applies the lambda expression to each element and returns the first element for which the expression returns true. The return type is the element itself and not a collection. You may append a property name to return the property value for the first element.

In the following EPL sample the statement returns the first item that has a distance to center that is less than 20 units:

{1, 2, 3}.firstOf()   // Returns 1
{1, 2, 3}.firstOf(v => v / 2 > 1)   // Returns 3

select items.firstof(i => distance(i.location.x, i.location.y, 0, 0) < 20) as firstcenter
from LocationReport

The next sample EPL returns the first item's asset id:

select items.firstof().assetId as firstAssetId from LocationReport

If the input is null, empty or if none of the elements match the condition the method returns null.

11.6.9. GroupBy

The groupby enumeration method groups the elements according to a specified key-selector lambda expression. There are two versions of the groupby method.

The first version of the method takes a key-selector lambda expression and returns a Map of key with each value a list of objects, one for each distinct key that was encountered. The result is a Map<Object, Collection<Object>> wherein object is the event underlying object.

The second version of the method takes a key-selector lambda expression and value-selector lambda expression and returns a Map of key with each value a list of values, one for each distinct key that was encountered. The result is a Map<Object, Collection<Object>> wherein object is the result of applying the value-selector expression.

The next statement filters out all luggage items using a where method and then groups by the luggage's passenger asset id. It returns a map of passenger asset id and the collection of luggage items for each passenger:

select items.where(type='L').groupby(i => assetIdPassenger) as luggagePerPerson
from LocationReport

The statement shown below generates a map of item asset id and distance to center:

select items.groupby(
    k => assetId, v => distance(v.location.x, v.location.y, 0, 0)) as distancePerItem
from LocationReport

If the input is null the method returns null. Null values as key and value are allowed.
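The shape of the result of the second version, a map from key to a collection of selected values, can be sketched in plain Java as follows. This is an illustrative analogue only; the names are hypothetical, and the insertion-ordered map is an assumption of the sketch rather than documented Esper behavior:

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Function;

final class GroupBySketch {
    // Groups the selected values under the key computed for each element.
    static <T, K, V> Map<K, List<V>> groupBy(List<T> input,
                                             Function<T, K> keySelector,
                                             Function<T, V> valueSelector) {
        if (input == null) {
            return null;
        }
        Map<K, List<V>> groups = new LinkedHashMap<>();
        for (T element : input) {
            // LinkedHashMap and ArrayList both accept null keys and null values
            groups.computeIfAbsent(keySelector.apply(element), k -> new ArrayList<>())
                  .add(valueSelector.apply(element));
        }
        return groups;
    }

    public static void main(String[] args) {
        // Elements encoded as "passenger:luggage" purely for the example
        List<String> items = List.of("P1:L1", "P1:L2", "P2:L3");
        System.out.println(groupBy(items, s -> s.substring(0, 2), s -> s.substring(3)));
    }
}
```

Passing the identity function as the value selector corresponds to the first version, where each map value is the collection of elements themselves.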

11.6.10. Intersect

The intersect enumeration method forms a set intersection of the input elements with the elements that the parameter expression yields.

The following statement compares the items of the last location report against all items in the previous 10 location reports, and reports for each combination all items in the current item report that also occur in the other location report:

{1, 2, 3}.intersect({2, 3})   // Returns {2, 3}

select za.items.intersect(zb.items) as itemsCompared 
from LocationReport as za unidirectional, LocationReport#length(10) as zb

If the input is null the method returns null. For scalar values and objects equals-semantics apply.

11.6.11. LastOf

The lastOf enumeration method returns the last element or the last element that satisfies a condition.

The method has two versions: The first version takes no parameters and returns the last element. The second version takes a predicate lambda expression yielding true or false. It applies the lambda expression to each element and returns the last element for which the expression returns true. The return type is the element itself and not a collection. You may append a property name to return the property value for the last element.

In the following EPL sample the statement returns the last item that has a distance to center that is less than 20 units:

{1, 2, 3}.lastOf()   // Returns 3
{1, 2, 3}.lastOf(v => v < 3)   // Returns 2

select items.lastof(i => distance(i.location.x, i.location.y, 0, 0) < 20) as lastcenter 
from LocationReport

The next sample EPL returns the last item's asset id:

select items.lastof().assetId as lastAssetId from LocationReport

If the input is null, empty or if none of the elements match the condition the method returns null.

11.6.12. LeastFrequent

The leastFrequent enumeration method returns the least frequent value among a collection of values, or the least frequent value after applying a transform expression to each element.

The method has two versions: The first version takes no parameters and returns the least frequent value. The second version takes a transform lambda expression yielding the value to count occurrences for. The method applies the lambda expression to each element and returns the expression result value with the least number of occurrences. The return type is the type of value in the collection or the type of value returned by the transform lambda expression if one was provided.

The example EPL below returns the least frequent item type, counting the distinct item types among all items for the current LocationReport event:

{1, 2, 3, 2, 1}.leastFrequent()   // Returns 3

select items.leastFrequent(i => type) as leastFreqType from LocationReport

If the input is null or empty the method returns null. The transform expression may also yield null. A null value can be returned as the least frequent value if the least frequent value is null. If multiple values have the same number of occurrences the method returns the first value with the least number of occurrences considering the ordering of the collection.

11.6.13. Max

The max enumeration method returns the maximum value among a collection of values.

If no value-selector lambda expression is provided, the method finds the maximum.

If a value-selector lambda expression is provided, the enumeration method invokes a value-selector lambda expression on each element and returns the maximum value. The type of value returned follows the return type of the lambda expression that was provided as parameter.

The next statement returns the maximum distance of any item from center:

{1, 2, 3, 2, 1}.max()   // Returns 3

select items.max(i => distance(i.location.x, i.location.y, 0, 0)) as maxcenter 
from LocationReport

The value-selector lambda expression must return a comparable type: Any primitive or boxed type or Comparable type is permitted.

If the input is null, empty or if none of the elements when transformed return a non-null value the method returns null.

11.6.14. MaxBy

The maxBy enumeration method returns the element that provides the maximum value returned by the value-selector lambda expression when applied to each element.

The enumeration method returns the element itself. You may append an event property name to return a property value of the element.

The next statement returns the first item with the maximum distance to center:

select items.maxBy(i => distance(i.location.x, i.location.y, 0, 0)) as maxItemCenter 
from LocationReport

The next sample returns the type of the item with the largest asset id (string comparison) among all items:

select items.maxBy(i => assetId).type as maxAssetId from LocationReport

The transform expression must return a comparable type: Any primitive or boxed type or Comparable type is permitted.

If the input is null, empty or if none of the elements when transformed return a non-null value the method returns null.

11.6.15. Min

The min enumeration method returns the minimum value among a collection of values.

If no value-selector lambda expression is provided, the method finds the minimum.

If a value-selector lambda expression is provided, the enumeration method invokes a value-selector lambda expression on each element and returns the minimum value. The type of value returned follows the return type of the lambda expression that was provided as parameter.

The next statement returns the minimum distance of any item to center:

{1, 2, 3, 2, 1}.min()   // Returns 1

select items.min(i => distance(i.location.x, i.location.y, 0, 0)) as mincenter 
from LocationReport

The transform expression must return a comparable type: Any primitive or boxed type or Comparable type is permitted.

If the input is null, empty or if none of the elements when transformed return a non-null value the method returns null.

11.6.16. MinBy

The minBy enumeration method returns the element that provides the minimum value returned by the value-selector lambda expression when applied to each element.

The enumeration method returns the element itself. You may append an event property name to return a property value of the element.

The next statement returns the first item with the minimum distance to center:

select items.minBy(i => distance(i.location.x, i.location.y, 0, 0)) as minItemCenter 
from LocationReport

The next sample returns the type of the item with the smallest asset id (string comparison) among all items:

select items.minBy(i => assetId).type as minAssetId from LocationReport

The transform expression must return a comparable type: Any primitive or boxed or Comparable type is permitted.

If the input is null, empty or if none of the elements when transformed return a non-null value the method returns null.

11.6.17. MostFrequent

The mostFrequent enumeration method returns the most frequent value among a collection of values, or the most frequent value after applying a transform expression to each element.

The method has two versions: The first version takes no parameters and returns the most frequent value. The second version takes a transform lambda expression yielding the value to count occurrences for. The method applies the lambda expression to each element and returns the expression result value with the most number of occurrences. The return type is the type of value in the collection or the type of value returned by the transform lambda expression if one was provided.

The example EPL below returns the most frequent item type, counting the distinct item types among all items for the current LocationReport event:

{1, 2, 3, 2, 1, 2}.mostFrequent()   // Returns 2

select items.mostFrequent(i => type) as mostFreqType from LocationReport

If the input is null or empty the method returns null. The transform expression may also yield null. A null value can be returned as the most frequent value if the most frequent value is null. If multiple values have the same number of occurrences the method returns the first value with the most number of occurrences considering the ordering of the collection.
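The tie-breaking rule for leastFrequent and mostFrequent can be illustrated by counting occurrences in encounter order and replacing the current candidate only on a strictly better count. The plain-Java sketch below assumes that "first value considering the ordering of the collection" means the value whose first occurrence comes earliest; the names are hypothetical and this is not Esper's implementation:

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

final class FrequencySketch {
    // Occurrence counts keyed by value, in order of first occurrence.
    private static <T> Map<T, Integer> countInOrder(List<T> input) {
        Map<T, Integer> counts = new LinkedHashMap<>();
        for (T value : input) {
            counts.merge(value, 1, Integer::sum);
        }
        return counts;
    }

    static <T> T leastFrequent(List<T> input) {
        if (input == null || input.isEmpty()) {
            return null;
        }
        T best = null;
        int bestCount = Integer.MAX_VALUE;
        for (Map.Entry<T, Integer> entry : countInOrder(input).entrySet()) {
            if (entry.getValue() < bestCount) { // strict comparison keeps the earlier value on ties
                best = entry.getKey();
                bestCount = entry.getValue();
            }
        }
        return best;
    }

    static <T> T mostFrequent(List<T> input) {
        if (input == null || input.isEmpty()) {
            return null;
        }
        T best = null;
        int bestCount = 0;
        for (Map.Entry<T, Integer> entry : countInOrder(input).entrySet()) {
            if (entry.getValue() > bestCount) { // strict comparison keeps the earlier value on ties
                best = entry.getKey();
                bestCount = entry.getValue();
            }
        }
        return best;
    }

    public static void main(String[] args) {
        System.out.println(leastFrequent(List.of(1, 2, 3, 2, 1)));
        System.out.println(mostFrequent(List.of(1, 2, 3, 2, 1, 2)));
    }
}
```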

11.6.18. OrderBy and OrderByDesc

The orderBy enumeration method sorts elements in ascending order according to a key. The orderByDesc enumeration method sorts elements in descending order according to a key.

The enumeration method takes a single key-selector lambda expression as parameter and orders elements according to the key yielded by the expression. For same-value keys, it maintains the existing order.

This example orders all items from a location report according to their distance from center:

{2, 3, 2, 1}.orderBy()   // Returns {1, 2, 2, 3}

select items.orderBy(i => distance(i.location.x, i.location.y, 0, 0)) as itemsNearFirst,
  items.orderByDesc(i => distance(i.location.x, i.location.y, 0, 0)) as itemsFarFirst
from LocationReport

The key-selector lambda expression must return a comparable type: Any primitive or boxed or Comparable type is permitted.

If the input is null the method returns null. If the input is empty the method returns an empty collection.

11.6.19. Reverse

The reverse enumeration method simply reverses the order of elements returning a collection.

The following EPL reverses the items:

{2, 3, 2, 1}.reverse()   // Returns {1, 2, 3, 2}

select items.reverse() as reversedItems from LocationReport

If the input is null the method returns null. If the input is empty the method returns an empty collection.

11.6.20. SelectFrom

The selectFrom enumeration method transforms each element resulting in a collection of transformed elements.

The enumeration method applies a transformation lambda expression to each element and returns the result of each transformation as a collection. Use the new operator to yield multiple values for each element, see Section 9.13, “The 'New' Keyword”.

The next statement returns a collection of asset ids:

select items.selectFrom(i => assetId) as itemAssetIds from LocationReport

This sample statement evaluates each item and returns the asset id as well as the distance from center for each item:

select items.selectFrom(i => 
  new {
    assetId, 
    distanceCenter = distance(i.location.x, i.location.y, 0, 0)
  } ) as itemInfo from LocationReport

If the input is null the method returns null. If the input is empty the method returns an empty collection.

11.6.21. SequenceEqual

The sequenceEqual enumeration method determines whether two collections are equal by comparing each element.

The method enumerates the two source collections in parallel and compares corresponding elements by using the equals method to compare. The method takes a single parameter expression that must return a collection containing elements of the same type as the input. The method returns true if the two source sequences are of equal length and their corresponding elements are equal.

The following example compares the asset id of all items to the asset ids returned by a method ItemUtil.redListed() which is assumed to return a list of asset ids of string type:

{1, 2, 3}.sequenceEqual({1})   // Returns false
{1, 2, 3}.sequenceEqual({1, 2, 3})   // Returns true

select items.selectFrom(i => assetId).sequenceEqual(ItemUtil.redListed()) from LocationReport

If the input is null the method returns null.

11.6.22. SumOf

The sumOf enumeration method computes the sum. If a projection lambda expression is provided, the method invokes the projection lambda expression on each element and computes the sum on each returned value.

The projection lambda expression should yield a numeric value, BigDecimal or BigInteger value. Depending on the type returned by the projection lambda expression the method returns either Integer, Long, Double, BigDecimal or BigInteger.

The following example computes the sum of the distance of each item to center:

{1, 2, 3}.sumOf()   // Returns 6

select items.sumOf(i => distance(i.location.x, i.location.y, 0, 0)) as totalAllDistances
from LocationReport

If the input is null or empty the method returns null.

11.6.23. Take

The take enumeration method returns a specified number of contiguous elements from the start.

The enumeration method takes a single size (non-lambda) expression that returns an Integer value.

The following example returns the first 5 items:

{1, 2, 3}.take(2)   // Returns {1, 2}

select items.take(5) as first5Items from LocationReport

If the input is null the method returns null. If the input is empty the method returns an empty collection.

11.6.24. TakeLast

The takeLast enumeration method returns a specified number of contiguous elements from the end.

The enumeration method takes a single size (non-lambda) expression that returns an Integer value.

The following example returns the last 5 items:

{1, 2, 3}.takeLast(2)   // Returns {2, 3}

select items.takeLast(5) as last5Items from LocationReport

If the input is null the method returns null. If the input is empty the method returns an empty collection.

11.6.25. TakeWhile

The takeWhile enumeration method returns elements from the start as long as a specified condition is true.

The enumeration method has two versions. The first version takes a predicate lambda expression and the second version takes a predicate lambda expression and index for use within the predicate expression. Both versions return elements from the start as long as the specified condition is true.

This example selects all items from a location report in the order provided until the first item that has a distance to center greater than 20 units:

{1, 2, 3}.takeWhile(v => v < 3)   // Returns {1, 2}
{1, 2, 3}.takeWhile((v,ind) => ind < 2)   // Returns {1, 2}
{1, 2, -1, 4, 5, 6}.takeWhile((v,ind) => ind < 5 and v > 0)  // Returns {1, 2} (Take while index<5 and value>0)

select items.takeWhile(i => distance(i.location.x, i.location.y, 0, 0) < 20)
from LocationReport

In the second version of takeWhile, the second lambda parameter represents the index of the input element, starting at zero for the first element.

The next example is similar to the statement above but also limits the result to the first 10 items:

select items.takeWhile((i, ind) => distance(i.location.x, i.location.y, 0, 0) < 20 and ind < 10)
from LocationReport

If the input is null the method returns null. If the input is empty the method returns an empty collection.
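To make the index semantics concrete, here is a plain-Java analogue of the two-parameter form of takeWhile. It is a sketch with hypothetical names, not Esper code; the index passed to the predicate starts at zero for the first element:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.BiPredicate;

final class TakeWhileSketch {
    // Returns leading elements until the predicate first evaluates to false.
    static <T> List<T> takeWhile(List<T> input, BiPredicate<T, Integer> predicate) {
        if (input == null) {
            return null;
        }
        List<T> taken = new ArrayList<>();
        for (int index = 0; index < input.size(); index++) {
            T value = input.get(index);
            if (!predicate.test(value, index)) {
                break; // stop at the first failing element; later elements are never inspected
            }
            taken.add(value);
        }
        return taken;
    }

    public static void main(String[] args) {
        System.out.println(takeWhile(List.of(1, 2, -1, 4, 5, 6), (v, ind) -> ind < 5 && v > 0));
    }
}
```

Because the first element has index zero, a predicate such as ind > 2 fails immediately and yields an empty result.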

11.6.26. TakeWhileLast

The takeWhileLast enumeration method returns elements from the end as long as a specified condition is true.

This example selects all items from a location report, starting from the last element and proceeding backwards, until the first item that has a distance to center greater than 20 units:

{1, 2, 3}.takeWhileLast(v => v < 3)   // Returns {} (empty collection)
{1, 2, 3}.takeWhileLast(v => v > 1)   // Returns {2, 3}
{1, 2, 3}.takeWhileLast((v,ind) => ind < 2)   // Returns {2, 3}
{1, 2, -1, 4, 5, 6}.takeWhileLast((v,ind) => ind < 5 and v > 0)  // Returns {4, 5, 6} (Take while index<5 and value>0)

select items.takeWhileLast(i => distance(i.location.x, i.location.y, 0, 0) < 20)
from LocationReport

The second version provides the index of the input element starting at zero for the last element (reverse index).

The next example is similar to the statement above but also limits the result to the last 10 items:

select items.takeWhileLast((i, ind) => distance(i.location.x, i.location.y, 0, 0) < 20 and ind < 10)
from LocationReport

If the input is null the method returns null. If the input is empty the method returns an empty collection.
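The reverse index can be illustrated with a plain-Java analogue of takeWhileLast that walks backwards from the last element while keeping the result in the original order. The names are hypothetical and the code is a sketch, not Esper's implementation:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.BiPredicate;

final class TakeWhileLastSketch {
    // Returns trailing elements, scanning backwards until the predicate first fails.
    static <T> List<T> takeWhileLast(List<T> input, BiPredicate<T, Integer> predicate) {
        if (input == null) {
            return null;
        }
        int start = input.size(); // position of the first element to keep
        for (int reverseIndex = 0; reverseIndex < input.size(); reverseIndex++) {
            int position = input.size() - 1 - reverseIndex;
            // reverseIndex is zero for the last element
            if (!predicate.test(input.get(position), reverseIndex)) {
                break;
            }
            start = position;
        }
        return new ArrayList<>(input.subList(start, input.size()));
    }

    public static void main(String[] args) {
        System.out.println(takeWhileLast(List.of(1, 2, -1, 4, 5, 6), (v, ind) -> ind < 5 && v > 0));
    }
}
```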

11.6.27. ToMap

The toMap enumeration method returns a Map according to specified key-selector lambda expression and value-selector lambda expression.

The enumeration method takes a key-selector expression and a value-selector expression. For each element the method applies the key-selector expression to determine the map key and the value-selector expression to determine the map value. If the key already exists in the map the value is overwritten.

The next example EPL outputs a map of item asset id and distance to center for each item:

select items.toMap(k => k.assetId, v => distance(v.location.x, v.location.y, 0, 0)) as assetDistance
from LocationReport

If the input is null the method returns null. If the input is empty the method returns an empty map.
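The overwrite-on-duplicate-key behavior is what distinguishes toMap from groupby. A minimal plain-Java sketch of that rule (hypothetical names; an analogue, not Esper code):

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Function;

final class ToMapSketch {
    // Builds a map from each element; a later element with the same key replaces the earlier value.
    static <T, K, V> Map<K, V> toMap(List<T> input,
                                     Function<T, K> keySelector,
                                     Function<T, V> valueSelector) {
        if (input == null) {
            return null;
        }
        Map<K, V> result = new LinkedHashMap<>();
        for (T element : input) {
            result.put(keySelector.apply(element), valueSelector.apply(element));
        }
        return result;
    }

    public static void main(String[] args) {
        // Elements encoded as "key=value" purely for the example; key "a" occurs twice
        List<String> items = List.of("a=1", "b=2", "a=3");
        System.out.println(toMap(items, s -> s.substring(0, 1), s -> Integer.parseInt(s.substring(2))));
    }
}
```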

11.6.28. Union

The union enumeration method forms a union of the input elements with the elements that the parameter expression yields.

The enumeration method takes a single parameter that must itself return a collection of events (input), objects or scalar values. It appends the collection to the input elements and returns the appended collection of elements.

This example selects a union of all items that have an asset id of L001 or that are of type passenger:

{1, 2, 3}.union({4, 5})   // Returns {1, 2, 3, 4, 5}

select items.where(i => i.assetId = 'L001')
    .union(items.where(i => i.type = 'P')) as itemsUnion
from LocationReport

If the input is null the method returns null.

11.6.29. Where

The where enumeration method filters elements based on a predicate.

The enumeration method has two versions. The first version takes a predicate lambda expression and the second version takes a predicate lambda expression and index for use within the predicate expression. Both versions return all elements for which the predicate expression is true.

This example selects all items from a location report that are passenger-type:

{1, 2, 3}.where(v => v != 2)   // Returns {1, 3}

select items.where(p => p.type = 'P') from LocationReport

In the second version of where, the second lambda parameter represents the index of the input element, starting at zero for the first element.

The example below selects all items from a location report that are passenger-type but ignores the first 3 elements:

select items.where((p, ind) => p.type = 'P' and ind > 2) from LocationReport

If the input is null the method returns null. If the input is empty the method returns an empty collection.

Chapter 12. EPL Reference: Date-Time Methods

12.1. Overview

EPL date-time methods work on date-time values to perform common tasks such as comparing times and time periods, adding or subtracting time periods, setting or rounding calendar fields and querying fields.

Date-time methods operate on:

Any expression or event property that returns one of the below values:
1. A long-type millisecond or microsecond value.
2. A java.util.Calendar object including subclasses.
3. A java.util.Date object including subclasses.
4. A java.time.LocalDateTime object including subclasses.
5. A java.time.ZonedDateTime object including subclasses.
Any event for which the event type declares a start timestamp property name and optionally also an end timestamp property name. Date-time methods operate on events by means of the stream-alias.method-name syntax.

The below table summarizes the built-in date-time methods available:

Table 12.1. Date-Time Methods

| Method | Result |
| --- | --- |
| after(event or timestamp) | Returns true if an event happens after another event, or a timestamp is after another timestamp. Section 12.4.5, “After”. |
| before(event or timestamp) | Returns true if an event happens before another event, or a timestamp is before another timestamp. Section 12.4.6, “Before”. |
| between(timestamp, timestamp, boolean, boolean) | Returns true if a timestamp is between two timestamps. Section 12.3.1, “Between”. |
| coincides(event or timestamp) | Returns true if an event and another event happen at the same time, or two timestamps are the same value. Section 12.4.7, “Coincides”. |
| during(event or timestamp) | Returns true if an event happens during the occurrence of another event, or when a timestamp falls within the occurrence of an event. Section 12.4.8, “During”. |
| finishes(event or timestamp) | Returns true if an event starts after another event starts and the event ends at the same time as the other event. Section 12.4.9, “Finishes”. |
| finishedBy(event or timestamp) | Returns true if an event starts before another event starts and ends at the same time as the other event. Section 12.4.10, “Finished By”. |
| format(), format(format) | Formats the date-time returning a string. Section 12.3.2, “Format”. |
| get(field) | Returns the value of the given date-time value field. Section 12.3.3, “Get (By Field)”. |
| getMillisOfSecond(), getSecondOfMinute(), getMinuteOfHour(), getHourOfDay(), getDayOfWeek(), getDayOfMonth(), getDayOfYear(), getWeekyear(), getMonthOfYear(), getYear(), getEra() | Returns the value of the given date-time value field. Section 12.3.4, “Get (By Name)”. |
| includes(event or timestamp) | Returns true if the parameter event happens during the occurrence of the event, or when a timestamp falls within the occurrence of an event. Section 12.4.11, “Includes”. |
| meets(event or timestamp) | Returns true if the event's end time is the same as another event's start time. Section 12.4.12, “Meets”. |
| metBy(event or timestamp) | Returns true if the event's start time is the same as another event's end time. Section 12.4.13, “Met By”. |
| minus(duration-millis) | Returns a date-time with the specified duration in long-type milliseconds taken away. Section 12.3.5, “Minus”. |
| minus(time-period) | Returns a date-time with the specified duration in time-period syntax taken away. Section 12.3.5, “Minus”. |
| overlaps(event or timestamp) | Returns true if the event starts before another event starts and finishes after the other event starts, but before the other event finishes (events have an overlapping period of time). Section 12.4.14, “Overlaps”. |
| overlappedBy(event or timestamp) | Returns true if the parameter event starts before the input event starts and the parameter event finishes after the input event starts, but before the input event finishes (events have an overlapping period of time). Section 12.4.15, “Overlapped By”. |
| plus(duration-millis) | Returns a date-time with the specified duration in long-type milliseconds added. Section 12.3.6, “Plus”. |
| plus(time-period) | Returns a date-time with the specified duration in time-period syntax added. Section 12.3.6, “Plus”. |
| roundCeiling(field) | Returns a date-time rounded to the highest whole unit of the date-time field. Section 12.3.7, “RoundCeiling”. |
| roundFloor(field) | Returns a date-time rounded to the lowest whole unit of the date-time field. Section 12.3.8, “RoundFloor”. |
| roundHalf(field) | Returns a date-time rounded to the nearest whole unit of the date-time field. Section 12.3.9, “RoundHalf”. |
| set(field, value) | Returns a date-time with the specified field set to the value returned by a value expression. Section 12.3.10, “Set (By Field)”. |
| starts(event or timestamp) | Returns true if an event and another event start at the same time and the event's end happens before the other event's end. Section 12.4.16, “Starts”. |
| startedBy(event or timestamp) | Returns true if an event and another event start at the same time and the other event's end happens before the input event's end. Section 12.4.17, “Started By”. |
| withDate(year,month,day) | Returns a date-time with the specified date, retaining the time fields. Section 12.3.11, “WithDate”. |
| withMax(field) | Returns a date-time with the field set to the maximum value for the field. Section 12.3.12, “WithMax”. |
| withMin(field) | Returns a date-time with the field set to the minimum value for the field. Section 12.3.13, “WithMin”. |
| withTime(hour,minute,sec,msec) | Returns a date-time with the specified time, retaining the date fields. Section 12.3.14, “WithTime”. |
| toCalendar() | Returns the Calendar object for this date-time value. Section 12.3.15, “ToCalendar”. |
| toDate() | Returns the Date object for this date-time value. Section 12.3.16, “ToDate”. |
| toMillisec() | Returns the long-type milliseconds value for this date-time value. Section 12.3.17, “ToMillisec”. |

12.2. How to Use

12.2.1. Syntax

The syntax for date-time methods is the same syntax as for any chained invocation:

input_val.datetime_method_name( [method_parameter [, method_parameter [,...]]])
	  .[ datetime_method_name(...) [...]]

Following the input_val input value is the . (dot) operator and the datetime_method_name date-time method name, followed in parentheses by a comma-separated list of method parameter expressions. Additional date-time methods can be chained thereafter.

The input value can be any expression or event property that returns a value of type long or java.util.Calendar or java.util.Date or java.time.LocalDateTime or java.time.ZonedDateTime. If the input value is null, the expression result is also null.

The input value can also be an event. In this case the event type of the event must have the start timestamp property name defined and optionally also the end timestamp property name.

The following example statement employs the withTime date-time method. This example returns the current runtime time with the time-part set to 1 am:

select current_timestamp.withTime(1, 0, 0, 0) as time1am from MyEvent

As date-time methods can be chained, this EPL is equivalent:

select current_timestamp.set('hour', 1).set('min', 0).set('sec', 0).set('msec', 0) as time1am
from MyEvent

The statement above outputs in field time1am a long-type value (milliseconds or microseconds) reflecting 1 am on the same date as the runtime time. Because the input value is provided by the built-in current_timestamp function, which returns the current runtime time as a long-type value, the output is also a long-type value.
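To make the long-in, long-out behavior concrete, the following Java sketch mimics the effect of withTime(1, 0, 0, 0) on a millisecond value by using java.util.Calendar. The class and method names are hypothetical and not part of the Esper API, and the time zone is fixed to UTC only so that the result is reproducible (an assumption of this sketch).

```java
import java.time.Instant;
import java.util.Calendar;
import java.util.GregorianCalendar;
import java.util.TimeZone;

// Hypothetical helper, not part of the Esper API: mirrors the effect of
// withTime(hour, minute, sec, msec) on a long-type millisecond value.
class WithTimeSketch {
    static long withTime(long epochMillis, int hour, int minute, int sec, int msec) {
        // UTC is an assumption made here so that the result is reproducible.
        Calendar cal = new GregorianCalendar(TimeZone.getTimeZone("UTC"));
        cal.setTimeInMillis(epochMillis);
        cal.set(Calendar.HOUR_OF_DAY, hour);
        cal.set(Calendar.MINUTE, minute);
        cal.set(Calendar.SECOND, sec);
        cal.set(Calendar.MILLISECOND, msec);
        return cal.getTimeInMillis();   // the date part is retained
    }

    public static void main(String[] args) {
        long input = Instant.parse("2002-05-30T09:01:23.050Z").toEpochMilli();
        long result = withTime(input, 1, 0, 0, 0);
        System.out.println(Instant.ofEpochMilli(result)); // 2002-05-30T01:00:00Z
    }
}
```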

You may apply a date-time method to an event property.

Assume that the RFIDEvent event type has a Date-type property by name timeTaken. The following statement rounds each time-taken value down to the nearest minute and outputs a Date-type value in column timeTakenRounded:

select timeTaken.roundFloor('min') as timeTakenRounded from RFIDEvent

You may apply a date-time method to events. This example assumes that the RFIDEvent and WifiEvent event types both have a timestamp property defined. The EPL compares the timestamps of the RFIDEvent and the WifiEvent:

select rfid.after(wifi) as isAfter 
from RFIDEvent#lastevent rfid, WifiEvent#lastevent wifi

For comparing date-time values and considering event duration (event start and end timestamps) we recommend any of the interval algebra methods. You may also compare long-type values using the between or in ranges, inverted ranges or relational operators (>, <, >=, <=).

From a performance perspective, the date-time method evaluation ensures that for each unique chain of date-time methods only a single calendar object is copied or created when necessary.

12.3. Calendar and Formatting Reference

12.3.1. Between

The between date-time method compares the input date-time value to the two date-time values passed in and returns true if the input value falls between the two parameter values.

The synopsis is:

input_val.between(range_start, range_end [, include_start, include_end])

The method takes either 2 or 4 parameters. The first two parameters range_start and range_end are expressions or properties that yield either a long-typed, Date-typed or Calendar-typed range start and end value.

The next two parameters include_start and include_end are optional. If not specified, the range start value and range end value are included in the range i.e. specify a closed range where both endpoints are included. If specified, the expressions must return a boolean-value indicating whether to include the range start value and range end value in the range.

The example below outputs true when the time-taken property value of the RFID event falls between the time-start property value and the time-end property value (closed range includes endpoints):

select timeTaken.between(timeStart, timeEnd) from RFIDEvent

The example below performs the same test as above but does not include endpoints (open range includes neither endpoint):

select timeTaken.between(timeStart, timeEnd, false, false) from RFIDEvent

If the range end value is less than the range start value, the algorithm reverses the range start and end value.

If the input date-time value or any of the parameter values evaluate to null the method returns a null result value.
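The endpoint handling described above can be sketched in Java on long-type values. This is an illustration of the documented semantics only: the class and method names are hypothetical, null handling is not modeled, and the choice that each include flag stays with its endpoint when a reversed range is swapped is an assumption of the sketch.

```java
// Hypothetical helper, not part of the Esper API.
class BetweenSketch {
    // Two-parameter form: closed range, both endpoints included.
    static boolean between(long value, long start, long end) {
        return between(value, start, end, true, true);
    }

    // Four-parameter form with explicit endpoint inclusion.
    static boolean between(long value, long start, long end,
                           boolean includeStart, boolean includeEnd) {
        if (end < start) {
            // Reversed range: swap the endpoints.
            long t = start; start = end; end = t;
            // Assumption: each flag travels with its endpoint value.
            boolean f = includeStart; includeStart = includeEnd; includeEnd = f;
        }
        boolean aboveStart = includeStart ? value >= start : value > start;
        boolean belowEnd = includeEnd ? value <= end : value < end;
        return aboveStart && belowEnd;
    }

    public static void main(String[] args) {
        System.out.println(between(10, 10, 20));               // true
        System.out.println(between(10, 10, 20, false, false)); // false
        System.out.println(between(15, 20, 10));               // true
    }
}
```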

12.3.2. Format

The format date-time method formats the date-time returning a string.

The method takes either no parameter or a single format parameter.

12.3.2.1. Format with Default Formatter

When passing no parameter, the method returns the date-time value formatted using the default formatter as follows:

Table 12.2. Default Formatters

Input	String Formatter
`long, Date, Calendar`	`new SimpleDateFormat()`
`java.time.LocalDateTime`	`DateTimeFormatter.ISO_DATE_TIME`
`java.time.ZonedDateTime`	`DateTimeFormatter.ISO_ZONED_DATE_TIME`

The example below outputs the time-taken property value of the RFID event:

select timeTaken.format() as timeTakenStr from RFIDEvent

12.3.2.2. Providing a Format

For input values that are long-typed, Date-typed or Calendar-typed you must provide an expression that returns either:

A String-type format that adheres to SimpleDateFormat rules.
A DateFormat instance.

For input values that are LocalDateTime-typed or ZonedDateTime-typed you must provide an expression that returns either:

A String-type format that adheres to DateTimeFormatter rules.
A DateTimeFormatter instance.

The runtime evaluates the format expression at statement compilation time; therefore the format expression must return a value that is not computed from time or events.

For example:

select timeTaken.format("yyyy.MM.dd G 'at' HH:mm:ss") from RFIDEvent

select timeTaken.format(SimpleDateFormat.getDateInstance()) from RFIDEvent

select localDateTime.format(java.time.format.DateTimeFormatter.BASIC_ISO_DATE) from RFIDEvent
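For readers less familiar with the two Java formatter families, the sketch below shows what the format patterns used above produce in plain Java. The class and method names are hypothetical, and the fixed UTC time zone and US locale are assumptions made so the output is reproducible; they do not describe how the runtime selects its zone or locale.

```java
import java.text.SimpleDateFormat;
import java.time.LocalDateTime;
import java.time.format.DateTimeFormatter;
import java.util.Date;
import java.util.Locale;
import java.util.TimeZone;

// Hypothetical helper, not part of the Esper API.
class FormatSketch {
    // long, Date and Calendar inputs follow SimpleDateFormat rules.
    static String formatLegacy(long epochMillis, String pattern) {
        SimpleDateFormat fmt = new SimpleDateFormat(pattern, Locale.US);
        fmt.setTimeZone(TimeZone.getTimeZone("UTC")); // assumption for reproducibility
        return fmt.format(new Date(epochMillis));
    }

    // LocalDateTime and ZonedDateTime inputs follow DateTimeFormatter rules.
    static String formatJavaTime(LocalDateTime value, DateTimeFormatter formatter) {
        return value.format(formatter);
    }

    public static void main(String[] args) {
        // Literal text inside a SimpleDateFormat pattern is enclosed in single quotes.
        System.out.println(formatLegacy(0L, "yyyy.MM.dd G 'at' HH:mm:ss"));
        // prints 1970.01.01 AD at 00:00:00
        System.out.println(formatJavaTime(
            LocalDateTime.of(2002, 5, 30, 9, 1, 23), DateTimeFormatter.BASIC_ISO_DATE));
        // prints 20020530
    }
}
```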

12.3.3. Get (By Field)

The get date-time method returns the value of the given date-time value field.

The method takes a single string-constant field name as parameter. Please see Section 5.2.1, “Specifying Time Periods” for a list of recognized keywords (not case-sensitive).

The method returns the numeric value of the field within the date-time value. The value returned adheres to Calendar-class semantics: For example, the value for month starts at zero and has a maximum of 11 (Note: for LocalDateTime and ZonedDateTime the range for month is 1 to 12).

The example below outputs the month value of the time-taken property value of the RFID event:

select timeTaken.get('month') as timeTakenMonth from RFIDEvent

12.3.4. Get (By Name)

The following list of getter-methods are available: getMillisOfSecond(), getSecondOfMinute(), getMinuteOfHour(), getHourOfDay(), getDayOfWeek(), getDayOfMonth(), getDayOfYear(), getWeekYear(), getMonthOfYear(), getYear() and getEra().

All get-methods take no parameter and return the numeric value of the field within the date-time value. The value returned adheres to Calendar-class semantics: For example, the value for month starts at zero and has a maximum of 11 (Note: for LocalDateTime and ZonedDateTime the range for month is 1 to 12).

The example below outputs the month value of the time-taken property value of the RFID event:

select timeTaken.getMonthOfYear() as timeTakenMonth from RFIDEvent
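The difference in month numbering between Calendar-class semantics and the java.time classes applies to both get variants and is easy to overlook. The plain-Java sketch below shows the two numbering schemes side by side; the class and method names are hypothetical, and UTC is assumed for reproducibility.

```java
import java.time.Instant;
import java.time.LocalDateTime;
import java.util.Calendar;
import java.util.GregorianCalendar;
import java.util.TimeZone;

// Hypothetical helper, not part of the Esper API.
class MonthFieldSketch {
    // Calendar semantics: January is 0, December is 11.
    static int calendarMonth(long epochMillis) {
        Calendar cal = new GregorianCalendar(TimeZone.getTimeZone("UTC"));
        cal.setTimeInMillis(epochMillis);
        return cal.get(Calendar.MONTH);
    }

    // java.time semantics: January is 1, December is 12.
    static int javaTimeMonth(LocalDateTime value) {
        return value.getMonthValue();
    }

    public static void main(String[] args) {
        long may30 = Instant.parse("2002-05-30T09:01:23Z").toEpochMilli();
        System.out.println(calendarMonth(may30));                                 // 4
        System.out.println(javaTimeMonth(LocalDateTime.of(2002, 5, 30, 9, 1, 23))); // 5
    }
}
```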

12.3.5. Minus

The minus date-time method returns a date-time with the specified duration taken away.

The method has two versions: The first version takes the duration as a long-type millisecond value. The second version takes the duration as a time-period expression, see Section 5.2.1, “Specifying Time Periods”.

The example below demonstrates the time-period parameter to subtract two minutes from the time-taken property value of the RFID event:

select timeTaken.minus(2 minutes) as timeTakenMinus2Min from RFIDEvent

The next example is equivalent but passes a millisecond-value instead:

select timeTaken.minus(2*60*1000) as timeTakenMinus2Min from RFIDEvent

12.3.6. Plus

The plus date-time method returns a date-time with the specified duration added.

The next example adds two minutes to the time-taken property value of the RFID event:

select timeTaken.plus(2 minutes) as timeTakenPlus2Min from RFIDEvent

The next example is equivalent but passes a millisecond-value instead:

select timeTaken.plus(2*60*1000) as timeTakenPlus2Min from RFIDEvent

12.3.7. RoundCeiling

The roundCeiling date-time method rounds to the highest whole unit of the date-time field.

The method takes a single string-constant field name as parameter. Please see Section 5.2.1, “Specifying Time Periods” for a list of recognized keywords (not case-sensitive).

The next example rounds-to-ceiling the minutes of the time-taken property value of the RFID event:

select timeTaken.roundCeiling('min') as timeTakenRounded from RFIDEvent

If the input time is 2002-05-30 09:01:23.050, for example, the output is 2002-05-30 09:02:00.000 (example timestamps are in format yyyy-MM-dd HH:mm:ss.SSS).

12.3.8. RoundFloor

The roundFloor date-time method rounds to the lowest whole unit of the date-time field.

The method takes a single string-constant field name as parameter. Please see Section 5.2.1, “Specifying Time Periods” for a list of recognized keywords (not case-sensitive).

The next example rounds-to-floor the minutes of the time-taken property value of the RFID event:

select timeTaken.roundFloor('min') as timeTakenRounded from RFIDEvent

If the input time is 2002-05-30 09:01:23.050, for example, the output is 2002-05-30 09:01:00.000 (example timestamps are in format yyyy-MM-dd HH:mm:ss.SSS).
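The floor and ceiling rounding of the two preceding sections can be sketched with plain millisecond arithmetic. This is only an illustration under stated assumptions: the names are hypothetical, the arithmetic is valid only for fixed-length units (such as seconds or minutes on UTC epoch values), and leaving an already-whole value unchanged in roundCeiling is an assumption of the sketch rather than documented behavior.

```java
import java.time.Instant;

// Hypothetical helper, not part of the Esper API.
class RoundFloorCeilingSketch {
    static final long MINUTE_MILLIS = 60_000L;

    // Round down to the start of the enclosing unit.
    static long roundFloor(long epochMillis, long unitMillis) {
        return epochMillis - Math.floorMod(epochMillis, unitMillis);
    }

    // Round up to the start of the next unit.
    // Assumption: a value already on a unit boundary is returned unchanged.
    static long roundCeiling(long epochMillis, long unitMillis) {
        long floor = roundFloor(epochMillis, unitMillis);
        return floor == epochMillis ? floor : floor + unitMillis;
    }

    public static void main(String[] args) {
        long input = Instant.parse("2002-05-30T09:01:23.050Z").toEpochMilli();
        System.out.println(Instant.ofEpochMilli(roundFloor(input, MINUTE_MILLIS)));   // 2002-05-30T09:01:00Z
        System.out.println(Instant.ofEpochMilli(roundCeiling(input, MINUTE_MILLIS))); // 2002-05-30T09:02:00Z
    }
}
```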

12.3.9. RoundHalf

The roundHalf date-time method rounds to the nearest whole unit of the date-time field.

The method takes a single string-constant field name as parameter. Please see Section 5.2.1, “Specifying Time Periods” for a list of recognized keywords (not case-sensitive).

The next example rounds the minutes of the time-taken property value of the RFID event:

select timeTaken.roundHalf('min') as timeTakenRounded from RFIDEvent

The following table provides a few examples of the rounding (example timestamps are in format yyyy-MM-dd HH:mm:ss.SSS):

Table 12.3. RoundHalf Examples

Input	Output
2002-05-30 09:01:23.050	2002-05-30 09:01:00.000
2002-05-30 09:01:29.999	2002-05-30 09:01:00.000
2002-05-30 09:01:30.000	2002-05-30 09:02:00.000

This method is not supported for LocalDateTime and ZonedDateTime input values.
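The rows of the RoundHalf examples table can be reproduced with a small arithmetic sketch: add half a unit, then round down. The names are hypothetical, and the approach assumes a fixed-length unit on UTC epoch milliseconds; it is not meant to describe the runtime's Calendar-based implementation.

```java
import java.time.Instant;

// Hypothetical helper, not part of the Esper API.
class RoundHalfSketch {
    static final long MINUTE_MILLIS = 60_000L;

    // Round to the nearest unit; exactly half a unit rounds up.
    static long roundHalf(long epochMillis, long unitMillis) {
        return Math.floorDiv(epochMillis + unitMillis / 2, unitMillis) * unitMillis;
    }

    public static void main(String[] args) {
        String[] inputs = {
            "2002-05-30T09:01:23.050Z",
            "2002-05-30T09:01:29.999Z",
            "2002-05-30T09:01:30.000Z"
        };
        for (String text : inputs) {
            long rounded = roundHalf(Instant.parse(text).toEpochMilli(), MINUTE_MILLIS);
            System.out.println(text + " -> " + Instant.ofEpochMilli(rounded));
        }
    }
}
```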

12.3.10. Set (By Field)

The set date-time method returns a date-time with the specified field set to the value returned by an expression.

The method takes a string-constant field name and an expression returning an integer-value as parameters. Please see Section 5.2.1, “Specifying Time Periods” for a list of recognized keywords (not case-sensitive).

The method returns the new date-time value with the field set to the provided value. Note that value adheres to Calendar-class semantics: For example, the value for month starts at zero and has a maximum of 11 (Note: for LocalDateTime and ZonedDateTime the range for month is 1 to 12).

The example below outputs the time-taken with the value for month set to April:

select timeTaken.set('month', 3) as timeTakenMonth from RFIDEvent

12.3.11. WithDate

The withDate date-time method returns a date-time with the specified date, retaining the time fields.

The method takes three expressions as parameters: An expression for year, month and day.

The method returns the new date-time value with the date fields set to the provided values. For expressions returning null the method ignores the field for which null is returned. Note the Calendar-class semantics: For example, the value for month starts at zero and has a maximum of 11.

The example below outputs the time-taken with the date set to May 30, 2002:

select timeTaken.withDate(2002, 4, 30) as timeTakenDated from RFIDEvent
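As an illustration of the zero-based month and the rule that null parameters leave a field untouched, the following Java sketch applies the same idea with java.util.Calendar. The class and method names are hypothetical, and UTC is assumed for reproducibility.

```java
import java.time.Instant;
import java.util.Calendar;
import java.util.GregorianCalendar;
import java.util.TimeZone;

// Hypothetical helper, not part of the Esper API.
class WithDateSketch {
    // A null argument leaves the corresponding field as it is.
    static long withDate(long epochMillis, Integer year, Integer month, Integer day) {
        Calendar cal = new GregorianCalendar(TimeZone.getTimeZone("UTC"));
        cal.setTimeInMillis(epochMillis);
        if (year != null) cal.set(Calendar.YEAR, year);
        if (month != null) cal.set(Calendar.MONTH, month);        // zero-based: 4 is May
        if (day != null) cal.set(Calendar.DAY_OF_MONTH, day);
        return cal.getTimeInMillis();                             // time fields are retained
    }

    public static void main(String[] args) {
        long input = Instant.parse("2010-01-15T09:01:23Z").toEpochMilli();
        System.out.println(Instant.ofEpochMilli(withDate(input, 2002, 4, 30)));    // 2002-05-30T09:01:23Z
        System.out.println(Instant.ofEpochMilli(withDate(input, null, null, 20))); // 2010-01-20T09:01:23Z
    }
}
```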

12.3.12. WithMax

The withMax date-time method returns a date-time with the field set to the maximum value for the field.

The method takes a string-constant field name as parameter. Please see Section 5.2.1, “Specifying Time Periods” for a list of recognized keywords (not case-sensitive).

The method returns the new date-time value with the specific date field set to the maximum value.

The example below outputs the time-taken property value with the second-part as 59 seconds:

select timeTaken.withMax('sec') as timeTakenMaxSec from RFIDEvent

12.3.13. WithMin

The withMin date-time method returns a date-time with the field set to the minimum value for the field.

The method takes a string-constant field name as parameter. Please see Section 5.2.1, “Specifying Time Periods” for a list of recognized keywords (not case-sensitive).

The method returns the new date-time value with the specific date field set to the minimum value.

The example below outputs the time-taken property value with the second-part as 0 seconds:

select timeTaken.withMin('sec') as timeTakenMinSec from RFIDEvent
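The maximum and minimum of a field depend on the field and, for some fields, on the date itself (the last day of February, for instance). A minimal Java sketch of the idea, using Calendar.getActualMaximum and Calendar.getActualMinimum, follows. The names are hypothetical, UTC is assumed, and the sketch takes a Calendar field constant where EPL takes a field name string.

```java
import java.time.Instant;
import java.util.Calendar;
import java.util.GregorianCalendar;
import java.util.TimeZone;

// Hypothetical helper, not part of the Esper API.
class WithMaxMinSketch {
    static long withMax(long epochMillis, int calendarField) {
        Calendar cal = utcCalendar(epochMillis);
        cal.set(calendarField, cal.getActualMaximum(calendarField));
        return cal.getTimeInMillis();
    }

    static long withMin(long epochMillis, int calendarField) {
        Calendar cal = utcCalendar(epochMillis);
        cal.set(calendarField, cal.getActualMinimum(calendarField));
        return cal.getTimeInMillis();
    }

    private static Calendar utcCalendar(long epochMillis) {
        Calendar cal = new GregorianCalendar(TimeZone.getTimeZone("UTC"));
        cal.setTimeInMillis(epochMillis);
        return cal;
    }

    public static void main(String[] args) {
        long input = Instant.parse("2002-05-30T09:01:23.050Z").toEpochMilli();
        System.out.println(Instant.ofEpochMilli(withMax(input, Calendar.SECOND))); // 2002-05-30T09:01:59.050Z
        System.out.println(Instant.ofEpochMilli(withMin(input, Calendar.SECOND))); // 2002-05-30T09:01:00.050Z
    }
}
```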

12.3.14. WithTime

The withTime date-time method returns a date-time with the specified time, retaining the date fields.

The method takes four expressions as parameters: An expression for hour, minute, second and millisecond.

The method returns the new date-time value with the time fields set to the provided values. For expressions returning null the method ignores the field for which null is returned.

The example below outputs the time-taken with the time set to 9am:

select timeTaken.withTime(9, 0, 0, 0) as timeTakenDated from RFIDEvent

12.3.15. ToCalendar

The toCalendar date-time method returns the Calendar object for this date-time value.

The method takes no parameters.

The example below outputs the time-taken as a Calendar object:

select timeTaken.toCalendar() as timeTakenCal from RFIDEvent

12.3.16. ToDate

The toDate date-time method returns the Date object for this date-time value.

The method takes no parameters.

The example below outputs the time-taken as a Date object:

select timeTaken.toDate() as timeTakenDate from RFIDEvent

12.3.17. ToMillisec

The toMillisec date-time method returns the long-typed millisecond value for this date-time value.

The method takes no parameters.

The example below outputs the time-taken as a long-typed millisecond value:

select timeTaken.toMillisec() as timeTakenLong from RFIDEvent

12.4. Interval Algebra Reference

Interval algebra methods compare start and end timestamps of events or timestamps in general.

When the expression input is only a timestamp value, such as a long-type value or a Date or Calendar object, the start and end timestamp represented by that value are the same timestamp value.

When the expression input is an event stream alias, the compiler determines the event type for the stream. If the event type declares a start timestamp property name, the compiler uses that start timestamp property to determine the start timestamp for the event. If the event type also declares an end timestamp property name, the compiler uses that end timestamp property to determine the end timestamp for the event (i.e. an event with duration). If an end timestamp property name is not declared, the start and end timestamp for each event is the same value and the event is considered to have zero duration (i.e. a point-in-time event).

Interval algebra methods all return a Boolean-type value. The result value is null when the input value's start timestamp is null, when its end timestamp (if declared for the event type) is null, or when the start timestamp or the end timestamp (if declared for the event type) of the first parameter is null.

12.4.1. Examples

The examples in this section simply use A and B as event type names. The alias a is used to represent A-type events and respectively the alias b represents B-type events.

The create-schema for types A and B is shown next. The two types are declared the same. The example declares the property providing start timestamp values as startts and the property providing end timestamp values as endts:

create schema A as (startts long, endts long) starttimestamp 'startts' endtimestamp 'endts'

create schema B as (startts long, endts long) starttimestamp 'startts' endtimestamp 'endts'

The sample EPL below joins the last A and the last B event. It detects A-B event combinations for which, when comparing timestamps, the last A event occurs before the last B event. The example employs the before method:

select * from A#lastevent as a, B#lastevent as b where a.before(b)

For simplicity, the examples in this section refer to A and the alias a as the input event. The examples refer to B and the alias b as the parameter event.

12.4.2. Interval Algebra Parameters

The first parameter of each interval algebra method is the event or timestamp to compare to.

All remaining parameters to interval algebra methods are intervals and can be any of the following:

A constant, an event property or more generally any expression returning a numeric value that is the number of seconds. For example, in the expression a.before(b, 2) the parameter 2 is interpreted to mean 2 seconds. The expression a.before(b, myIntervalProperty) is interpreted to mean myIntervalProperty seconds.
A time period expression as described in Section 5.2.1, “Specifying Time Periods”. For example: a.before(b, 1 hour 2 minutes).

When an interval parameter is provided and is null, the method result value is null.

12.4.3. Performance

The compiler analyzes interval algebra methods as well as the between date-time method in the where-clause and builds a query plan for execution of joins and subqueries. The query plan can include hash and btree index lookups using the start and end timestamps as computed by expressions or provided by events as applicable. Consider turning on query plan logging to obtain information on the query plan used.

The query planning is generally most effective when no additional thresholds or ranges are provided to interval algebra methods, as the query planner may not consider an interval algebra method that it cannot plan.

The query planner may also not optimally plan the query execution if events or expressions return different types of date representation. Query planning works best if all date representations use the same long, Date or Calendar types.

12.4.4. Limitations

Date-time methods that change date or time fields, such as withTime, withDate, set or the round methods, set the end timestamp to the start timestamp.

For example, in the following expression the parameter to the after method has a zero duration, and not the end timestamp that the event B endts property provides.

a.after(b.withTime(9, 0, 0, 0))

12.4.5. After

The after date-time method returns true if an event happens after another event, or a timestamp is after another timestamp.

The method compares the input value's start timestamp (a.startTimestamp) to the first parameter's end timestamp (b.endTimestamp) to determine whether A happens after B.

If used with one parameter, for example in a.after(b), the method returns true if A starts after B ends.

select * from A#lastevent as a, B#lastevent as b where a.after(b)
// Above matches when:
//   a.startTimestamp - b.endTimestamp > 0

If providing two parameters, for example in a.after(b, 5 sec), the method returns true if A starts at least 5 seconds after B ends.

select * from A#lastevent as a, B#lastevent as b where a.after(b, 5 sec)
// Above matches when:
//   a.startTimestamp - b.endTimestamp >= 5 seconds

If providing three parameters, for example in a.after(b, 5 sec, 10 sec), the method returns true if A starts at least 5 seconds but no more than 10 seconds after B ends.

select * from A#lastevent as a, B#lastevent as b where a.after(b, 5 sec, 10 sec)
// Above matches when:
//   5 seconds <= a.startTimestamp - b.endTimestamp <= 10 seconds

Negative values for the range are allowed. For example in a.after(b, -5 sec, -10 sec), the method returns true if A starts at least 5 seconds but no more than 10 seconds before B ends.

If the range low endpoint is greater than the range high endpoint, the compiler automatically reverses them. Thus a.after(b, 10 sec, 5 sec) has the same semantics as a.after(b, 5 sec, 10 sec).
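The three forms of after can be restated as a short Java sketch of the matching conditions given in this section. It is not the runtime's implementation: the class and method names are hypothetical, timestamps and intervals are expressed in milliseconds rather than seconds, and null handling is omitted.

```java
// Hypothetical helper, not part of the Esper API.
class AfterSketch {
    // a.after(b): A starts after B ends.
    static boolean after(long aStart, long bEnd) {
        return aStart - bEnd > 0;
    }

    // a.after(b, threshold): A starts at least threshold after B ends.
    static boolean after(long aStart, long bEnd, long thresholdMillis) {
        return aStart - bEnd >= thresholdMillis;
    }

    // a.after(b, low, high): the gap lies within the range, endpoints included.
    static boolean after(long aStart, long bEnd, long lowMillis, long highMillis) {
        // A reversed range is swapped, so (10 sec, 5 sec) equals (5 sec, 10 sec).
        long lo = Math.min(lowMillis, highMillis);
        long hi = Math.max(lowMillis, highMillis);
        long gap = aStart - bEnd;
        return lo <= gap && gap <= hi;
    }

    public static void main(String[] args) {
        System.out.println(after(8_000, 1_000, 5_000, 10_000));  // true: gap is 7 seconds
        System.out.println(after(8_000, 1_000, 10_000, 5_000));  // true: range is reversed
        System.out.println(after(0, 7_000, -5_000, -10_000));    // true: A starts 7 seconds before B ends
    }
}
```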

12.4.6. Before

The before date-time method returns true if an event happens before another event, or a timestamp is before another timestamp.

The method compares the input value's end timestamp (a.endTimestamp) and the first parameter's start timestamp (b.startTimestamp) to determine whether A happens before B.

If used with one parameter, for example in a.before(b), the method returns true if A ends before B starts.

select * from A#lastevent as a, B#lastevent as b where a.before(b)
// Above matches when:
//   b.startTimestamp - a.endTimestamp > 0

If providing two parameters, for example in a.before(b, 5 sec), the method returns true if A ends at least 5 seconds before B starts.

select * from A#lastevent as a, B#lastevent as b where a.before(b, 5 sec)
// Above matches when:
//   b.startTimestamp - a.endTimestamp >= 5 seconds

If providing three parameters, for example in a.before(b, 5 sec, 10 sec), the method returns true if A ends at least 5 seconds but no more than 10 seconds before B starts.

select * from A#lastevent as a, B#lastevent as b where a.before(b, 5 sec, 10 sec)
// Above matches when:
//   5 seconds <= b.startTimestamp - a.endTimestamp <= 10 seconds

Negative values for the range are allowed. For example in a.before(b, -5 sec, -10 sec), the method returns true if A ends at least 5 seconds but no more than 10 seconds after B starts.

If the range low endpoint is greater than the range high endpoint, the compiler automatically reverses them. Thus a.before(b, 10 sec, 5 sec) has the same semantics as a.before(b, 5 sec, 10 sec).

12.4.7. Coincides

The coincides date-time method returns true if an event and another event happen at the same time, or two timestamps are the same value.

The method compares the input value's start and end timestamp with the first parameter's start and end timestamp and determines whether they are equal.

If used with one parameter, for example in a.coincides(b), the method returns true if the start timestamp of A and B are the same and the end timestamps of A and B are also the same.

select * from A#lastevent as a, B#lastevent as b where a.coincides(b)
// Above matches when:
//   a.startTimestamp = b.startTimestamp and a.endTimestamp = b.endTimestamp

If providing two parameters, for example in a.coincides(b, 5 sec), the method returns true if the difference between the start timestamps of A and B is equal to or less than 5 seconds and the difference between the end timestamps of A and B is also equal to or less than 5 seconds.

select * from A#lastevent as a, B#lastevent as b where a.coincides(b, 5 sec)
// Above matches when:
//   abs(a.startTimestamp - b.startTimestamp) <= 5 seconds and 
//   abs(a.endTimestamp - b.endTimestamp) <= 5 seconds

If providing three parameters, for example in a.coincides(b, 5 sec, 10 sec), the method returns true if the difference between the start timestamps of A and B is equal to or less than 5 seconds and the difference between the end timestamps of A and B is equal to or less than 10 seconds.

select * from A#lastevent as a, B#lastevent as b where a.coincides(b, 5 sec, 10 sec)
// Above matches when:
//   abs(a.startTimestamp - b.startTimestamp) <= 5 seconds and 
//   abs(a.endTimestamp - b.endTimestamp) <= 10 seconds

A negative value for interval parameters is not allowed. If your interval parameter is itself an expression that returns a negative value the runtime logs a warning message and returns null.
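The coincides conditions, including the null result for a negative interval, can be summarized in a short Java sketch. The class and method names are hypothetical, values are in milliseconds, and the warning that the runtime logs for a negative interval is not modeled here.

```java
// Hypothetical helper, not part of the Esper API.
class CoincidesSketch {
    // a.coincides(b): start and end timestamps are equal.
    static Boolean coincides(long aStart, long aEnd, long bStart, long bEnd) {
        return coincides(aStart, aEnd, bStart, bEnd, 0L, 0L);
    }

    // a.coincides(b, threshold): one threshold for both start and end.
    static Boolean coincides(long aStart, long aEnd, long bStart, long bEnd,
                             long thresholdMillis) {
        return coincides(aStart, aEnd, bStart, bEnd, thresholdMillis, thresholdMillis);
    }

    // a.coincides(b, startThreshold, endThreshold).
    static Boolean coincides(long aStart, long aEnd, long bStart, long bEnd,
                             long startThresholdMillis, long endThresholdMillis) {
        if (startThresholdMillis < 0 || endThresholdMillis < 0) {
            return null; // negative intervals are not allowed
        }
        return Math.abs(aStart - bStart) <= startThresholdMillis
            && Math.abs(aEnd - bEnd) <= endThresholdMillis;
    }

    public static void main(String[] args) {
        System.out.println(coincides(1_000, 2_000, 1_000, 2_000));         // true
        System.out.println(coincides(1_000, 2_000, 4_000, 7_000, 5_000));  // true
        System.out.println(coincides(1_000, 2_000, 1_000, 2_000, -1));     // null
    }
}
```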

12.4.8. During

The during date-time method returns true if an event happens during the occurrence of another event, or when a timestamp falls within the occurrence of an event.

The method determines whether the input value's start and end timestamp are during the first parameter's start and end timestamp. The symmetrical opposite is Section 12.4.11, “Includes”.

If used with one parameter, for example in a.during(b), the method returns true if the start timestamp of A is after the start timestamp of B and the end timestamp of A is before the end timestamp of B.

Sample EPL:

select * from A#lastevent as a, B#lastevent as b where a.during(b)
// Above matches when:
//   b.startTimestamp < a.startTimestamp <= a.endTimestamp < b.endTimestamp

If providing two parameters, for example in a.during(b, 5 sec), the method returns true if the difference between the start timestamps of A and B is equal to or less than 5 seconds and the difference between the end timestamps of A and B is also equal to or less than 5 seconds.

Sample EPL:

select * from A#lastevent as a, B#lastevent as b where a.during(b, 5 sec)
// Above matches when:
//   0 < a.startTimestamp - b.startTimestamp <= 5 sec and 
//   0 < b.endTimestamp - a.endTimestamp <= 5 sec

If providing three parameters, for example in a.during(b, 5 sec, 10 sec), the method returns true if the difference between the start timestamps of A and B and the difference between the end timestamps of A and B is between 5 and 10 seconds.

Sample EPL:

select * from A#lastevent as a, B#lastevent as b where a.during(b, 5 sec, 10 sec)
// Above matches when:
//   5 seconds <= a.startTimestamp - b.startTimestamp <= 10 seconds and 
//   5 seconds <= b.endTimestamp - a.endTimestamp <= 10 seconds

If providing five parameters, for example in a.during(b, 5 sec, 10 sec, 20 sec, 30 sec), the method returns true if the difference between the start timestamps of A and B is between 5 seconds and 10 seconds and the difference between the end timestamps of A and B is between 20 seconds and 30 seconds.

Sample EPL:

select * from A#lastevent as a, B#lastevent as b
  where a.during(b, 5 sec, 10 sec, 20 sec, 30 sec)
// Above matches when:
//   5 seconds <= a.startTimestamp - b.startTimestamp <= 10 seconds and 
//   20 seconds <= b.endTimestamp - a.endTimestamp <= 30 seconds
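The four forms of during can be compared side by side in a small Java sketch. It is an interpretation, not the runtime's code: the names are hypothetical, values are in milliseconds, and the sketch measures the start gap as a.startTimestamp - b.startTimestamp and the end gap as b.endTimestamp - a.endTimestamp, so that both gaps are positive when A lies inside B (an assumption chosen for consistency with the one-parameter condition).

```java
// Hypothetical helper, not part of the Esper API.
class DuringSketch {
    // a.during(b): A lies strictly inside B.
    static boolean during(long aStart, long aEnd, long bStart, long bEnd) {
        return bStart < aStart && aStart <= aEnd && aEnd < bEnd;
    }

    // a.during(b, threshold): both gaps are positive and at most threshold.
    static boolean during(long aStart, long aEnd, long bStart, long bEnd, long threshold) {
        long startGap = aStart - bStart;
        long endGap = bEnd - aEnd;
        return 0 < startGap && startGap <= threshold && 0 < endGap && endGap <= threshold;
    }

    // a.during(b, min, max): the same range applies to both gaps.
    static boolean during(long aStart, long aEnd, long bStart, long bEnd, long min, long max) {
        return during(aStart, aEnd, bStart, bEnd, min, max, min, max);
    }

    // a.during(b, minStart, maxStart, minEnd, maxEnd): separate ranges per gap.
    static boolean during(long aStart, long aEnd, long bStart, long bEnd,
                          long minStart, long maxStart, long minEnd, long maxEnd) {
        long startGap = aStart - bStart;
        long endGap = bEnd - aEnd;
        return minStart <= startGap && startGap <= maxStart
            && minEnd <= endGap && endGap <= maxEnd;
    }

    public static void main(String[] args) {
        System.out.println(during(2_000, 3_000, 1_000, 4_000));                 // true
        System.out.println(during(6_000, 14_000, 0, 20_000, 5_000, 10_000));    // true
        System.out.println(during(6_000, 14_000, 0, 40_000,
                                  5_000, 10_000, 20_000, 30_000));              // true
    }
}
```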

12.4.9. Finishes

The finishes date-time method returns true if an event starts after another event starts and the event ends at the same time as the other event.

The method determines whether the input value's start timestamp is after the first parameter's start timestamp and the end timestamp of the input value and the first parameter are the same. The symmetrical opposite is Section 12.4.10, “Finished By”.

If used with one parameter, for example in a.finishes(b), the method returns true if the start timestamp of A is after the start timestamp of B and the end timestamp of A and B are the same.

Sample EPL:

select * from A#lastevent as a, B#lastevent as b where a.finishes(b)
// Above matches when:
//   b.startTimestamp < a.startTimestamp and a.endTimestamp = b.endTimestamp

If providing two parameters, for example in a.finishes(b, 5 sec), the method returns true if the start timestamp of A is after the start timestamp of B and the difference between the end timestamps of A and B is equal to or less than 5 seconds.

Sample EPL:

select * from A#lastevent as a, B#lastevent as b where a.finishes(b, 5 sec)
// Above matches when:
//   b.startTimestamp < a.startTimestamp and 
//   abs(a.endTimestamp - b.endTimestamp) <= 5 seconds

A negative value for interval parameters is not allowed. If your interval parameter is itself an expression that returns a negative value the runtime logs a warning message and returns null.

12.4.10. Finished By

The finishedBy date-time method returns true if an event starts before another event starts and the event ends at the same time as the other event.

The method determines whether the input value's start timestamp happens before the first parameter's start timestamp and the end timestamp of the input value and the first parameter are the same. The symmetrical opposite is Section 12.4.9, “Finishes”.

If used with one parameter, for example in a.finishedBy(b), the method returns true if the start timestamp of A is before the start timestamp of B and the end timestamp of A and B are the same.

Sample EPL:

select * from A#lastevent as a, B#lastevent as b where a.finishedBy(b)
// Above matches when:
//   a.startTimestamp < b.startTimestamp and a.endTimestamp = b.endTimestamp

If providing two parameters, for example in a.finishedBy(b, 5 sec), the method returns true if the start timestamp of A is before the start timestamp of B and the difference between the end timestamps of A and B is equal to or less than 5 seconds.

Sample EPL:

select * from A#lastevent as a, B#lastevent as b where a.finishedBy(b, 5 sec)
// Above matches when:
//   a.startTimestamp < b.startTimestamp and 
//   abs(a.endTimestamp - b.endTimestamp) <= 5 seconds

12.4.11. Includes

The includes date-time method returns true if the parameter event happens during the occurrence of the input event, or when a timestamp falls within the occurrence of an event.

The method determines whether the first parameter's start and end timestamp are during the input value's start and end timestamp. The symmetrical opposite is Section 12.4.8, “During”.

If used with one parameter, for example in a.includes(b), the method returns true if the start timestamp of B is after the start timestamp of A and the end timestamp of B is before the end timestamp of A.

Sample EPL:

select * from A#lastevent as a, B#lastevent as b where a.includes(b)
// Above matches when:
//   a.startTimestamp < b.startTimestamp <= b.endTimestamp < a.endTimestamp

If providing two parameters, for example in a.includes(b, 5 sec), the method returns true if the difference between the start timestamps of A and B is equal to or less than 5 seconds and the difference between the end timestamps of A and B is also equal to or less than 5 seconds.

Sample EPL:

select * from A#lastevent as a, B#lastevent as b where a.includes(b, 5 sec)
// Above matches when:
//   0 < b.startTimestamp - a.startTimestamp <= 5 sec and 
//   0 < a.endTimestamp - b.endTimestamp <= 5 sec

If providing three parameters, for example in a.includes(b, 5 sec, 10 sec), the method returns true if the difference between the start timestamps of A and B and the difference between the end timestamps of A and B is between 5 and 10 seconds.

Sample EPL:

select * from A#lastevent as a, B#lastevent as b where a.includes(b, 5 sec, 10 sec)
// Above matches when:
//   5 seconds <= b.startTimestamp - a.startTimestamp <= 10 seconds and 
//   5 seconds <= a.endTimestamp - b.endTimestamp <= 10 seconds

If providing five parameters, for example in a.includes(b, 5 sec, 10 sec, 20 sec, 30 sec), the method returns true if the difference between the start timestamps of A and B is between 5 seconds and 10 seconds and the difference between the end timestamps of A and B is between 20 seconds and 30 seconds.

Sample EPL:

select * from A#lastevent as a, B#lastevent as b
  where a.includes(b, 5 sec, 10 sec, 20 sec, 30 sec)
// Above matches when:
//   5 seconds <= b.startTimestamp - a.startTimestamp <= 10 seconds and 
//   20 seconds <= a.endTimestamp - b.endTimestamp <= 30 seconds
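A compact Java sketch of the includes conditions follows. It is an interpretation rather than the runtime's code: the names are hypothetical, values are in milliseconds, and the start gap is measured as b.startTimestamp - a.startTimestamp and the end gap as a.endTimestamp - b.endTimestamp, so that both gaps are positive when B lies inside A (an assumption consistent with the one-parameter condition).

```java
// Hypothetical helper, not part of the Esper API.
class IncludesSketch {
    // a.includes(b): B lies strictly inside A.
    static boolean includes(long aStart, long aEnd, long bStart, long bEnd) {
        return aStart < bStart && bStart <= bEnd && bEnd < aEnd;
    }

    // a.includes(b, threshold): both gaps are positive and at most threshold.
    static boolean includes(long aStart, long aEnd, long bStart, long bEnd, long threshold) {
        long startGap = bStart - aStart;
        long endGap = aEnd - bEnd;
        return 0 < startGap && startGap <= threshold && 0 < endGap && endGap <= threshold;
    }

    // a.includes(b, min, max): the same range applies to both gaps.
    static boolean includes(long aStart, long aEnd, long bStart, long bEnd, long min, long max) {
        return includes(aStart, aEnd, bStart, bEnd, min, max, min, max);
    }

    // a.includes(b, minStart, maxStart, minEnd, maxEnd): separate ranges per gap.
    static boolean includes(long aStart, long aEnd, long bStart, long bEnd,
                            long minStart, long maxStart, long minEnd, long maxEnd) {
        long startGap = bStart - aStart;
        long endGap = aEnd - bEnd;
        return minStart <= startGap && startGap <= maxStart
            && minEnd <= endGap && endGap <= maxEnd;
    }

    public static void main(String[] args) {
        System.out.println(includes(0, 20_000, 6_000, 14_000));                 // true
        System.out.println(includes(0, 20_000, 6_000, 14_000, 5_000, 10_000));  // true
        System.out.println(includes(0, 20_000, 0, 14_000));                     // false: same start
    }
}
```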

12.4.12. Meets

The meets date-time method returns true if the event's end time is the same as another event's start time.

The method compares the input value's end timestamp and the first parameter's start timestamp and determines whether they equal.

If used with one parameter, for example in a.meets(b), the method returns true if the end timestamp of A is the same as the start timestamp of B.

select * from A#lastevent as a, B#lastevent as b where a.meets(b)
// Above matches when:
//   a.endTimestamp = b.startTimestamp

If providing two parameters, for example in a.meets(b, 5 sec), the method returns true if the difference between the end timestamp of A and the start timestamp of B is equal to or less than 5 seconds.

A negative value for the interval parameter is not allowed. If your interval parameter is itself an expression that returns a negative value, the runtime logs a warning message and returns null.

select * from A#lastevent as a, B#lastevent as b where a.meets(b, 5 sec)
// Above matches when:
//   abs(b.startTimestamp - a.endTimestamp) <= 5 seconds
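
The comparison performed by meets, and by its mirror image metBy, can be sketched as follows. This is a minimal Python model under stated assumptions, not the runtime's code: the function names are hypothetical, timestamps and the threshold are millisecond values, and None stands in for the null result described for a negative interval.

```python
def meets(a_end, b_start, threshold=0):
    """Model a.meets(b[, threshold]); returns None for a negative threshold."""
    if threshold < 0:
        return None  # the text states that the runtime logs a warning and returns null
    return abs(b_start - a_end) <= threshold


def met_by(a_start, b_end, threshold=0):
    """Model a.metBy(b[, threshold]) as the mirror image of meets."""
    return meets(b_end, a_start, threshold)
```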

12.4.13. Met By

The metBy date-time method returns true if the event's start time is the same as another event's end time.

The method compares the input value's start timestamp and the first parameter's end timestamp and determines whether they are equal.

If used with one parameter, for example in a.metBy(b), the method returns true if the start timestamp of A is the same as the end timestamp of B.

select * from A#lastevent as a, B#lastevent as b where a.metBy(b)
// Above matches when:
//   a.startTimestamp = b.endTimestamp

If providing two parameters, for example in a.metBy(b, 5 sec), the method returns true if the difference between the end timestamp of B and the start timestamp of A is equal to or less than 5 seconds.

A negative value for the interval parameter is not allowed. If your interval parameter is itself an expression that returns a negative value, the runtime logs a warning message and returns null.

select * from A#lastevent as a, B#lastevent as b where a.metBy(b, 5 sec)
// Above matches when:
//   abs(a.startTimestamp - b.endTimestamp) <= 5 seconds

12.4.14. Overlaps

The overlaps date-time method returns true if the event starts before another event starts and finishes after the other event starts, but before the other event finishes (events have an overlapping period of time).

The method determines whether the input value's start and end timestamp indicate an overlap with the first parameter's start and end timestamp, such that A starts before B starts and A ends after B started but before B ends.
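
The overlap condition, together with the optional one-threshold and two-threshold forms of the method, can be modeled in a few lines. The Python sketch below is illustrative only: Interval, overlaps and overlapped_by are hypothetical names, timestamps and thresholds are millisecond values, and the thresholds are assumed to apply to the difference a.end - b.start.

```python
from collections import namedtuple

# Hypothetical stand-in for an event with start and end timestamps (milliseconds).
Interval = namedtuple("Interval", ["start", "end"])


def overlaps(a, b, *thresholds):
    """Model a.overlaps(b), a.overlaps(b, max) and a.overlaps(b, min, max)."""
    if not (a.start < b.start < a.end < b.end):
        return False
    delta = a.end - b.start  # length of the overlapping period
    if len(thresholds) == 0:
        return True
    if len(thresholds) == 1:
        return 0 <= delta <= thresholds[0]
    if len(thresholds) == 2:
        lo, hi = thresholds
        return lo <= delta <= hi
    raise ValueError("expected 0, 1 or 2 threshold values")


def overlapped_by(a, b, *thresholds):
    """Model a.overlappedBy(b, ...) as overlaps with the roles of A and B swapped."""
    return overlaps(b, a, *thresholds)
```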

If used with one parameter, for example in a.overlaps(b), the method returns true if the start timestamp of A is before the start timestamp of B and the end timestamp of A is after the start timestamp of B and before the end timestamp of B.

select * from A#lastevent as a, B#lastevent as b where a.overlaps(b)
// Above matches when:
//   a.startTimestamp < b.startTimestamp < a.endTimestamp < b.endTimestamp

If providing two parameters, for example in a.overlaps(b, 5 sec), the method returns true if, in addition, the difference between the end timestamp of A and the start timestamp of B is equal to or less than 5 seconds.

select * from A#lastevent as a, B#lastevent as b where a.overlaps(b, 5 sec)
// Above matches when:
//   a.startTimestamp < b.startTimestamp < a.endTimestamp < b.endTimestamp and
//   0 <= a.endTimestamp - b.startTimestamp <= 5 seconds

If providing three parameters, for example in a.overlaps(b, 5 sec, 10 sec), the method returns true if, in addition, the difference between the end timestamp of A and the start timestamp of B is between 5 and 10 seconds.

select * from A#lastevent as a, B#lastevent as b where a.overlaps(b, 5 sec, 10 sec)
// Above matches when:
//   a.startTimestamp < b.startTimestamp < a.endTimestamp < b.endTimestamp and
//   5 seconds <= a.endTimestamp - b.startTimestamp <= 10 seconds

12.4.15. Overlapped By

The overlappedBy date-time method returns true if the parameter event starts before the input event starts and the parameter event finishes after the input event starts, but before the input event finishes (events have an overlapping period of time).

The method determines whether the input value's start and end timestamp indicate an overlap with the first parameter's start and end timestamp, such that B starts before A starts and B ends after A started but before A ends.

If used with one parameter, for example in a.overlappedBy(b), the method returns true if the start timestamp of B is before the start timestamp of A and the end timestamp of B is after the start timestamp of A and before the end timestamp of A.

select * from A#lastevent as a, B#lastevent as b where a.overlappedBy(b)
// Above matches when:
//   b.startTimestamp < a.startTimestamp < b.endTimestamp < a.endTimestamp

If providing two parameters, for example in a.overlappedBy(b, 5 sec), the method returns true if, in addition, the difference between the end timestamp of B and the start timestamp of A is equal to or less than 5 seconds.

select * from A#lastevent as a, B#lastevent as b where a.overlappedBy(b, 5 sec)
// Above matches when:
//   b.startTimestamp < a.startTimestamp < b.endTimestamp < a.endTimestamp and
//   0 <= b.endTimestamp - a.startTimestamp <= 5 seconds

If providing three parameters, for example in a.overlappedBy(b, 5 sec, 10 sec), the method returns true if, in addition, the difference between the end timestamp of B and the start timestamp of A is between 5 and 10 seconds.

select * from A#lastevent as a, B#lastevent as b where a.overlappedBy(b, 5 sec, 10 sec)
// Above matches when:
//   b.startTimestamp < a.startTimestamp < b.endTimestamp < a.endTimestamp and
//   5 seconds <= b.endTimestamp - a.startTimestamp <= 10 seconds

12.4.16. Starts

The starts date-time method returns true if an event and another event start at the same time and the event's end happens before the other event's end.

The method determines whether the start timestamps of the input value and the first parameter are the same and the end timestamp of the input value is before the end timestamp of the first parameter.

If used with one parameter, for example in a.starts(b), the method returns true if the start timestamp of A and B are the same and the end timestamp of A is before the end timestamp of B.

select * from A#lastevent as a, B#lastevent as b where a.starts(b)
// Above matches when:
//   a.startTimestamp = b.startTimestamp and a.endTimestamp < b.endTimestamp

If providing two parameters, for example in a.starts(b, 5 sec), the method returns true if the difference between the start timestamps of A and B is equal to or less than 5 seconds and the end timestamp of A is before the end timestamp of B.

A negative value for the interval parameter is not allowed. If your interval parameter is itself an expression that returns a negative value, the runtime logs a warning message and returns null.

select * from A#lastevent as a, B#lastevent as b where a.starts(b, 5 sec)
// Above matches when:
//   abs(a.startTimestamp - b.startTimestamp) <= 5 seconds and
//   a.endTimestamp < b.endTimestamp
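
The starts comparison and its counterpart startedBy differ only in which event must end first. A minimal Python model, under the assumption that timestamps and the threshold are millisecond values and with None standing in for the null result of a negative interval, could look as follows. Interval, starts and started_by are hypothetical names introduced for this sketch.

```python
from collections import namedtuple

# Hypothetical stand-in for an event with start and end timestamps (milliseconds).
Interval = namedtuple("Interval", ["start", "end"])


def starts(a, b, threshold=0):
    """Model a.starts(b[, threshold]): starts align and A ends before B."""
    if threshold < 0:
        return None  # the text states that the runtime logs a warning and returns null
    return abs(a.start - b.start) <= threshold and a.end < b.end


def started_by(a, b, threshold=0):
    """Model a.startedBy(b[, threshold]): starts align and B ends before A."""
    if threshold < 0:
        return None
    return abs(a.start - b.start) <= threshold and b.end < a.end
```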

12.4.17. Started By

The startedBy date-time method returns true if an event and another event start at the same time and the other event's end happens before the input event's end.

The method determines whether the start timestamp of the input value and the first parameter are the same and the end timestamp of the first parameter is before the end timestamp of the input value.

If used with one parameter, for example in a.startedBy(b), the method returns true if the start timestamp of A and B are the same and the end timestamp of B is before the end timestamp of A.

select * from A#lastevent as a, B#lastevent as b where a.startedBy(b)
// Above matches when:
//   a.startTimestamp = b.startTimestamp and b.endTimestamp < a.endTimestamp

If providing two parameters, for example in a.startedBy(b, 5 sec), the method returns true if the difference between the start timestamps of A and B is equal to or less than 5 seconds and the end timestamp of B is before the end timestamp of A.

A negative value for the interval parameter is not allowed. If your interval parameter is itself an expression that returns a negative value, the runtime logs a warning message and returns null.

select * from A#lastevent as a, B#lastevent as b where a.startedBy(b, 5 sec)
// Above matches when:
//   abs(a.startTimestamp - b.startTimestamp) <= 5 seconds and
//   b.endTimestamp < a.endTimestamp

Chapter 13. EPL Reference: Data Windows

13.1. A Note on Data Window Name and Parameters

13.2. A Note on Batch Windows

13.3. Data Windows

13.3.1. Length Window (length or win:length)
13.3.2. Length Batch Window (length_batch or win:length_batch)
13.3.3. Time Window (time or win:time)
13.3.4. Externally-timed Window (ext_timed or win:ext_timed)
13.3.5. Time batch Window (time_batch or win:time_batch)
13.3.6. Externally-timed Batch Window (ext_timed_batch or win:ext_timed_batch)
13.3.7. Time-Length Combination Batch Window (time_length_batch or win:time_length_batch)
13.3.8. Time-Accumulating Window (time_accum or win:time_accum)
13.3.9. Keep-All Window (keepall or win:keepall)
13.3.10. First Length Window (firstlength or win:firstlength)
13.3.11. First Time Window (firsttime or win:firsttime)
13.3.12. Expiry Expression Window (expr or win:expr)
13.3.13. Expiry Expression Batch Window (expr_batch or win:expr_batch)
13.3.14. Unique Window (unique or std:unique)
13.3.15. Grouped Data Window (groupwin or std:groupwin)
13.3.16. Last Event Window (lastevent or std:lastevent)
13.3.17. First Event Window (firstevent or std:firstevent)
13.3.18. First Unique Window (firstunique or std:firstunique)
13.3.19. Sorted Window (sort or ext:sort)
13.3.20. Ranked Window (rank or ext:rank)
13.3.21. Time-Order Window (time_order or ext:time_order)
13.3.22. Time-To-Live Window (timetolive or ext:timetolive)

13.4. Special Derived-Value Windows

13.4.1. Size Derived-Value Window (size or std:size)
13.4.2. Univariate Statistics Derived-Value Window (uni or stat:uni)
13.4.3. Regression Derived-Value Window (linest or stat:linest)
13.4.4. Correlation Derived-Value Window (correl or stat:correl)
13.4.5. Weighted Average Derived-Value Window (weighted_avg or stat:weighted_avg)

This chapter outlines the data windows. Chapter 2, Basic Concepts provides additional information on the relationship of filtering, windows and aggregation. Please also see Section 5.4.3, “Specifying Data Windows” for the use of windows in the from clause with streams, patterns and named windows.

Data windows retain incoming events until an expiry policy indicates to release events. Thus data windows are a means of indicating what subset of events to analyze.

Two or more data windows can be combined. This allows the set of events retained by one data window to be placed into a union or an intersection with the set of events retained by one or more other data windows. Please see Section 5.4.4, “Multiple Data Windows” for more detail.

The keep-all data window counts as a data window but has no expiry policy: it retains all events received. The grouped-window declaration allocates a new data window per grouping criteria and thereby counts as a data window, but cannot appear alone.

The next table summarizes data windows:

Table 13.1. Built-in Data Windows

Data Window	Syntax	Description
Length Window	`length(`size`)`	Sliding length window extending the specified number of elements into the past.
Length Batch Window	`length_batch(`size`)`	Tumbling window that batches events and releases them when a given minimum number of events has been collected.
Time Window	`time(`time period`)`	Sliding time window extending the specified time interval into the past.
Externally-timed Window	`ext_timed(`timestamp expression, time period`)`	Sliding time window, based on the long-type time value supplied by an expression.
Time Batch Window	`time_batch(`time period[,optional reference point] [, flow control]`)`	Tumbling window that batches events and releases them every specified time interval, with flow control options.
Externally-timed Batch Window	`ext_timed_batch(`timestamp expression, time period[,optional reference point]`)`	Tumbling window that batches events and releases them every specified time interval based on the long-type value supplied by an expression.
Time-Length Combination Batch Window	`time_length_batch(`time period, size [, flow control]`)`	Tumbling multi-policy time and length batch window with flow control options.
Time-Accumulating Window	`time_accum(`time period`)`	Sliding time window that accumulates events until no more events arrive within a given time interval.
Keep-All Window	`keepall`	The keep-all data window simply retains all events.
Sorted Window	`sort(`size, sort criteria`)`	Sorts by values returned by sort criteria expressions and keeps only the top events up to the given size.
Ranked Window	`rank(`unique criteria(s), size, sort criteria(s)`)`	Retains only the most recent among events having the same value for the criteria expression(s) sorted by sort criteria expressions and keeps only the top events up to the given size.
Time-Order Window	`time_order(`timestamp expression, time period`)`	Orders events that arrive out-of-order, using an expression providing timestamps to be ordered.
Time-To-Live Window	`timetolive(`timestamp expression`)`	Retains events until the time returned by the timestamp expression.
Unique Window	`unique(`unique criteria(s)`)`	Retains only the most recent among events having the same value for the criteria expression(s). Acts as a length window of size 1 for each distinct expression value.
Grouped Data Window	`groupwin(`grouping criteria(s)`)`	Groups events into sub-data-windows by the value of the specified expression(s), generally used to provide a separate data window per group.
Last Event Window	`lastevent`	Retains the last event, acts as a length window of size 1.
First Event Window	`firstevent`	Retains the very first arriving event, disregarding all subsequent events.
First Unique Window	`firstunique(`unique criteria(s)`)`	Retains only the very first among events having the same value for the criteria expression(s), disregarding all subsequent events for same value(s).
First Length Window	`firstlength(`size`)`	Retains the first size events, disregarding all subsequent events.
First Time Window	`firsttime(`time period`)`	Retains the events arriving until the time interval has passed, disregarding all subsequent events.
Expiry Expression Window	`expr(`expiry expression`)`	Expires events based on the result of an expiry expression passed as a parameter.
Expiry Expression Batch Window	`expr_batch(`expiry expression`)`	Tumbling window that batches events and releases them based on the result of an expiry expression passed as a parameter.

A special kind of data window, used less frequently, is the derived-value window. Derived-value windows derive a new value from event streams and post the result as events of a new type. The table below summarizes these special derived-value windows.

Table 13.2. Built-in Derived-Value Data Windows

Data Window	Syntax	Description
Size	`size(`[expression, ...]`)`	Derives a count of the number of events in a data window, or in an insert stream if used without a data window, and optionally provides additional event properties as listed in parameters.
Univariate statistics	`uni(`value expression [,expression, ...]`)`	Calculates univariate statistics on the values returned by the expression.
Regression	`linest(`value expression, value expression [,expression, ...]`)`	Calculates regression on the values returned by two expressions.
Correlation	`correl(`value expression, value expression [,expression, ...]`)`	Calculates the correlation value on the values returned by two expressions.
Weighted average	`weighted_avg(`value expression, value expression [,expression, ...]`)`	Calculates weighted average given a weight expression and an expression to compute the average for.

13.1. A Note on Data Window Name and Parameters

The syntax for data windows starts with the data window name, followed by optional parameter expressions in parentheses:

name(window_parameters)

This example specifies a time window of 5 seconds:

select * from StockTickEvent#time(5 sec)

EPL organizes built-in data windows in namespaces and names. Windows that provide sliding or tumbling data windows are in the win namespace. Other commonly used windows are in the std namespace. The ext namespace contains windows that order events. The stat namespace is used for windows that derive statistical data.

Alternatively, you may specify the namespace name and a colon (:) character before the window name:

namespace:name(window_parameters)

The below examples all specify a time window of 5 seconds:

select * from StockTickEvent#time(5 sec)

select * from StockTickEvent#win:time(5 sec)

select * from StockTickEvent.win:time(5 sec)

All expressions are allowed as parameters to data windows, including expressions that contain variables or substitution parameters for prepared statements. Subqueries, the special prior and prev functions and aggregations (with the exception of the expression window and expression batch window) are not allowed as data window parameters.

For example, assuming a variable by name VAR_WINDOW_SIZE is defined:

select * from StockTickEvent#time(VAR_WINDOW_SIZE)

The system evaluates expression parameters for data windows at the time of context partition instantiation with the exception of the expression window (expr) and expression batch window (expr_batch).

Also consider multiple data windows in intersection or union (keywords retain-intersection and retain-union). Consider writing a custom plug-in data window if your application requires behavior that is not yet provided by any of the built-in windows.

If a window takes no parameters you may leave the parentheses off or use empty parentheses ().

The below examples all specify a keep-all window:

select * from StockTickEvent#keepall

select * from StockTickEvent#keepall()

select * from StockTickEvent.win:keepall()

select * from StockTickEvent.win:keepall

Expression parameters can reference context-provided properties. For example:

create schema ParameterEvent(windowSize int)

create context MyContext initiated by ParameterEvent as params terminated after 1 year

context MyContext select * from StockTickEvent#length(context.params.windowSize)

13.2. A Note on Batch Windows

Batch windows buffer events until a certain threshold is reached and then release the batched events for processing. The released events become the insert stream events and the previous batch of events constitutes the remove stream events. Batch windows thus retain the current and the last batch of events in memory.

It is often desirable to aggregate without retaining events in memory, or while keeping only the current events in memory (and not also the last batch of events). You can declare a context and define what starts and ends a "batch" instead. Contexts provide a large degree of freedom in allowing batches to overlap, in allowing batches to span multiple statements and in allowing batches to have complex start and end conditions. They are further described in Chapter 4, Context and Context Partitions.

This example declares a non-overlapping context that spans a time interval of 3 seconds (i.e. a batch of 3 seconds):

create context IntervalSpanning3Seconds start @now end after 3 sec

The next example EPL aggregates events without retaining events in memory and outputs at the end of each interval:

context IntervalSpanning3Seconds select count(*) from Events output snapshot when terminated

Here is an example that outputs all events when at least 10 events, in the 3-second interval, have collected:

context IntervalSpanning3Seconds select window(*) from Events#keepall having count(*) >= 10

For the examples above, at the end of each 3-second interval, the runtime discards all data windows and aggregation state. If your application would like 3-second intervals keyed by some fields please consider a nested context declaration with a keyed segmented context, for example:

create context PerSymbolInterval3Sec 
  context ById partition by symbol from StockTick, 
  context Interval3Sec start @now end after 3 sec

Batch windows keep not only the current batch in memory but also the previous batch of events. For example, let's say at time 0 an event arrives and enters the batch window. At time 3 seconds (3-second batch window) the event becomes an insert-stream event and the runtime now updates aggregations for that batch (i.e. count goes up to 1). At time 6 seconds the event becomes a remove-stream event and the runtime now updates aggregations for that batch (i.e. count goes down to 0). Since the runtime continually updates aggregations from insert and remove stream events, and does not re-compute aggregations, batch windows follow the same paradigm.
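
The incremental bookkeeping described above can be illustrated with a small simulation. The Python sketch below is a simplified model and not the runtime's implementation: time_batch_counts is a hypothetical name, times are plain numbers of seconds, and flush times are assumed to fall on multiples of the interval starting at time zero. At each flush the current batch is added to the count (insert stream) and the previous batch is subtracted (remove stream).

```python
def time_batch_counts(arrival_times, interval, until):
    """Return (flush_time, count) pairs, updating count from insert and remove streams."""
    count = 0
    previous_batch = []
    results = []
    flush = interval
    while flush <= until:
        # Events that arrived during the interval ending at this flush time.
        current_batch = [t for t in arrival_times if flush - interval <= t < flush]
        count += len(current_batch)   # insert stream: the batch just released
        count -= len(previous_batch)  # remove stream: the batch released before it
        results.append((flush, count))
        previous_batch = current_batch
        flush += interval
    return results
```

With a single event arriving at time 0 and a 3-second interval, the model yields a count of 1 at 3 seconds and a count of 0 at 6 seconds, matching the walk-through above.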

13.3. Data Windows

13.3.1. Length Window (`length` or `win:length`)

This window is a moving (sliding) length window extending the specified number of elements into the past. The window takes a single expression as a parameter providing a numeric size value that defines the window size:

length(size_expression)

The below example sums the price for the last 5 stock ticks for symbol GE.

select sum(price) from StockTickEvent(symbol='GE')#length(5)
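
To make the sliding behavior concrete, the following Python sketch models a length window that maintains a running sum. It is an illustration under simplifying assumptions, not the runtime's implementation, and LengthWindowSum is a hypothetical name: each arriving value is added to the total, and once the window holds more than the given number of elements the oldest value leaves the window and is subtracted.

```python
from collections import deque


class LengthWindowSum:
    """Sliding length window that keeps a running sum of the retained values."""

    def __init__(self, size):
        self.size = size
        self.window = deque()
        self.total = 0

    def add(self, price):
        self.window.append(price)
        self.total += price
        if len(self.window) > self.size:
            # The oldest value leaves the window and is removed from the sum.
            self.total -= self.window.popleft()
        return self.total
```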

The next example keeps a length window of 10 events of stock trade events, with a separate window for each symbol. The sum of price is calculated only for the last 10 events for each symbol and aggregates per symbol:

select sum(price) from StockTickEvent#groupwin(symbol)#length(10) group by symbol
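
The effect of combining groupwin with a length window can be pictured as one independent length window per symbol. The Python sketch below is a rough model only: grouped_length_sums is a hypothetical name, and ticks are assumed to be (symbol, price) pairs.

```python
from collections import defaultdict, deque


def grouped_length_sums(ticks, size):
    """For each (symbol, price) tick, return the sum over that symbol's last `size` prices."""
    # One bounded deque per symbol; deque(maxlen=...) drops the oldest entry automatically.
    windows = defaultdict(lambda: deque(maxlen=size))
    sums = []
    for symbol, price in ticks:
        windows[symbol].append(price)
        sums.append((symbol, sum(windows[symbol])))
    return sums
```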

A length window of 1 is equivalent to the last event window lastevent. The lastevent data window is the preferred notation:

select * from StockTickEvent#lastevent	// Prefer this
// ... equivalent to ...
select * from StockTickEvent#length(1)

13.3.2. Length Batch Window (`length_batch` or `win:length_batch`)

This window buffers events (tumbling window) and releases them when a given minimum number of events has been collected. Provide an expression defining the number of events to batch as a parameter:

length_batch(size_expression)

The next statement buffers events until a minimum of 10 events have been collected. Listeners to updates posted by this window receive updated information only when 10 or more events have been collected.

select * from StockTickEvent#length_batch(10)
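
The tumbling behavior of a length batch can be sketched as follows. This Python model is illustrative only and length_batch is a hypothetical function name: events are buffered, a batch is released as soon as the given number of events has been collected, and any remaining events stay buffered.

```python
def length_batch(events, size):
    """Split events into released batches of `size`; return (batches, still_buffered)."""
    batches = []
    buffered = []
    for event in events:
        buffered.append(event)
        if len(buffered) == size:
            batches.append(buffered)  # release the full batch in one update
            buffered = []
    return batches, buffered
```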

13.3.3. Time Window (`time` or `win:time`)

This window is a moving (sliding) time window extending the specified time interval into the past based on the system time. Provide a time period (see Section 5.2.1, “Specifying Time Periods”) or an expression defining the number of seconds as a parameter:

time(time period)

time(seconds_interval_expression)

For the GE stock tick events in the last 1 second, calculate a sum of price.

select sum(price) from StockTickEvent(symbol='GE')#time(1 sec)
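
As a rough model of what the statement computes at a given point in time, the Python sketch below sums the prices of ticks that fall within the interval ending at the current time. It is not the runtime's implementation: time_window_sum is a hypothetical name, times are millisecond values, and the treatment of a tick that is exactly one interval old (excluded here) is an assumption.

```python
def time_window_sum(ticks, now_ms, interval_ms):
    """Sum prices of (timestamp_ms, price) ticks inside the window ending at now_ms."""
    return sum(price for ts, price in ticks if now_ms - interval_ms < ts <= now_ms)
```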

The following time windows are equivalent specifications:

time(2 minutes 5 seconds)
time(125 sec)
time(125)
time(MYINTERVAL)  // MYINTERVAL defined as a variable

13.3.4. Externally-timed Window (`ext_timed` or `win:ext_timed`)

Similar to the time window, this window is a moving (sliding) time window extending the specified time interval into the past, but based on the long-type time value supplied by a timestamp expression. The window takes two parameters: the expression to return long-typed timestamp values, and a time period or expression that provides a number of seconds:

ext_timed(timestamp_expression, time_period)

ext_timed(timestamp_expression, seconds_interval_expression)

The key difference comparing the externally-timed window to the regular time window is that the window slides not based on the runtime time, but strictly based on the result of the timestamp expression when evaluated against the events entering the window.

The algorithm underlying the window compares the timestamp value returned by the expression when the oldest event arrived with the timestamp value returned by the expression for the newest arriving event on event arrival. If the time interval between the timestamp values is larger than the time period parameter, then the algorithm removes the oldest events, tail-first, until the difference between the oldest and newest event is within the time interval. The window therefore slides only when events arrive and only considers each event's timestamp property (or other expression value returned) and not runtime time.

This window holds stock tick events of the last 10 seconds based on the timestamp property in StockTickEvent.

select * from StockTickEvent#ext_timed(timestamp, 10 seconds)
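
The expiry algorithm described above can be sketched in a few lines of Python. This is a simplified model and not the runtime's code: ExtTimedWindow is a hypothetical name, timestamps are millisecond values assumed to arrive in order, and an event whose age equals the interval exactly is kept, which follows the wording "larger than" and is an assumption about the boundary.

```python
from collections import deque


class ExtTimedWindow:
    """Sliding window driven by event timestamps rather than by runtime time."""

    def __init__(self, interval_ms):
        self.interval_ms = interval_ms
        self.events = deque()  # (timestamp, event) pairs, oldest first

    def add(self, timestamp, event):
        """Add an event and return the events that expire as a result."""
        self.events.append((timestamp, event))
        expired = []
        # Remove oldest events until oldest and newest are within the interval.
        while timestamp - self.events[0][0] > self.interval_ms:
            expired.append(self.events.popleft()[1])
        return expired
```

Note that nothing expires between calls to add, which mirrors the statement that the window slides only when events arrive.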

The externally-timed data window expects strict ordering of the timestamp values returned by the timestamp expression. The window is not useful for ordering events in time order; please use the time-order window instead.

On a related subject, runtime time itself can be entirely under control of the application as described in Section 15.9, “Controlling Time-Keeping”, allowing control over all time-based aspects of processing in one place.

13.3.5. Time batch Window (`time_batch` or `win:time_batch`)

This window buffers events (tumbling window) and releases them every specified time interval in one update. The window takes a time period or an expression providing a number of seconds as a parameter, plus optional parameters described next.

time_batch(time_period [,optional_reference_point] [,flow_control])

time_batch(seconds_interval_expression [,optional_reference_point] [,flow_control])

The time batch window takes a second, optional parameter that serves as a reference point to batch flush times. If not specified, the arrival of the first event into the batch window sets the reference point. Therefore if the reference point is not specified and the first event arrives at time t₁, then the batch flushes at time t₁ plus time_period and every time_period thereafter.

Note

Please see Section 13.2, “A Note on Batch Windows” for information on what a batch window is and how best to compute over intervals.

Note that using this window means that the runtime keeps events in memory until the time is up: Consider your event arrival rate and determine if this is the behavior you want. Use context declaration or output rate limiting such as output snapshot as an alternative.

The below example batches events into a 5 second window releasing new batches every 5 seconds. Listeners to updates posted by this window receive updated information only every 5 seconds.

select * from StockTickEvent#time_batch(5 sec)

By default, if there are no events arriving in the current interval (insert stream), and no events remain from the prior batch (remove stream), then the window does not post results to listeners. The window allows overriding this default behavior via flow control keywords.

The synopsis with flow control parameters is:

time_batch(time_period or seconds_interval_expr [,optional_reference_point] 
    [, "flow-control-keyword [, keyword...]"] )

The FORCE_UPDATE flow control keyword instructs the window to post an empty result set to listeners if there is no data to post for an interval. When using this keyword the irstream keyword should be used in the select clause to ensure the remove stream is also output. Note that FORCE_UPDATE is for use with listeners to the same statement and not for use with named windows. Consider output rate limiting instead.

The START_EAGER flow control keyword instructs the window to post empty result sets even before the first event arrives, starting a time interval at statement deployment time. As when using FORCE_UPDATE, the window also posts an empty result set to listeners if there is no data to post for an interval, however it starts doing so at time of statement deployment rather than at the time of arrival of the first event.

Taking the two flow control keywords in one sample statement, this example presents a window that waits for 10 seconds. It posts an empty result set one interval after the statement gets deployed and keeps posting an empty result set for each interval in which no events arrive:

select * from MyEvent#time_batch(10 sec, "FORCE_UPDATE, START_EAGER")

The optional reference point is provided as a long-value of milliseconds (or microseconds for microsecond runtime time unit) relative to January 1, 1970 and time 00:00:00.

The following example statement sets the reference point to 5 seconds and the batch size to 1 hour, so that each batch output is 5 seconds after each hour:

select * from OrderSummaryEvent#time_batch(1 hour, 5000L)
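
The way a reference point aligns flush times can be worked through with simple arithmetic. The Python sketch below is a model of the alignment only and not the runtime's scheduling code. The function name next_flush is hypothetical, all values are milliseconds, and the sketch assumes that flushes occur at the reference point plus whole multiples of the interval.

```python
def next_flush(now_ms, interval_ms, reference_ms):
    """Return the first flush time strictly after now_ms."""
    elapsed = now_ms - reference_ms
    # Floor division also handles a current time that lies before the reference point.
    periods = elapsed // interval_ms + 1
    return reference_ms + periods * interval_ms
```

With a reference point of 5000 and an interval of 3600000 (1 hour), every result is 5 seconds past a whole hour. When no reference point is given, passing the arrival time of the first event as reference_ms models a first flush at that time plus the interval.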

13.3.6. Externally-timed Batch Window (`ext_timed_batch` or `win:ext_timed_batch`)

Similar to the time batch window, this window buffers events (tumbling) and releases them every specified time interval in one update, but based on the long-type time value supplied by a timestamp expression. The window has two required parameters taking an expression that returns long-typed timestamp values and a time period or constant-value expression that provides a number of seconds:

ext_timed_batch(timestamp_expression, time_period [,optional_reference_point])

ext_timed_batch(timestamp_expression, seconds_interval_expression [,optional_reference_point])

The externally-timed batch window takes a third, optional parameter that serves as a reference point to batch flush times. If not specified, the arrival of the first event into the batch window sets the reference point. Therefore if the reference point is not specified and the first event arrives at time t₁, then the batch flushes at time t₁ plus time_period and every time_period thereafter.

The key difference comparing the externally-timed batch window to the regular time batch window is that the window tumbles not based on the runtime time, but strictly based on the result of the timestamp expression when evaluated against the events entering the window.

The algorithm underlying the window compares the timestamp value returned by the expression when the oldest event arrived with the timestamp value returned by the expression for the newest arriving event on event arrival. If the time interval between the timestamp values is larger than the time period parameter, then the algorithm posts the current batch of events. The window therefore posts batches only when events arrive and only considers each event's timestamp property (or other expression value returned) and not runtime time.

The below example batches events into a 5 second window releasing new batches every 5 seconds. Listeners to updates posted by this window receive updated information only when events arrive with timestamps that indicate the start of a new batch:

select * from StockTickEvent#ext_timed_batch(timestamp, 5 sec)
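
A simplified model of this tumbling behavior is sketched below in Python. It is not the runtime's implementation: ext_timed_batches is a hypothetical name, events are (timestamp_ms, payload) pairs assumed to arrive in timestamp order, the first event's timestamp serves as the reference point, and an event whose timestamp lies exactly on a batch boundary is assumed to start the next batch.

```python
def ext_timed_batches(events, interval_ms):
    """Return (posted_batches, current_batch) for timestamp-ordered events."""
    posted = []
    current = []
    batch_start = None
    for ts, payload in events:
        if batch_start is None:
            batch_start = ts  # the first event sets the reference point
        if ts - batch_start >= interval_ms:
            # The arriving event belongs to a later batch: post the current one.
            posted.append(current)
            current = []
            # Advance the boundary by whole intervals so that batches stay aligned.
            batch_start += ((ts - batch_start) // interval_ms) * interval_ms
        current.append(payload)
    return posted, current
```

As in the text above, a batch is posted only when a later event arrives, so the last batch stays buffered until then.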

The optional reference point is provided as a long-value of milliseconds (or microseconds) relative to January 1, 1970 and time 00:00:00.

The following example statement sets the reference point to 5 seconds and the batch size to 1 hour, so that each batch output is 5 seconds after each hour:

select * from OrderSummaryEvent#ext_timed_batch(timestamp, 1 hour, 5000L)

13.3.7. Time-Length Combination Batch Window (`time_length_batch` or `win:time_length_batch`)

This data window is a combination of time and length batch (tumbling) windows. Similar to the time and length batch windows, this batches events and releases the batched events when either one of the following conditions occurs, whichever occurs first: the data window has collected a given number of events, or a given time interval has passed.

The parameters take 2 forms. The first form accepts a time period or an expression providing a number of seconds, and an expression for the number of events:

time_length_batch(time_period, number_of_events_expression)

time_length_batch(seconds_interval_expression, number_of_events_expression)

The next example shows a time-length combination batch window that batches up to 100 events or all events arriving within a 1-second time interval, whichever condition occurs first:

select * from MyEvent#time_length_batch(1 sec, 100)

In this example, if 100 events arrive into the window before a 1-second time interval passes, the window posts the batch of 100 events. If fewer than 100 events arrive within a 1-second interval, the window posts all events that arrived within the 1-second interval at the end of the interval.
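
The "whichever occurs first" rule can be modeled with a short simulation. The Python sketch below is a simplification and not the runtime's implementation: time_length_batches is a hypothetical name, arrivals are (time, event) pairs in time order, and the sketch assumes that the time interval of a batch starts with the first event of that batch.

```python
def time_length_batches(arrivals, interval, size):
    """Return (released, buffered), where released holds (release_time, batch) pairs."""
    released = []
    batch = []
    deadline = None
    for t, event in arrivals:
        # Time condition: the interval ran out before this event arrived.
        if batch and t >= deadline:
            released.append((deadline, batch))
            batch = []
        if not batch:
            deadline = t + interval  # assumption: the interval starts with the batch's first event
        batch.append(event)
        # Length condition: the batch reached the given number of events.
        if len(batch) == size:
            released.append((t, batch))
            batch = []
    return released, batch
```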

By default, if there are no events arriving in the current interval (insert stream), and no events remain from the prior batch (remove stream), then the window does not post results to listeners. This window allows overriding this default behavior via flow control keywords.

The synopsis of the window with flow control parameters is:

time_length_batch(time_period or seconds_interval_expression, number_of_events_expression, 
    "flow control keyword [, keyword...]")

The FORCE_UPDATE flow control keyword instructs the window to post an empty result set to listeners if there is no data to post for an interval. The window begins posting no later than one time interval after the first event arrives. When using this keyword, use the irstream keyword in the select clause to ensure the remove stream is also output.

The START_EAGER flow control keyword instructs the window to post empty result sets even before the first event arrives, starting a time interval at statement deployment time. As when using FORCE_UPDATE, the window also posts an empty result set to listeners if there is no data to post for an interval; however, it starts doing so at the time of statement deployment rather than at the time of arrival of the first event.

Combining the two flow control keywords in one sample statement, this example presents a window that waits for 10 seconds or reacts when the 5th event arrives, whichever comes first. It posts an empty result set one interval after the statement gets deployed and keeps posting an empty result set for each interval during which no events arrive:

 select * from MyEvent#time_length_batch(10 sec, 5, "FORCE_UPDATE, START_EAGER")

13.3.8. Time-Accumulating Window (`time_accum` or `win:time_accum`)

This data window is a specialized moving (sliding) time window that differs from the regular time window in that it accumulates events until no more events arrive within a given time interval, and only then releases the accumulated events as a remove stream.

The window accepts a single parameter: the time period or seconds-expression specifying the length of the time interval during which no events must arrive until the window releases accumulated events. The synopsis is as follows:

time_accum(time_period)

time_accum(seconds_interval_expression)

The next example shows a time-accumulating window that accumulates events, and then releases events if within the time interval no more events arrive:

 select * from MyEvent#time_accum(10 sec)

This example accumulates events until no more MyEvent events arrive for a period of 10 seconds, at which time it posts all accumulated MyEvent events.
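
The accumulate-until-quiet behavior can be sketched as splitting a sequence of arrivals wherever the gap between two arrivals reaches the time interval. This is a plain-Python illustration, not Esper code; the name `time_accum_sketch` is hypothetical, and treating a gap of exactly one interval as a release is an assumption.

```python
def time_accum_sketch(arrivals, quiet_ms):
    """Split sorted (arrival_ms, payload) pairs into released batches.

    Returns a list of (release_time_ms, payloads) tuples.
    """
    released = []
    batch = []
    last_arrival = None
    for ts, payload in arrivals:
        if batch and ts - last_arrival >= quiet_ms:
            # No event arrived for quiet_ms after the last one: release.
            released.append((last_arrival + quiet_ms, batch))
            batch = []
        batch.append(payload)
        last_arrival = ts
    if batch:
        # The final batch leaves once the quiet period elapses.
        released.append((last_arrival + quiet_ms, batch))
    return released
```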

Your application may only be interested in the batches of events as events leave the data window. This can be done simply by selecting the remove stream of this data window, populated by the runtime as accumulated events leave the data window all-at-once when no events arrive during the time interval following the time the last event arrived:

 select rstream * from MyEvent#time_accum(10 sec)

If there are no events arriving, then the window does not post results to listeners.

13.3.9. Keep-All Window (`keepall` or `win:keepall`)

This keep-all data window simply retains all events. The window does not remove events from the data window, unless used with a named window and the on delete clause.

The window accepts no parameters. The synopsis is as follows:

keepall

The next example shows a keep-all window that accumulates all events received into the window:

 select * from MyEvent#keepall

Note that since the window does not release events, care must be taken to prevent retained events from using all available resources.

13.3.10. First Length Window (`firstlength` or `win:firstlength`)

The firstlength window retains the very first size_expression events.

The synopsis is:

firstlength(size_expression)

If used within a named window and an on-delete clause deletes events, the window accepts further arriving events until the number of retained events reaches the size of size_expression.

The below example creates a window that retains only the first 10 events:

select * from MyEvent#firstlength(10)
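
The retention rule, including the effect of a delete, can be sketched in a few lines of plain Python. This is an illustration of the semantics only and not Esper code; the class name `FirstLengthSketch` and its methods are hypothetical.

```python
class FirstLengthSketch:
    """Retains the first `size` events; later events are ignored."""

    def __init__(self, size):
        self.size = size
        self.events = []

    def insert(self, event):
        """Return True if the event was retained, False if it was ignored."""
        if len(self.events) < self.size:
            self.events.append(event)
            return True
        return False

    def delete(self, event):
        """Stand-in for an on-delete: frees a slot for a later arrival."""
        self.events.remove(event)
```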

13.3.11. First Time Window (`firsttime` or `win:firsttime`)

The firsttime window retains all events arriving within a given time interval after statement start.

The synopsis is:

firsttime(time_period)

firsttime(seconds_interval_expression)

The below example creates a window that retains only those events arriving within 1 minute and 10 seconds of statement start:

select * from MyEvent#firsttime(1 minute 10 seconds)

13.3.12. Expiry Expression Window (`expr` or `win:expr`)

The expr data window applies an expiry expression and removes events from the data window when the expression returns false.

Use this window to implement rolling and dynamically shrinking or expanding time, length or other windows. Rolling can, for example, be controlled based on event properties of arriving events, based on aggregation values or based on the return result of user-defined functions. Use this window to accumulate events until a value changes or other condition occurs based on arriving events or change of a variable value.

The synopsis is:

expr(expiry_expression)

The expiry expression can be any expression including expressions on event properties, variables, aggregation functions or user-defined functions. The window applies this expression to the oldest event(s) currently in the window, as described next.

When a new event arrives or when a variable value referenced by the expiry expression changes then the window applies the expiry expression starting from the oldest event in the data window. If the expiry expression returns false for the oldest event, the window removes the event from the data window. The window then applies the expression to the next oldest event. If the expiry expression returns true for the oldest event, no further evaluation takes place and the window indicates any new and expired events through insert and remove stream.

By using variables in the expiry expression it is possible to change the behavior of the window dynamically at runtime. When one or more variables used in the expression are updated the window evaluates the expiry expression starting from the oldest event.

Aggregation functions, if present in the expiry expression, are continuously updated as events enter and leave the data window. Use the grouped data window with this window to compute aggregations per group.

The runtime makes the following built-in properties available to the expiry expression:

Table 13.3. Built-in Properties of the Expiry Expression Data Window

Name	Type	Description
`current_count`	int	The number of events in the data window including the currently-arriving event.
`expired_count`	int	The number of events expired during this evaluation.
`newest_event`	(same event type as arriving events)	The last-arriving event itself.
`newest_timestamp`	long	The runtime timestamp associated with the last-arriving event.
`oldest_event`	(same event type as arriving events)	The currently-evaluated event itself.
`oldest_timestamp`	long	The runtime timestamp associated with the currently-evaluated event.
`view_reference`	Object	The object handle to this data window.

This EPL declares an expiry expression that retains the last 2 events:

select * from MyEvent#expr(current_count <= 2)
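
The evaluation order described earlier in this section (start at the oldest event, remove while the expression returns false, stop at the first true) can be sketched as follows. This is plain Python and not Esper code; the function name and the three-argument form of the expression are assumptions made for illustration, standing in for the built-in properties oldest_event, newest_event and current_count.

```python
from collections import deque

def expr_window_insert(window, event, keep):
    """Add `event`, then expire from the oldest end while keep(...) is false.

    window: deque of events, oldest first.
    keep(oldest_event, newest_event, current_count) -> bool
    Returns the list of events expired by this arrival.
    """
    window.append(event)
    expired = []
    while window and not keep(window[0], event, len(window)):
        expired.append(window.popleft())
    return expired
```

Passing `lambda oldest, newest, count: count <= 2` as the expression retains the last 2 events, as in the statement above.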

The following example implements a dynamically-sized length window by means of a SIZE variable. As the SIZE variable value changes the window retains the number of events according to the current value of SIZE:

create variable int SIZE = 1000

select * from MyEvent#expr(current_count <= SIZE)

The next EPL retains the last 2 seconds of events:

select * from MyEvent#expr(oldest_timestamp > newest_timestamp - 2000)

The following example implements a dynamically-sized time window. As the SIZE long-type variable value changes the window retains a time interval accordingly:

create variable long SIZE = 1000

select * from MyEvent#expr(newest_timestamp - oldest_timestamp < SIZE)

The following example declares a KEEP variable and flushes all events from the data window when the variable turns false:

create variable boolean KEEP = true

select * from MyEvent#expr(KEEP)

The next example specifies a rolling window that removes the oldest events from the window until the total price of all events in the window is less than 1000:

select * from MyEvent#expr(sum(price) < 1000)

This example retains all events that have the same value of the flag event property. When the flag value changes, the data window expires all events with the old flag value and retains only the most recent event of the new flag value:

select * from MyEvent#expr(newest_event.flag = oldest_event.flag)

13.3.12.1. Limitations

You may not use subqueries or the prev and prior functions as part of the expiry expression. Consider using a named window and on-delete or on-merge instead.

When using variables in the expiry expression, the thread that updates the variable does not evaluate the window. Instead, the thread that updates the variable schedules a reevaluation, and the window evaluates by timer execution.

13.3.13. Expiry Expression Batch Window (`expr_batch` or `win:expr_batch`)

The expr_batch window buffers events (tumbling window) and releases them when a given expiry expression returns true.

Use this window to implement dynamic or custom batching behavior, such as for dynamically shrinking or growing time, length or other batches, for batching based on event properties of arriving events, aggregation values or for batching based on a user-defined function.

The synopsis is:

expr_batch(expiry_expression, [include_triggering_event])

The optional second parameter include_triggering_event defines whether to include the event that triggers the batch in the current batch (true, the default) or in the next batch (false).

When a new event arrives or when a variable value referenced by the expiry expression changes or when events get removed from the data window then the window applies the expiry expression. If the expiry expression returns true the data window posts the collected events as the insert stream and the last batch of events as remove stream.

Aggregation functions, if present in the expiry expression, are continuously updated as events enter the data window and reset when the runtime posts a batch of events. Use the grouped data window with this window to compute aggregations per group.

The compiler makes the following built-in properties available to the expiry expression:

Table 13.4. Built-in Properties of the Expiry Expression Data Window

Name	Type	Description
`current_count`	int	The number of events in the data window including the currently-arriving event.
`newest_event`	(same event type as arriving events)	The last-arriving event itself.
`newest_timestamp`	long	The runtime timestamp associated with the last-arriving event.
`oldest_event`	(same event type as arriving events)	The currently-evaluated event itself.
`oldest_timestamp`	long	The runtime timestamp associated with the currently-evaluated event.
`view_reference`	Object	The object handle to this window.

This EPL declares an expiry expression that posts event batches consisting of 2 events:

select * from MyEvent#expr_batch(current_count >= 2)
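
The effect of the include_triggering_event parameter is easiest to see in a small simulation. The sketch below is plain Python, not Esper code; the function name and the three-argument expression are hypothetical, the expression is always evaluated with the arriving event counted, and it is an assumption that nothing is posted when the buffer is empty.

```python
def expr_batch_sketch(events, should_post, include_triggering_event=True):
    """Return (posted_batches, remaining_buffer).

    should_post(oldest_event, newest_event, current_count) -> bool
    """
    batches, buffer = [], []
    for event in events:
        candidate = buffer + [event]
        if should_post(candidate[0], event, len(candidate)):
            if include_triggering_event:
                batches.append(candidate)   # trigger joins the current batch
                buffer = []
            else:
                if buffer:                  # assumption: no empty batches
                    batches.append(buffer)
                buffer = [event]            # trigger opens the next batch
        else:
            buffer = candidate
    return batches, buffer
```

With `lambda oldest, newest, count: count >= 2` the sketch posts batches of 2 events, matching the statement above.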

The following example implements a dynamically-sized length batch window by means of a SIZE variable. As the SIZE variable value changes the window accumulates and posts the number of events according to the current value of SIZE:

create variable int SIZE = 1000

select * from MyEvent#expr_batch(current_count >= SIZE)

The following example accumulates events until an event arrives that has a value of postme for property myvalue:

select * from MyEvent#expr_batch(myvalue = 'postme')

The following example declares a POST variable and posts a batch of events when the variable turns true:

create variable boolean POST = false

select * from MyEvent#expr_batch(POST)

The next example specifies a tumbling window that posts a batch of events when the total price of all events in the window is greater than 1000:

select * from MyEvent#expr_batch(sum(price) > 1000)

Specify the second parameter as false when you want the triggering event not included in the current batch.

This example batches all events that have the same value of the flag event property. When the flag value changes, the data window releases the batch of events collected for the old flag value. The data window collects the most recent event and the future arriving events of the same new flag value:

select * from MyEvent#expr_batch(newest_event.flag != oldest_event.flag, false)

13.3.13.1. Limitations

You may not use subqueries or the prev and prior functions as part of the expiry expression. Consider using a named window and on-delete or on-merge instead.

13.3.14. Unique Window (`unique` or `std:unique`)

The unique window is a window that includes only the most recent among events having the same value(s) for the result of the specified expression or list of expressions.

The synopsis is:

unique(unique_expression [, unique_expression ...])

The window acts as a length window of size 1 for each distinct value returned by an expression, or combination of values returned by multiple expressions. It thus posts as old events the prior event of the same value(s), if any.

An expression may return a null value. The compiler treats a null value as any other value. An expression can also return a custom application object, whereby the application class should implement the hashCode and equals methods.

The below example creates a window that retains only the last event per symbol.

select * from StockTickEvent#unique(symbol)
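
Conceptually the window behaves like a map from the unique key to the most recent event, where each arrival displaces the prior event with the same key. A minimal plain-Python sketch of that idea, which is not Esper code and uses the hypothetical name `unique_insert`:

```python
def unique_insert(window, event, key):
    """Store `event` under its key and return (new_event, old_event).

    window: dict mapping key value -> most recent event.
    old_event is None when no prior event had the same key.
    """
    k = key(event)
    old_event = window.get(k)
    window[k] = event
    return event, old_event
```

A key function that returns a tuple, such as `lambda e: (e["symbol"], e["feed"])`, models uniqueness over a combination of values.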

The next example creates a window that retains the last event per symbol and feed.

select * from StockTickEvent#unique(symbol, feed)

When using unique the compiler plans statements applying an implicit unique index, where applicable. Specify @Hint('disable_unique_implicit_idx') to force the compiler to plan statements using a non-unique index.

13.3.15. Grouped Data Window (`groupwin` or `std:groupwin`)

Specifying #groupwin groups events into sub-data-windows by the value returned by the specified expression or the combination of values returned by a list of expressions. The #groupwin takes a single expression to supply the group criteria values, or a list of expressions as parameters, as the synopsis shows:

groupwin(grouping_expression [, grouping_expression ...])

The grouping_expression expression(s) return one or more group keys, by which it creates a separate data window for each distinct group key. Note that the expression should not return an unlimited number of values: the grouping expression should not return a time value or otherwise unlimited key.

An expression may return a null value. The runtime treats a null value as any other value. An expression can also return a custom application object, whereby the application class should implement the hashCode and equals methods.

You can specify a single groupwin per stream. Multiple groupwin declarations for the same stream are not allowed.

Use group by instead of the grouped data window to control how aggregations are grouped.

A grouped data window with a length window of 1 is equivalent to the unique data window unique. The unique data window is the preferred notation:

select * from StockTickEvent#unique(symbol)	// Prefer this
// ... equivalent to ...
select * from StockTickEvent#groupwin(symbol)#length(1)

This example retains the last 5 events per symbol and computes the total price over all retained events (since no group by clause is specified, the aggregation is across all symbols):

select symbol, sum(price) from StockTickEvent#groupwin(symbol)#length(5)

The @Hint("reclaim_group_aged=age_in_seconds") hint instructs the runtime to discard grouped data window state that has not been updated for age_in_seconds seconds. The optional @Hint("reclaim_group_freq=sweep_frequency_in_seconds") can be specified in addition to control the frequency at which the runtime sweeps data window state. If the hint is not specified, the frequency defaults to the same value as age_in_seconds. Use the hints when your group criteria returns a changing or unlimited number of values. By default and without hints the data window does not reclaim or remove data windows for group criteria values.

The updated sample statement with both hints:

// Remove data window for symbols not updated for 10 seconds or more and sweep every 30 seconds
@Hint('reclaim_group_aged=10,reclaim_group_freq=30')
select symbol, sum(price) from StockTickEvent#groupwin(symbol)#length(5)

Reclaim executes when an event arrives and not in the timer thread. In the example above reclaim can occur up to 40 seconds of runtime time after the newest event arrives. Reclaim may affect iteration order for the statement, and iteration order becomes nondeterministic with reclaim.

To compute the total price over the last 5 events per symbol and output a total per symbol, add the group by clause:

select symbol, sum(price) from StockTickEvent#groupwin(symbol)#length(5) group by symbol

The groupwin grouped-window can also take multiple expressions that provide values to group by. This example computes the total price for each symbol and feed for the last 10 events per symbol and feed combination:

select sum(price) from StockTickEvent#groupwin(symbol, feed)#length(10)

The order in which the groupwin grouped-window appears controls the data the runtime derives from events for each group. The next 2 statements demonstrate this using a length window.

Without the groupwin declaration the same statement returns the total price for only the last 10 events across all symbols. Here the runtime allocates only one length window for all events:

select sum(price) from StockTickEvent#length(10)

We have learned that by placing the groupwin grouped-window before other data windows, these other data windows become part of the grouped set of windows. The runtime dynamically allocates a new window instance every time it encounters a new group key, such as a new value for symbol. Therefore, in groupwin(symbol)#length(10) the runtime allocates a new length window for each distinct symbol. However, in length(10) alone the runtime maintains a single length window.
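
The difference between one length window per group key and a single length window can be sketched in plain Python. This is an illustration of the allocation behavior only, not Esper code; both function names are hypothetical, and each returns the ungrouped total price over whatever the window or windows retain.

```python
from collections import defaultdict, deque

def grouped_length_sum(ticks, size):
    """One length window per symbol, allocated on first sight of the key."""
    windows = defaultdict(lambda: deque(maxlen=size))
    for symbol, price in ticks:
        windows[symbol].append(price)
    return sum(sum(window) for window in windows.values())

def single_length_sum(ticks, size):
    """A single length window shared by all symbols."""
    window = deque(maxlen=size)
    for _symbol, price in ticks:
        window.append(price)
    return sum(window)
```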

The groupwin can be used with multiple data windows to achieve a grouped intersection or union policy.

The next statement retains the last 4 events per symbol and only those events that are also not older than 10 seconds:

select * from StockTickEvent#groupwin(symbol)#length(4)#time(10)

Last, consider a grouped data window for two group criteria. Here, the statement results are total price per symbol and feed for the last 100 events per symbol and feed.

select sum(price) from StockTickEvent#groupwin(symbol, feed)#length(100)

Note

A note on grouped time windows: When using grouped-window with time windows, note that whether the runtime retains 5 minutes of events or retains 5 minutes of events per group, the result is the same from the perspective of retaining events as both policies retain, considering all groups, the same set of events. Therefore please specify the time window alone (ungrouped).

For example:

// Use this:
select sum(price) from StockTickEvent#time(1 minute)

// is equivalent to (don't use this):
// select sum(price) from StockTickEvent#groupwin(symbol)#time(1 minute)

// Use the group-by clause for grouping aggregation by symbol.

For advanced users: There is an optional declaration that can control how the groupwin grouped-window gets evaluated and that is #merge. The merge can only occur after a groupwin grouped-window. It controls the end of the grouped declaration.

Compare the following statements:

select * from Market#groupwin(ticker)#length(1000000)
    #weighted_avg(price, volume)#merge(ticker)
// ... and ...
select * from Market#groupwin(ticker)#length(1000000)#merge(ticker)
    #weighted_avg(price, volume)

If your statement does not specify the optional #merge, the semantics are the same as the first statement.

The first statement, in which the #merge is added to the end (same as no merge), computes the weighted average per ticker, considering, per ticker, the last 1M Market events. The second statement, in which the merge is added to the middle, computes the weighted average considering, per ticker, the last 1M Market events, computing the weighted average for all such events using a single data window rather than multiple data window instances with one window per ticker.

13.3.16. Last Event Window (`lastevent` or `std:lastevent`)

This window exposes the last element:

lastevent

The window acts as a length window of size 1. It thus posts as old events the prior event in the stream, if any.

This example statement retains the last stock tick event for the symbol GE.

select * from StockTickEvent(symbol='GE')#lastevent

If you want to output the last event within a sliding window, please see Section 10.1.12, “The Previous Function”. That function accepts a relative (count) or absolute index and returns event properties or an event in the context of the specified data window.

13.3.17. First Event Window (`firstevent` or `std:firstevent`)

This window retains only the first arriving event:

firstevent

All events arriving after the first event are discarded.

If used within a named window and an on-delete clause deletes the first event, the window resets and will retain the next arriving event.

An example of a statement that retains the first ReferenceData event arriving is:

select * from ReferenceData#firstevent

If you want to output the first event within a sliding window, please see Section 10.1.12, “The Previous Function”. That function accepts a relative (count) or absolute index and returns event properties or an event in the context of the specified data window.

13.3.18. First Unique Window (`firstunique` or `std:firstunique`)

The firstunique window retains only the very first among events having the same value for the specified expression or list of expressions.

The synopsis is:

firstunique(unique_expression [, unique_expression ...])

If used within a named window and an on-delete clause deletes events, the window resets and will retain the next arriving event for the expression result value(s) of the deleted events.

The below example creates a window that retains only the first event per category:

select * from ReferenceData#firstunique(category)

When using firstunique the compiler plans statements applying an implicit unique index, where applicable. Specify @Hint('disable_unique_implicit_idx') to force the compiler to plan statements using a non-unique index.

13.3.19. Sorted Window (`sort` or `ext:sort`)

This window sorts by values returned by the specified expression or list of expressions and keeps only the top (or bottom) events up to the given size.

This window retains all events in the stream that fall into the sort range. Use the ranked window as described next to retain events per unique key(s) and sorted.

The syntax is as follows:

sort(size_expression, 
    sort_criteria_expression [asc/desc][, sort_criteria_expression [asc/desc]...])

An expression may be followed by the optional asc or desc keywords to indicate that the values returned by that expression are sorted in ascending or descending sort order.

The window below retains only those events that have the highest 10 prices considering all events (and not only the last event per symbol, see rank below) and reports a total price:

select sum(price) from StockTickEvent#sort(10, price desc)

The following example sorts events first by price in descending order, and then by symbol name in ascending (alphabetical) order, keeping only the 10 events with the highest price (with ties resolved by alphabetical order of symbol).

select * from StockTickEvent#sort(10, price desc, symbol asc)
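
The retention rule of the last statement can be sketched as inserting the event, re-sorting by the sort criteria and dropping whatever falls beyond the size. This plain-Python sketch is not Esper code; the function name is hypothetical, and events are assumed to be dictionaries with `price` and `symbol` entries.

```python
def sort_window_insert(window, event, size):
    """Insert `event`, keep the top `size` events, return the dropped ones.

    Sort order: price descending, then symbol ascending.
    """
    window.append(event)
    window.sort(key=lambda e: (-e["price"], e["symbol"]))
    dropped = window[size:]
    del window[size:]
    return dropped
```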

The sorted window is often used with the prev, prevwindow or prevtail single-row functions to output properties of events at a certain position or to output the complete data window according to sort order.

Use the grouped window to retain a separate sort window for each group. For example, the windows groupwin(market)#sort(10, price desc) instruct the runtime to retain, per market, the highest 10 prices.

13.3.20. Ranked Window (`rank` or `ext:rank`)

This window retains only the most recent among events having the same value for the criteria expression(s), sorted by sort criteria expressions and keeps only the top events up to the given size.

This window is similar to the sorted window in that it keeps only the top (or bottom) events up to the given size, however the window also retains only the most recent among events having the same value(s) for the specified uniqueness expression(s).

The syntax is as follows:

rank(unique_expression [, unique_expression ...],
    size_expression, 
    sort_criteria_expression [asc/desc][, sort_criteria_expression [asc/desc]...])

Specify the expressions returning unique key values first. Then specify a constant value that is the size of the ranked window. Then specify the expressions returning sort criteria values. The sort criteria expressions may be followed by the optional asc or desc keywords to indicate that the values returned by that expression are sorted in ascending or descending sort order.

The window below retains only those events that have the highest 10 prices considering only the last event per symbol and reports a total price:

select sum(price) from StockTickEvent#rank(symbol, 10, price desc)
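
The combination of uniqueness and ranking can be sketched as a map from the unique key to the latest event, trimmed to the best-ranked entries after each arrival. This is a plain-Python illustration, not Esper code; the function name is hypothetical, and events are assumed to be dictionaries with `symbol` and `price` entries.

```python
def rank_insert(window, event, size):
    """Replace the prior event of the same symbol, then keep the top `size`.

    window: dict mapping symbol -> latest event.
    Returns the retained events ordered by price descending.
    """
    window[event["symbol"]] = event
    ranked = sorted(window.values(), key=lambda e: -e["price"])
    for loser in ranked[size:]:
        del window[loser["symbol"]]   # falls out of the ranked window
    return ranked[:size]
```

Summing the `price` of the returned events corresponds to the sum(price) selected by the statement above.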

The following example retains, for the last event per market and symbol, those events that sort by price and quantity ascending into the first 10 ranks:

select * from StockTickEvent#rank(market, symbol, 10, price, quantity)

The ranked window is often used with the prev, prevwindow or prevtail single-row functions to output properties of events at a certain position or to output the complete data window according to sort order.

This example outputs every 5 seconds the top 10 events according to price descending and considering only the last event per symbol:

select prevwindow(*) from StockTickEvent#rank(symbol, 10, price desc)
  output snapshot every 5 seconds limit 1  // need only 1 row

Use the grouped window to retain a separate rank for each group. For example, the windows groupwin(market)#rank(symbol, 10, price desc) instruct the runtime to retain, per market, the highest 10 prices considering the last event per symbol.

13.3.21. Time-Order Window (`time_order` or `ext:time_order`)

This window orders events that arrive out-of-order, using timestamp-values provided by an expression, and by comparing that timestamp value to runtime time.

The syntax for this window is as follows.

time_order(timestamp_expression, time_period)

time_order(timestamp_expression, seconds_interval_expression)

The first parameter to the window is the expression that supplies timestamp values. The timestamp is expected to be a long-typed value that denotes an event's time of consideration by the window (or other expression). This is typically the time of arrival. The second parameter is a number-of-seconds expression or the time period specifying the time interval that an arriving event should maximally be held, in order to consider older events arriving at a later time.

Since the window compares timestamp values to runtime time, the window requires that the timestamp values and runtime time both follow the same clock. Therefore, to the extent that the clocks that originated both timestamps differ, the window may produce inaccurate results.

As an example, the next statement uses the arrival_time property of MyTimestampedEvent events to order and release events by arrival time:

insert rstream into ArrivalTimeOrderedStream
select rstream * from MyTimestampedEvent#time_order(arrival_time, 10 sec)

In the example above, the arrival_time property holds a long-typed timestamp value. On arrival of an event, the runtime compares the timestamp value of each event to the tail-time of the window. The tail-time of the window is, in this example, 10 seconds before runtime time (continuously sliding). If the timestamp value indicates that the event is older than the tail-time of the time window, the event is released immediately in the remove stream. If the timestamp value indicates that the event is newer than the tail-time of the window, the window retains the event until runtime time moves such that the event timestamp is older than tail-time.

The example thus holds each arriving event in memory anywhere from zero seconds to 10 seconds, to allow for older events (considering arrival time timestamp) to arrive. In other words, the window holds an event with an arrival time equal to runtime time for 10 seconds. The window holds an event with an arrival time that is 2 seconds older than runtime time for 8 seconds. The window holds an event with an arrival time that is 10 or more seconds older than runtime time for zero seconds, and releases such (old) events immediately into the remove stream.

The insert stream of this sliding window consists of all arriving events. The remove stream of the window is ordered by timestamp value: The event that has the oldest timestamp value is released first, followed by the next newer events. Note the statement above uses the rstream keyword in both the insert into clause and the select clause to select ordered events only. It uses the insert into clause to make the ordered stream available for subsequent statements to use.
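
The hold times in the preceding paragraph follow from a single subtraction. The plain-Python sketch below, which is not Esper code and uses the hypothetical name `hold_seconds`, reproduces the three cases with all values expressed in seconds.

```python
def hold_seconds(event_timestamp, runtime_time, interval):
    """Seconds the window holds an event before releasing it.

    The event is released once its timestamp is `interval` seconds
    behind runtime time; events already that old are held for zero seconds.
    """
    age = runtime_time - event_timestamp
    return max(0, interval - age)
```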

It is up to your application to populate the timestamp property into your events or use a sensible expression that returns timestamp values for consideration by the window. The window also works well if you use externally-provided time via timer events.

13.3.22. Time-To-Live Window (`timetolive` or `ext:timetolive`)

This window retains events until runtime time reaches the value returned by the given timestamp expression.

The syntax for this window is as follows:

timetolive(timestamp_expression)

The only parameter to the window is the expression that supplies timestamp values. The timestamp is expected to be a long-typed value that denotes an event's time-to-live.

Since the window compares timestamp values to runtime time, the window requires that the timestamp values and runtime time are both following the same clock.

On arrival of an event, the runtime evaluates the timestamp expression and obtains a long-type timestamp. The runtime compares that timestamp value to runtime time:

If the timestamp is older than runtime time or the same as runtime time, the runtime releases the event immediately into the remove stream and does not retain the event at all.
If the timestamp value is newer than the runtime time, the data window retains the event until runtime time moves forward such that the timestamp is the same or older than runtime time.

As an example, the next statement uses the arrival_time property of MyTimestampedEvent events to release events by arrival time:

insert rstream into ArrivalTimeOrderedStream
select rstream * from MyTimestampedEvent#timetolive(arrival_time)

For example, assume runtime time is 8:00:00 (8 am).

If the arrival_time timestamp is 8:00:00 or older (such as 7:59:00), the data window does not retain the event at all, i.e. the runtime releases the event into the remove stream upon arrival.
If the arrival_time timestamp is after 8:00:00 the data window retains the event. For example, if the arrival_time timestamp is 8:02:00, the runtime retains the event until runtime time is 8:02:00 or newer.
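
The two cases above reduce to comparing the timestamp against runtime time. A minimal plain-Python sketch, not Esper code, with the hypothetical name `retain_seconds` and times expressed as seconds since midnight:

```python
def retain_seconds(timestamp, runtime_time):
    """Seconds the window retains an event whose time-to-live is `timestamp`.

    Zero means the event goes to the remove stream upon arrival.
    """
    return max(0, timestamp - runtime_time)

EIGHT_AM = 8 * 3600  # runtime time 8:00:00 as seconds since midnight
```

For a runtime time of 8:00:00, a timestamp of 7:59:00 or 8:00:00 yields zero, while a timestamp of 8:02:00 yields 120 seconds.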

The runtime evaluates the expression only once at the arrival of each event to determine that event's time-to-live.

The time-to-live data window is fully equivalent to the time-order data window with a zero value for the time period.
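The retain-or-release rules above can be sketched in plain Java. This is a minimal model for illustration only, not Esper code: the class TimeToLiveSketch, the TimedEvent type and the advanceTime method are hypothetical names, and time is represented as a plain long value.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.PriorityQueue;

/** Plain-Java model of the time-to-live rules above; this is not Esper code. */
final class TimeToLiveSketch {

    /** Hypothetical event that carries its own time-to-live timestamp. */
    static final class TimedEvent {
        final String id;
        final long timestamp;

        TimedEvent(String id, long timestamp) {
            this.id = id;
            this.timestamp = timestamp;
        }
    }

    // Retained events, ordered so that the earliest timestamp is released first.
    private final PriorityQueue<TimedEvent> retained =
        new PriorityQueue<>(Comparator.comparingLong((TimedEvent e) -> e.timestamp));
    private long runtimeTime;

    TimeToLiveSketch(long startTime) {
        this.runtimeTime = startTime;
    }

    /** Returns the events released to the remove stream by this arrival. */
    List<TimedEvent> onArrival(TimedEvent event) {
        List<TimedEvent> released = new ArrayList<>();
        if (event.timestamp <= runtimeTime) {
            released.add(event);   // same as or older than runtime time: never retained
        } else {
            retained.add(event);   // newer: retained until runtime time catches up
        }
        return released;
    }

    /** Moves runtime time forward and returns the events whose timestamp is reached. */
    List<TimedEvent> advanceTime(long newTime) {
        runtimeTime = newTime;
        List<TimedEvent> released = new ArrayList<>();
        while (!retained.isEmpty() && retained.peek().timestamp <= runtimeTime) {
            released.add(retained.poll());
        }
        return released;
    }

    int retainedCount() {
        return retained.size();
    }
}
```

In this model the timestamp is read once, on arrival, which mirrors the statement that the runtime evaluates the expression only once per event.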

13.4. Special Derived-Value Windows

The derived-value windows can be used combined with data windows or alone. Very similar to aggregation functions, these windows aggregate or derive information from an event stream. As compared to aggregation functions, derived-value windows can post multiple derived fields all-in-one including properties from the last event that was received. The derived fields and event properties are available for querying in the where-clause and are often compared to prior values using the prior function. Derived-value windows do not retain events.

13.4.1. Size Derived-Value Window (`size` or `std:size`)

This window posts the number of events received from a stream or window plus any additional event properties or expression values listed as parameters. The synopsis is:

size([expression, ...] [ * ])

The window posts a single long-typed property named size. The window posts the prior size as old data, and the current size as new data to update listeners of the window. Via the iterator method of the statement the size value can also be polled (read). The window posts output events only when the size changes.

As optional parameters the window takes a list of expressions that the window evaluates against the last arriving event and provides along with the size field. You may also provide the * wildcard selector to have the window output all event properties.

An alternative to receiving a data window event count is the prevcount function. Compared to the size window the prevcount function requires a data window while the size window does not. The related count(...) aggregation function provides a count per group when used with group by.

When combined with a data window, the size window reports the current number of events in the data window in the insert stream and the prior number of events in the data window as the remove stream. This example reports the number of tick events within the last 1 minute:

select size from StockTickEvent#time(1 min)#size

To select additional event properties you may add each event property to output as a parameter to the window.

The next example selects the symbol and feed event properties in addition to the size property:

select size, symbol, feed from StockTickEvent#time(1 min)#size(symbol, feed)

This example selects all event properties in addition to the size property:

select * from StockTickEvent#time(1 min)#size(*)

The size window is also useful in conjunction with a groupwin grouped-window to count the number of events per group. The EPL below returns the number of events per symbol.

select size from StockTickEvent#groupwin(symbol)#size

When used without a data window, the window simply counts the number of events:

select size from StockTickEvent#size

All windows can be used with pattern statements as well. The next EPL snippet shows a pattern that looks for tick events followed by trade events for the same symbol. The size window counts the number of occurrences of the pattern.

select size from pattern[every s=StockTickEvent -> TradeEvent(symbol=s.symbol)]#size

13.4.2. Univariate Statistics Derived-Value Window (`uni` or `stat:uni`)

This window calculates univariate statistics on a numeric expression. The window takes a single value expression as a parameter plus any number of optional additional expressions to return properties of the last event. The value expression must return a numeric value:

uni(value_expression [,expression, ...] [ * ])

After the value expression you may optionally list additional expressions or event properties to evaluate for the stream and return their value based on the last arriving event. You may also provide the * wildcard selector to have the window output all event properties.

Table 13.5. Univariate Statistics Derived Properties

Property Name	Description
`datapoints`	Number of values, equivalent to `count(*)` for the stream
`total`	Sum of values
`average`	Average of values
`variance`	Variance
`stddev`	Sample standard deviation (square root of variance)
`stddevpa`	Population standard deviation

The below example selects the standard deviation on price for stock tick events for the last 10 events.

select stddev from StockTickEvent#length(10)#uni(price)

To add properties from the event stream you may simply add all additional properties as parameters to the window.

This example selects all of the derived values, based on the price property, plus the values of the symbol and feed event properties:

select * from StockTickEvent#length(10)#uni(price, symbol, feed)

The following example selects all of the derived values plus all event properties:

select * from StockTickEvent#length(10)#uni(price, symbol, *)
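As a rough guide to what the derived properties in Table 13.5 mean, the following plain-Java sketch computes them for a fixed array of values. It is not Esper code. The class name UnivariateSketch is hypothetical, and the sketch assumes that variance is the sample variance (dividing by n - 1), as suggested by the table describing stddev as the square root of variance.

```java
/** Plain-Java illustration of the univariate properties; this is not Esper code. */
final class UnivariateSketch {
    final long datapoints;   // number of values
    final double total;      // sum of values
    final double average;    // arithmetic mean
    final double variance;   // sample variance (divides by n - 1); an assumption
    final double stddev;     // square root of the sample variance
    final double stddevpa;   // population standard deviation (divides by n)

    UnivariateSketch(double[] values) {
        long n = values.length;
        double sum = 0;
        for (double v : values) {
            sum += v;
        }
        double mean = n == 0 ? Double.NaN : sum / n;

        // Sum of squared deviations from the mean.
        double sumSqDev = 0;
        for (double v : values) {
            sumSqDev += (v - mean) * (v - mean);
        }

        datapoints = n;
        total = sum;
        average = mean;
        variance = n > 1 ? sumSqDev / (n - 1) : Double.NaN;
        stddev = Math.sqrt(variance);
        stddevpa = n > 0 ? Math.sqrt(sumSqDev / n) : Double.NaN;
    }
}
```

For the values 2, 4, 4, 4, 5, 5, 7, 9 the sketch yields a total of 40, an average of 5 and a population standard deviation of 2.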

13.4.3. Regression Derived-Value Window (`linest` or `stat:linest`)

This window calculates regression and related intermediate results on the values returned by two expressions. The window takes two value expressions as parameters plus any number of optional additional expressions to return properties of the last event. The value expressions must return a numeric value:

linest(value_expression, value_expression [,expression, ...] [ * ])

After the two value expressions you may optionally list additional expressions or event properties to evaluate for the stream and return their value based on the last arriving event. You may also provide the * wildcard selector to have the window output all event properties.

Table 13.6. Regression Derived Properties

Property Name	Description
`slope`	Slope.
`YIntercept`	Y intercept.
`XAverage`	X average.
`XStandardDeviationPop`	X standard deviation population.
`XStandardDeviationSample`	X standard deviation sample.
`XSum`	X sum.
`XVariance`	X variance.
`YAverage`	Y average.
`YStandardDeviationPop`	Y standard deviation population.
`YStandardDeviationSample`	Y standard deviation sample.
`YSum`	Y sum.
`YVariance`	Y variance.
`dataPoints`	Number of data points.
`n`	Number of data points.
`sumX`	Sum of X (same as X Sum).
`sumXSq`	Sum of X squared.
`sumXY`	Sum of X times Y.
`sumY`	Sum of Y (same as Y Sum).
`sumYSq`	Sum of Y squared.

The next example calculates regression and returns the slope and y-intercept on price and offer for all events in the last 10 seconds.

select slope, YIntercept from StockTickEvent#time(10 seconds)#linest(price, offer)

To add properties from the event stream you may simply add all additional properties as parameters to the window.

This example selects all of the derived values, based on the price and offer properties, plus the values of the symbol and feed event properties:

select * from StockTickEvent#time(10 seconds)#linest(price, offer, symbol, feed)

The following example selects all of the derived values plus all event properties:

select * from StockTickEvent#time(10 seconds)#linest(price, offer, *)
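The slope and YIntercept properties can be related to the intermediate sums in Table 13.6 with the ordinary least-squares formulas. The plain-Java sketch below is an illustration only, not Esper code. The class name RegressionSketch is hypothetical, and the sketch assumes that the first value expression supplies X and the second supplies Y.

```java
/** Plain-Java least-squares illustration; this is not Esper code. */
final class RegressionSketch {

    /** Returns {slope, yIntercept} of the least-squares line through the points. */
    static double[] fit(double[] x, double[] y) {
        if (x.length != y.length || x.length < 2) {
            throw new IllegalArgumentException("need two arrays of equal length >= 2");
        }
        int n = x.length;
        double sumX = 0, sumY = 0, sumXSq = 0, sumXY = 0;
        for (int i = 0; i < n; i++) {
            sumX += x[i];
            sumY += y[i];
            sumXSq += x[i] * x[i];
            sumXY += x[i] * y[i];
        }
        // The denominator is zero when all X values are equal; the result is then NaN or infinite.
        double denominator = n * sumXSq - sumX * sumX;
        double slope = (n * sumXY - sumX * sumY) / denominator;
        double yIntercept = (sumY - slope * sumX) / n;
        return new double[] {slope, yIntercept};
    }
}
```

For example, the points (1, 3), (2, 5), (3, 7) and (4, 9) lie on the line y = 2x + 1, so the sketch returns a slope of 2 and a y-intercept of 1.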

13.4.4. Correlation Derived-Value Window (`correl` or `stat:correl`)

This window calculates the correlation value on the values returned by two expressions. The window takes two value expressions as parameters plus any number of optional additional expressions to return properties of the last event. The value expressions must return a numeric value:

correl(value_expression, value_expression [,expression, ...] [ * ])

Table 13.7. Correlation Derived Properties

Property Name	Description
`correlation`	Correlation between two event properties

The next example calculates correlation on price and offer over all stock tick events for GE:

select correlation from StockTickEvent(symbol='GE')#correl(price, offer)

To add properties from the event stream you may simply add all additional properties as parameters to the window.

This example selects all of the derived values, based on the price and offer properties, plus the value of the feed event property:

select * from StockTickEvent(symbol='GE')#correl(price, offer, feed)

The next example selects all of the derived values plus all event properties:

select * from StockTickEvent(symbol='GE')#correl(price, offer, *)
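The source does not state which correlation measure the window computes. Assuming it is the Pearson correlation coefficient, the value can be sketched in plain Java as follows. This is not Esper code and the class name CorrelationSketch is hypothetical.

```java
/** Plain-Java Pearson correlation illustration (an assumption); this is not Esper code. */
final class CorrelationSketch {

    /** Returns the Pearson correlation coefficient of the paired values. */
    static double correlation(double[] x, double[] y) {
        if (x.length != y.length || x.length < 2) {
            throw new IllegalArgumentException("need two arrays of equal length >= 2");
        }
        int n = x.length;
        double sumX = 0, sumY = 0, sumXSq = 0, sumYSq = 0, sumXY = 0;
        for (int i = 0; i < n; i++) {
            sumX += x[i];
            sumY += y[i];
            sumXSq += x[i] * x[i];
            sumYSq += y[i] * y[i];
            sumXY += x[i] * y[i];
        }
        double numerator = n * sumXY - sumX * sumY;
        // The result is NaN when either series is constant (zero variance).
        double denominator =
            Math.sqrt((n * sumXSq - sumX * sumX) * (n * sumYSq - sumY * sumY));
        return numerator / denominator;
    }
}
```

Two series that move together linearly give 1, and two series that move in exactly opposite directions give -1.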

13.4.5. Weighted Average Derived-Value Window (`weighted_avg` or `stat:weighted_avg`)

This window returns the weighted average given an expression returning values to compute the average for and an expression returning weight. The window takes two value expressions as parameters plus any number of optional additional expressions to return properties of the last event. The value expressions must return numeric values:

weighted_avg(value_expression_field, value_expression_weight [,expression, ...] [ * ])

Table 13.8. Weighted Average Derived Properties

Property Name	Description
`average`	Weighted average

A statement that derives the volume-weighted average price for the last 3 seconds for a given symbol is shown below:

select average 
from StockTickEvent(symbol='GE')#time(3 seconds)#weighted_avg(price, volume)

To add properties from the event stream you may simply add all additional properties as parameters to the window.

This example selects all of the derived values, based on the price and volume properties, plus the values of the symbol and feed event properties:

select *
from StockTickEvent#time(3 seconds)#weighted_avg(price, volume, symbol, feed)

The next example selects all of the derived values plus the values of all event properties:

select *
from StockTickEvent#time(3 seconds)#weighted_avg(price, volume, *)

Aggregation functions could instead be used to compute the weighted average as well. The next example also posts weighted average per symbol considering the last 3 seconds of stock tick data:

select symbol, sum(price*volume)/sum(volume)
from StockTickEvent#time(3 seconds) group by symbol

The following example computes weighted average keeping a separate data window per symbol considering the last 5 events of each symbol:

select symbol, average
from StockTickEvent#groupwin(symbol)#length(5)#weighted_avg(price, volume)
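The arithmetic behind the weighted average is the same as in the aggregation-function form sum(price*volume)/sum(volume). A minimal plain-Java sketch follows. It is not Esper code, and the class name WeightedAverageSketch is hypothetical.

```java
/** Plain-Java weighted average illustration; this is not Esper code. */
final class WeightedAverageSketch {

    /** Returns sum(value * weight) / sum(weight); NaN when the weights sum to zero. */
    static double weightedAverage(double[] values, double[] weights) {
        if (values.length != weights.length) {
            throw new IllegalArgumentException("values and weights must have equal length");
        }
        double weightedSum = 0;
        double weightTotal = 0;
        for (int i = 0; i < values.length; i++) {
            weightedSum += values[i] * weights[i];
            weightTotal += weights[i];
        }
        return weightedSum / weightTotal;
    }
}
```

With prices 10 and 20 and volumes 1 and 3, the volume-weighted average price is (10 + 60) / 4 = 17.5.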

Chapter 14. Compiler Reference

14.1. Introduction

14.2. Concepts

14.2.1. Module
14.2.2. EPL-objects
14.2.3. Dependencies
14.2.4. Dependency Resolution
14.2.5. Access Modifiers
14.2.6. Bus Modifier for Event Types

14.3. Compiling a Module

14.4. Reading and Writing a Compiled Module

14.5. Reading Module Content

14.6. Compiler Arguments

14.6.1. Compiler Configuration
14.6.2. Compiler Path
14.6.3. Compiler Options

14.7. Statement Object Model

14.7.1. Building an Object Model
14.7.2. Building Expressions
14.7.3. Building a Pattern Statement
14.7.4. Building a Select Statement
14.7.5. Building a Create-Variable and On-Set Statement
14.7.6. Building Create-Window, On-Delete and On-Select Statements

14.8. Substitution Parameters

14.9. OSGi, Class Loader, Class-For-Name

14.10. Authoring Tools

14.11. Testing Tools

14.12. Debugging

14.12.1. @Audit Annotation

14.13. Ordering Multiple Modules

14.14. Logging

14.15. Debugging Generated Code

14.1. Introduction

The compiler provides the following functions:

Compiles a module to JVM byte code.
Compiles a fire-and-forget query to JVM byte code.
Parses a module producing a module object model.
Parses a statement producing a statement object model.
Validates the syntax of a module.
Reads a module from external sources.

The most important function of the compiler is to produce byte code for your module. Deploy the byte code into a runtime for execution.

The compiler interface is EPCompiler in package com.espertech.esper.compiler.client. Your application obtains a compiler instance by calling the getCompiler method of EPCompilerProvider.

For example:

EPCompiler epCompiler = EPCompilerProvider.getCompiler();

Use the compiler as follows:

The compiler is a stateless service. It does not have any state that it keeps between calls.
You may obtain and use any number of compiler instances in parallel.
You may share a compiler instance between threads.
All compiler methods are thread-safe.

14.2. Concepts

14.2.1. Module

A module contains zero, one or multiple statements. A module is a source code unit as the compiler turns a module into byte code. A module does not need to be text; a module can also be an object model.

In module text, statements appear separated by the semicolon (;) character. If there is a single statement in the module the semicolon can be left off.

The synopsis of a module file is:

[module module_name;]
	[uses module_name; | import import_name;] [uses module_name; | import import_name;] [...]
	[epl_statement;] [epl_statement;] [...]

Use the module keyword followed by a module_name identifier or a package (identifiers separated by dots) to declare the name of the module. The module name declaration must be at the beginning of the file, comments and whitespace excluded. The module name serves to check uses-dependencies of other modules.

If a module file requires certain constructs that may be shared by other modules, such as named windows, tables, variables, event types, variant streams or inserted-into streams required by statements, a module file may specify dependent modules with the uses keyword. This serves to avoid name conflicts, and automatic deployment can use this information to determine deployment order.

If the statements in the module require Java classes such as for underlying events or user-defined functions, use the import keyword followed by the fully-qualified class name or package name in the format package.*. The uses and import keywords are optional and must occur after the module declaration.

Following the optional deployment instructions are any number of epl_statement statements that are separated by semicolon (;).

The following is a sample module file explained in detail thereafter:

// Declare the name for the module (optional).
module org.myorganization.switchmonitor;

// Declare other module(s) that this module depends on (optional).
// This can be used to resolve name conflicts.
uses org.myorganization.common;

// Import any Java/.NET classes of the given package name (optional). 
// Imports only handle classpath and do not import other modules.
import org.myorganization.events.*;

// Declare an event type based on a Java class in the package that was imported as above
create schema MySwitchEvent as MySwitchEventPOJO;

// Sample statement
@Name('Off-On-Detector')
insert into MyOffOnStream
select * from pattern[every-distinct(id) a=MySwitchEvent(status='off') 
  -> b=MySwitchEvent(id=a.id, status='on')];

// Sample statement
@Name('Count-Switched-On')
@Description('Count per switch id of the number of Off-to-On switches in the last 1 hour')
select id, count(*) from MyOffOnStream#time(1 hour) group by id;

The example above declares a module name of org.myorganization.switchmonitor. The example demonstrates the import keyword to make a package name known to the compiler for resolving classpath items, as the example assumes that MySwitchEventPOJO is a POJO event class. In addition the example module contains two statements separated by semicolon characters.

14.2.2. EPL-objects

The following types of EPL-objects are managed by the compiler and runtime:

Event types define stream type information and are added using create schema or by configuration.
Variables are free-form value holders and are added using create variable or by configuration.
Named windows are sharable named data windows and are added using create window.
Tables are sharable organized rows with columns that are simple, aggregation and complex types, and are added using create table.
Contexts define analysis lifecycle and are added using create context.
Expressions and Scripts are reusable expressions and are added using create expression.
Indexes organize named window events and table rows for fast lookup and are added using create index.

Your application can pre-configure event types and variables in a Configuration object.

A module can create any number of EPL-objects.

A module can depend on EPL-objects that are pre-configured or that other modules created.

14.2.3. Dependencies

A module usually depends on event types and may also depend on other EPL-objects such as named windows or tables, for example. The compiler resolves all dependencies at compile-time. It produces byte code based on the information associated with the EPL-object. Upon deploying a compiled module's byte code into the runtime the runtime validates that dependencies exist.

For example, consider the following module:

select accountId, amount from Withdrawal

The module above depends on the event type Withdrawal. The compiler resolves the event type name to an EventType instance. It produces code according to the event type. At time of deployment of the compiled module the runtime verifies that the Withdrawal event type exists.

Specifically, the compiler generates code like this:

If the Withdrawal event type is a Map-based event type, the compiler produces code such as event.get("accountId").
If the Withdrawal event type is an Object-Array-based event type, the compiler produces code such as event[index].
If the Withdrawal event type is a Bean-based event type, the compiler produces code such as event.getAccountId().
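The three access styles listed above can be pictured in plain Java. The sketch below is only an illustration of why the generated code depends on the kind of event type; it is not the code the compiler emits. The class PropertyAccessSketch and the WithdrawalBean class are hypothetical.

```java
import java.util.HashMap;
import java.util.Map;

/** Plain-Java picture of the three property access styles; not compiler output. */
final class PropertyAccessSketch {

    /** Hypothetical bean-style event class with a getter for accountId. */
    static final class WithdrawalBean {
        private final String accountId;

        WithdrawalBean(String accountId) {
            this.accountId = accountId;
        }

        public String getAccountId() {
            return accountId;
        }
    }

    // Map-based event: the property is looked up by name.
    static Object fromMap(Map<String, Object> event) {
        return event.get("accountId");
    }

    // Object-array-based event: the property is read by its position.
    static Object fromObjectArray(Object[] event, int index) {
        return event[index];
    }

    // Bean-based event: the property is read through its getter method.
    static Object fromBean(WithdrawalBean event) {
        return event.getAccountId();
    }

    /** Builds a sample map event for use with fromMap. */
    static Map<String, Object> sampleMapEvent(String accountId) {
        Map<String, Object> event = new HashMap<>();
        event.put("accountId", accountId);
        return event;
    }
}
```

Because each style is fixed at compile time, an event type that changes between compilation and deployment (for example a different array position for accountId) would no longer match the generated access code.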

Note

The compiler only tracks dependencies on EPL-objects.

The compiler does not track classpath dependencies. The runtime does not validate classpath dependencies.

The runtime validates that EPL-object dependencies exist before deploying a compiled module.

The runtime does not validate that the information about the EPL-object is the same as at compile-time.

In other words, the runtime does not validate that event property names, event property types, table column names and types, variable types, index property names and other compile-time information matches the information that was provided at compile time.

14.2.4. Dependency Resolution

The compiler resolves an EPL-object by its name by looking at:

The EPL-objects created by the same module (also known as local)
The EPL-objects created by the other modules (also known as path).
The pre-configured event types and variables.

The term path encompasses the EPL-objects other modules define. The term local encompasses the EPL-objects the same module defines.

Coming back to the previous example:

select accountId, amount from Withdrawal

The compiler finds an event type by name Withdrawal by:

Checking whether Withdrawal is an event type that the same module defined by create schema (local).
Checking whether Withdrawal is an event type that another module defined by create schema (path).
Checking whether Withdrawal is a pre-configured event type.

In case the name cannot be resolved the compilation fails.

In case the name is found multiple times, the compiler checks as follows:

If the name is a pre-configured EPL-object and the name is also found in path the validation fails.
If the name is found in local, and the name is found in path or preconfigured, the validation fails.
If the name is found in path for multiple modules, and if there is no module-uses provided, the validation fails.
If the name is found in path for multiple modules and there are module-uses module names provided the EPL object module name must match one of the module names in module-uses.
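The resolution and conflict rules above can be summarized as a small decision procedure. The following plain-Java sketch is a reading of those rules, not the compiler's implementation. The class ResolutionSketch and its resolve method are hypothetical, and the sketch assumes that a name matching more than one module-uses entry is also treated as a failure, which the text does not state.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

/** Plain-Java reading of the name resolution rules; not the compiler's implementation. */
final class ResolutionSketch {

    /**
     * @param inLocal       the name is created by the same module
     * @param preconfigured the name is a pre-configured EPL-object
     * @param pathModules   names of the other modules (path) that define the name
     * @param moduleUses    module names listed in module-uses
     * @return "local", "preconfigured" or "path:" followed by the module name
     */
    static String resolve(boolean inLocal, boolean preconfigured,
                          List<String> pathModules, Set<String> moduleUses) {
        boolean inPath = !pathModules.isEmpty();
        if (!inLocal && !preconfigured && !inPath) {
            throw new IllegalStateException("name cannot be resolved");
        }
        if (preconfigured && inPath) {
            throw new IllegalStateException("name is pre-configured and also found in path");
        }
        if (inLocal && (inPath || preconfigured)) {
            throw new IllegalStateException("name is local and also found elsewhere");
        }
        if (inLocal) {
            return "local";
        }
        if (preconfigured) {
            return "preconfigured";
        }
        if (pathModules.size() == 1) {
            return "path:" + pathModules.get(0);
        }
        // Found in path for multiple modules: module-uses must disambiguate.
        if (moduleUses.isEmpty()) {
            throw new IllegalStateException("name is ambiguous and no module-uses is provided");
        }
        List<String> matches = new ArrayList<>();
        for (String module : pathModules) {
            if (moduleUses.contains(module)) {
                matches.add(module);
            }
        }
        if (matches.size() != 1) {
            // Assumption: zero or several matching modules is a failure.
            throw new IllegalStateException("module-uses does not select exactly one module");
        }
        return "path:" + matches.get(0);
    }
}
```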

14.2.5. Access Modifiers

Access level modifiers determine whether other modules can use a particular EPL-object.

An EPL-object may be declared with the modifier public, in which case that EPL-object is visible to all other modules.

An EPL-object may be declared with the modifier protected, in which case that EPL-object is visible to other modules that have the same module name.

An EPL-object may be declared with the modifier private (the default), in which case that EPL-object is not visible to other modules.
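The three visibility rules can be expressed as a single check. The plain-Java sketch below is an illustration, not Esper code. The class AccessSketch, the Access enum and the method visibleToOtherModule are hypothetical names, and the sketch assumes that a module without a name never matches for protected access.

```java
/** Plain-Java reading of the access modifier rules; this is not Esper code. */
final class AccessSketch {

    enum Access { PUBLIC, PROTECTED, PRIVATE }

    /** Returns true when an EPL-object of owningModule is visible to otherModule. */
    static boolean visibleToOtherModule(Access access, String owningModule, String otherModule) {
        switch (access) {
            case PUBLIC:
                return true;                       // visible to all other modules
            case PROTECTED:
                // Visible only to modules that have the same module name.
                // Assumption: an unnamed (null) module never matches.
                return owningModule != null && owningModule.equals(otherModule);
            default:
                return false;                      // private: not visible to other modules
        }
    }
}
```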

Your application may set access modifiers by:

Using an annotation i.e. @public, @protected, @private.
Setting default access modifiers in the ConfigurationCompilerByteCode that is part of the Configuration object.
Computing access modifiers by providing a callback in CompilerOptions compiler options. Any computed value overrides the annotation or configuration default.

The following module declares a public named window to hold the last 10 seconds of withdrawal events:

@public create window WithdrawalWindow#time(10) as Withdrawal

14.2.6. Bus Modifier for Event Types

For event types there is a bus modifier that determines whether or not the event type is available for use with the sendEventType methods of the EPEventService runtime event service.

An event type may be declared with the bus modifier, in which case calls to sendEventType process the event.

An event type may be declared with the non-bus modifier (the default), in which case calls to sendEventType cause an exception to be thrown.

To understand this better, here is what sendEventType of EPEventService does: When your application calls any of the sendEventBean, sendEventMap, sendEventObjectArray, sendEventXMLDOM or sendEventAvro methods of EPEventService, the runtime finds the event type using the event type name that is passed. It associates the event type to the event object for processing the given event. If the event type name is not recognized or the event type does not have the bus modifier it throws an exception.

The bus modifier is not required for pre-configured event types. The bus modifier requires public access.

Your application may set the bus modifier by:

Using the @buseventtype annotation.
Setting the default bus modifier in the ConfigurationCompilerByteCode that is part of the Configuration object.
Computing a bus modifier by providing a callback in CompilerOptions compiler options. Any computed value overrides the annotation or configuration default.

The following module declares a public event type that allows an application to send in events of that name:

@public @buseventtype create schema AccountQueryEvent (accountId string)

The information herein pertains to the routeEventType and EventSender as well.

14.3. Compiling a Module

The compile method takes two parameters. The first parameter is the module text or a module object model. The second parameter is the compiler arguments.

The output of the compiler is an EPCompiled instance. You can deploy EPCompiled instances directly into a runtime as described in Section 15.4, “Deploying and Undeploying Using EPDeploymentService”.

14.4. Reading and Writing a Compiled Module

The EPCompiledIOUtil class is a utility for writing and reading EPCompiled instances to and from jar-files:

Write an EPCompiled instance to a jar file.
Read a jar file previously written by EPCompiledIOUtil and return an EPCompiled instance.

14.5. Reading Module Content

Read and parse module files using the readModule and parseModule methods, which return a Module instance to represent the module information.

This code snippet demonstrates reading and parsing a module given a file name:

Module module = EPCompilerProvider.getCompiler().readModule(new File("switchmonitor.epl"));

14.6. Compiler Arguments

The compiler arguments are:

The Configuration object can provide pre-configured event types and variables as well as other compiler settings.
The CompilerPath passes information that the compiler uses to determine the EPL-objects that the module may depend on.
The CompilerOptions are compiler instructions.

14.6.1. Compiler Configuration

Pass a Configuration instance to the compiler to configure the compiler. By default the compiler uses an empty configuration object.

The compiler only uses the common section and the compiler section of the configuration. The compiler ignores the runtime section of the configuration.

It is not necessary to pass a configuration object or to pre-configure event types. You may create event types by means of create schema.

A pre-configured event type is a convenience since the event type is already defined and ready to use. The common section of the configuration holds the pre-configured event types. The following sample adds a pre-configured WithdrawalEvent map-based event type:

Map<String, Object> columns = new LinkedHashMap<>();
columns.put("accountId", String.class);
columns.put("amount", double.class);

Configuration configuration = new Configuration();
configuration.getCommon().addEventType("WithdrawalEvent", columns);
CompilerArguments args = new CompilerArguments(configuration);

To obtain a configuration object from a runtime call getConfigurationDeepCopy on EPRuntime:

Configuration configuration = epRuntime.getConfigurationDeepCopy();
CompilerArguments args = new CompilerArguments(configuration);

More information on the common and compiler configuration can be found at TODO.

14.6.1.1. Configuring the Compiler for Subscribers

By default the compiler does not generate code for subscribers and the setSubscriber method on EPStatement throws an exception.

You may set the allowSubscriber option:

Configuration configuration = new Configuration();
configuration.getCompiler().getByteCode().setAllowSubscriber(true);
CompilerArguments args = new CompilerArguments(configuration);

14.6.2. Compiler Path

The compiler path provides EPL-objects that other modules may declare and that the current module may use.

For example, assume a module M₁ that declares a named window WithdrawalWindow:

@public create window WithdrawalWindow#time(10) as Withdrawal

A second module M₂ may query the named window like so:

select (select count(*) from WithdrawalWindow) as cnt from Withdrawal

Module M₂ depends on the EPL-object WithdrawalWindow (a named window) that module M₁ declares.

You can build a path from:

An existing runtime. This adds all EPL-objects that are currently deployed into the runtime to the path.
Compiled modules.

Assume that your application compiled module M₁ like so:

Map<String, Object> columns = new LinkedHashMap<>();
columns.put("accountId", String.class);
columns.put("amount", double.class);

Configuration configuration = new Configuration();
configuration.getCommon().addEventType("Withdrawal", columns); // name matches the Withdrawal type used in the EPL below

CompilerArguments arguments = new CompilerArguments(configuration);
EPCompiled compiledModuleM1 = EPCompilerProvider.getCompiler().compile("@public create window WithdrawalWindow#time(10) as Withdrawal", arguments);

The compiledModuleM1 instance holds the byte code of module M₁.

14.6.2.1. Compiling Against a Runtime

After deploying compiled modules to a runtime, the compiler can build the path from the runtime.

The getRuntimePath method of EPRuntime returns the path object for use by the compiler. The path object is an instance of EPCompilerPathable.

The add method of CompilerPath accepts an EPCompilerPathable instance provided by a runtime.

For example, as follows:

Map<String, Object> columns = new LinkedHashMap<>();
columns.put("accountId", String.class);
columns.put("amount", double.class);

Configuration configuration = new Configuration();
configuration.getCommon().addEventType("Withdrawal", columns); // name matches the Withdrawal type used in the EPL below

// Get a runtime
EPRuntime runtime = EPRuntimeProvider.getDefaultRuntime(configuration);
runtime.getDeploymentService().deploy(compiledModuleM1);

// Compile another module
CompilerArguments arguments = new CompilerArguments(configuration);
arguments.getPath().add(runtime.getRuntimePath());
EPCompiled compiledModuleM2 = EPCompilerProvider.getCompiler().compile("select (select count(*) from WithdrawalWindow) as cnt from Withdrawal", arguments);

14.6.2.2. Adding a Compiled Module to Path

Use the add method of CompilerPath to add a compiled module to path.

For example, as follows:

CompilerArguments arguments = new CompilerArguments(configuration);
arguments.getPath().add(compiledModuleM1);
EPCompiled compiledModuleM2 = EPCompilerProvider.getCompiler().compile("select (select count(*) from WithdrawalWindow) as cnt from Withdrawal", arguments);

14.6.3. Compiler Options

Compiler options provide compiler callbacks and other compile-time parameters:

Provide or override access modifiers and bus event type modifier.
Provide or override the statement name.
Provide a statement user object that can be obtained from an EPStatement with getUserObjectCompileTime.
Provide or override the module name.
Provide or override module-uses information.

Please consult the JavaDoc for more information.

14.7. Statement Object Model

The statement object model is a set of classes that provide an object-oriented representation of a statement. The object model classes are found in package com.espertech.esper.common.client.soda. An instance of EPStatementObjectModel represents a statement's object model.

The statement object model classes are a full and complete specification of a statement. All EPL constructs including expressions and sub-queries are available in the statement object model.

The statement object model provides the means to build, change or interrogate statements beyond the string representation. The object graph of the statement object model is fully navigable for easy querying by code, and is also serializable, allowing applications to persist or transport statements in object form when required.

The statement object model supports full round-trip from object model to statement string and back to object model: A statement object model can be rendered into a string representation via the toEPL method on EPStatementObjectModel. Further, the compiler API allows compiling a statement string into an object model representation via the eplToModel method on EPCompiler.

The statement object model is fully mutable. Mutating any list such as returned by getChildren(), for example, is acceptable and supported.

The following limitations apply:

Statement object model classes are not safe for sharing between threads other than for read access.
Between versions the serialized form of the object model is subject to change. There are no guarantees that the serialized object model of one version will be fully compatible with the serialized object model generated by another version. Please consider this issue when storing object models in persistent store.

14.7.1. Building an Object Model

An EPStatementObjectModel consists of an object graph representing all possible clauses that can be part of a statement.

Among all clauses, the SelectClause and FromClause objects are required clauses that must be present, in order to define what to select and where to select from.

Table 14.1. Required Statement Object Model Instances

Class	Description
EPStatementObjectModel	All statement clauses for a statement, such as the select-clause and the from-clause, are specified within the object graph of an instance of this class
SelectClause	A list of the selection properties or expressions, or a wildcard
FromClause	A list of one or more streams; a stream can be filter-based, pattern-based, SQL-based or of another kind. Add data windows here.

Part of the statement object model package are convenient builder classes that make it easy to build a new object model or change an existing object model. The SelectClause and FromClause are such builder classes and provide convenient create methods.

Within the from-clause you have a choice of different streams to select on. The FilterStream class represents a stream that is filled by events of a certain type and that pass an optional filter expression.

We can use the classes introduced above to create a simple statement object model:

EPStatementObjectModel model = new EPStatementObjectModel();
model.setSelectClause(SelectClause.createWildcard());
model.setFromClause(FromClause.create(FilterStream.create("ReadyEvent")));

The model as above is equivalent to:

select * from ReadyEvent

Notes on usage:

Variable names can simply be treated as property names.
When selecting from named windows or tables, the name of the named window or table is the event type name for use in FilterStream instances or patterns.
To compile an arbitrary sub-expression text into an Expression object representation, simply add the expression text to a where-clause, compile the EPL string into an object model via the eplToModel method on EPCompiler, and obtain the compiled where-clause expression from the EPStatementObjectModel via the getWhereClause method.

14.7.2. Building Expressions

The EPStatementObjectModel includes an optional where-clause. The where-clause is a filter expression that the runtime applies to events in one or more streams. The key interface for all expressions is the Expression interface.

The Expressions class provides a convenient way of obtaining Expression instances for all possible expressions. Please consult the JavaDoc for detailed method information. The next example discusses sample where-clause expressions.

Use the Expressions class as a service for creating expression instances, and add additional expressions via the add method that most expressions provide.

The next example adds a simple where-clause to the EPL as shown earlier:

select * from ReadyEvent where line=8

And the code to add a where-clause to the object model is below.

model.setWhereClause(Expressions.eq("line", 8));

The following example considers a more complex where-clause. Assume you need to build an expression using logical-and and logical-or:

select * from ReadyEvent 
where (line=8) or (line=10 and age<5)

The code for building such a where-clause by means of the object model classes is:

model.setWhereClause(Expressions.or()
  .add(Expressions.eq("line", 8))
  .add(Expressions.and()
      .add(Expressions.eq("line", 10))
      .add(Expressions.lt("age", 5))
  ));

14.7.3. Building a Pattern Statement

The Patterns class is a factory for building pattern expressions. It provides convenient methods to create all pattern expressions of the pattern language.

Patterns in EPL are seen as a stream of events that consists of pattern matches. The PatternStream class represents a stream of pattern matches and contains a pattern expression within.

For instance, consider the following pattern statement.

select * from pattern [every a=MyAEvent and not b=MyBEvent]

The next code snippet outlines how to use the statement object model and specifically the Patterns class to create a statement object model that is equivalent to the pattern statement above.

EPStatementObjectModel model = new EPStatementObjectModel();
model.setSelectClause(SelectClause.createWildcard());
PatternExpr pattern = Patterns.and()
  .add(Patterns.everyFilter("MyAEvent", "a"))
  .add(Patterns.notFilter("MyBEvent", "b"));
model.setFromClause(FromClause.create(PatternStream.create(pattern)));

14.7.4. Building a Select Statement

This section builds a complete example statement and includes all optional clauses in one statement, to demonstrate the object model API.

A sample statement:

insert into ReadyStreamAvg(line, avgAge) 
select line, avg(age) as avgAge 
from ReadyEvent(line in (1, 8, 10))#time(10) as RE
where RE.waverId is not null
group by line 
having avg(age) < 0
output every 10.0 seconds 
order by line

Finally, this code snippet builds the above statement from scratch:

EPStatementObjectModel model = new EPStatementObjectModel();
model.setInsertInto(InsertIntoClause.create("ReadyStreamAvg", "line", "avgAge"));
model.setSelectClause(SelectClause.create()
    .add("line")
    .add(Expressions.avg("age"), "avgAge"));
Filter filter = Filter.create("ReadyEvent", Expressions.in("line", 1, 8, 10));
model.setFromClause(FromClause.create(
    FilterStream.create(filter, "RE").addView("win", "time", 10)));
model.setWhereClause(Expressions.isNotNull("RE.waverId"));
model.setGroupByClause(GroupByClause.create("line"));
model.setHavingClause(Expressions.lt(Expressions.avg("age"), Expressions.constant(0)));
model.setOutputLimitClause(OutputLimitClause.create(OutputLimitSelector.DEFAULT, Expressions.timePeriod(null, null, null, 10.0, null)));
model.setOrderByClause(OrderByClause.create("line"));

14.7.5. Building a Create-Variable and On-Set Statement

This sample statement creates a variable:

create variable integer var_output_rate = 10

The code to build the above statement using the object model:

EPStatementObjectModel model = new EPStatementObjectModel();
model.setCreateVariable(CreateVariableClause.create("integer", "var_output_rate", 10));

A second statement sets the variable to a new value:

on NewValueEvent set var_output_rate = new_rate

The code to build the above statement using the object model:

EPStatementObjectModel model = new EPStatementObjectModel();
model.setOnExpr(OnClause.createOnSet("var_output_rate", Expressions.property("new_rate")));
model.setFromClause(FromClause.create(FilterStream.create("NewValueEvent")));

14.7.6. Building Create-Window, On-Delete and On-Select Statements

This sample statement creates a named window:

create window OrdersTimeWindow#time(30 sec) as select symbol as sym, volume as vol, price from OrderEvent

This is the code that builds the create-window statement as above:

EPStatementObjectModel model = new EPStatementObjectModel();
model.setCreateWindow(CreateWindowClause.create("OrdersTimeWindow").addView("win", "time", 30));
model.setSelectClause(SelectClause.create()
		.addWithName("symbol", "sym")
		.addWithName("volume", "vol")
		.add("price"));
model.setFromClause(FromClause.create(FilterStream.create("OrderEvent")));

A second statement deletes from the named window:

on NewOrderEvent as myNewOrders
delete from OrdersNamedWindow as myNamedWindow
where myNamedWindow.symbol = myNewOrders.symbol

The object model is built by:

EPStatementObjectModel model = new EPStatementObjectModel();
model.setOnExpr(OnClause.createOnDelete("OrdersNamedWindow", "myNamedWindow"));
model.setFromClause(FromClause.create(FilterStream.create("NewOrderEvent", "myNewOrders")));
model.setWhereClause(Expressions.eqProperty("myNamedWindow.symbol", "myNewOrders.symbol"));

A third statement selects from the named window using the non-continuous on-demand selection via on-select:

on QueryEvent(volume>0) as query
select count(*) from OrdersNamedWindow as win
where win.symbol = query.symbol

The on-select statement is built from scratch via the object model as follows:

EPStatementObjectModel model = new EPStatementObjectModel();
model.setOnExpr(OnClause.createOnSelect("OrdersNamedWindow", "win"));
model.setWhereClause(Expressions.eqProperty("win.symbol", "query.symbol"));
model.setFromClause(FromClause.create(FilterStream.create("QueryEvent", "query", 
  Expressions.gt("volume", 0))));
model.setSelectClause(SelectClause.create().add(Expressions.countStar()));

14.8. Substitution Parameters

Substitution parameters have the following syntax:

? [:[name] [:type]]

The name is optional. The absence of a name means the substitution parameter is only addressable by index.

The type is optional. The absence of the type means the type of the substitution parameter is java.lang.Object. Use cast or provide a type name when your expression requires a strongly-typed value.

Here are a few examples of valid substitution parameters:

Table 14.2. Valid Substitution Parameters

Value	Description
?	Unnamed and typed `Object`.
?::int	Unnamed and typed `int`.
?:param:string	Named and typed `string`.

All substitution parameters must either be unnamed or named. It is not possible to mix the two styles.

If not assigning a name to substitution parameters, the compiler assigns the first substitution parameter an index of 1 and subsequent parameters increment the index by one.

If assigning a name to each substitution parameter, the name can include slash (/) characters and can occur multiple times.

Substitution parameters can be inserted into any EPL construct that takes an expression. They are therefore valid in any clause, such as the select-clause, from-clause filters, where-clause, group-by-clause, having-clause or order-by-clause, as well as in data window parameters and pattern observers and guards. Substitution parameters cannot be used where a numeric constant is required rather than an expression, nor in SQL statements.

You may use square brackets ([]) to denote array types and [primitive] to denote an array of primitives. For example, int[primitive] denotes an array of int-primitive and int[] denotes an array of Integer.

All substitution parameters must be replaced by actual values at time of deployment.
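The syntax and indexing rules of this section can be illustrated with a small plain-Java sketch. It does not call the Esper compiler: the class SubstitutionParamSketch, its regular expression and its error message are hypothetical, and the scan is deliberately naive (for instance, it does not skip question marks inside string literals). It finds each substitution parameter in a text, assigns unnamed parameters an index starting at 1, treats a missing type as java.lang.Object and rejects a mix of named and unnamed parameters.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical helper (not part of Esper): finds substitution parameters in EPL text. */
final class SubstitutionParamSketch {

    /** One parameter occurrence; name and type are null when not given. */
    static final class Param {
        final int index;   // 1-based index for unnamed parameters, 0 for named ones
        final String name;
        final String type;

        Param(int index, String name, String type) {
            this.index = index;
            this.name = name;
            this.type = type;
        }

        /** Without an explicit type the parameter is typed java.lang.Object. */
        String effectiveType() {
            return type == null ? "java.lang.Object" : type;
        }
    }

    // '?' optionally followed by ':' [name] and then optionally ':' type,
    // where the type may end in [] or [primitive].
    private static final Pattern PARAM = Pattern.compile(
        "\\?(?::([A-Za-z_][A-Za-z0-9_/]*)?(?::([A-Za-z_][A-Za-z0-9_.]*(?:\\[(?:primitive)?\\])?))?)?");

    static List<Param> scan(String epl) {
        List<Param> params = new ArrayList<>();
        boolean sawNamed = false;
        boolean sawUnnamed = false;
        Matcher matcher = PARAM.matcher(epl);
        while (matcher.find()) {
            String name = matcher.group(1);
            if (name == null) {
                sawUnnamed = true;
            } else {
                sawNamed = true;
            }
            if (sawNamed && sawUnnamed) {
                throw new IllegalArgumentException(
                    "Substitution parameters must be all named or all unnamed");
            }
            int index = name == null ? params.size() + 1 : 0;
            params.add(new Param(index, name, matcher.group(2)));
        }
        return params;
    }

    public static void main(String[] args) {
        for (Param p : scan("select * from PersonEvent(firstName=?::string, age>?::int)")) {
            System.out.println(p.index + " -> " + p.effectiveType());
        }
    }
}
```

The three rows of Table 14.2 map onto this sketch as follows: ? yields no name and no type, ?::int yields only a type, and ?:param:string yields both.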

14.9. OSGi, Class Loader, Class-For-Name

The configuration object (Configuration), in respect to classes, holds the fully-qualified class name and does not generally hold Class references. This is by design since the configuration object can be populated from XML.

The compiler may need to look up a class by name and may need to obtain a class loader. Your application has full control over class-for-name and classloader use. OSGi environments can provide a specific class-for-name and class loader. Please refer to Section 16.7, “Passing Services or Transient Objects”.

14.10. Authoring Tools

Enterprise Edition includes authoring tools for statements and modules by providing form-based dialogs, templates, an expression builder, simulation tool and other tools. Enterprise Edition also supports hot deployment and packaging options for EPL and related code.

Statements can be organized into modules as described above. Any text editor can edit statements and module text. A text editor or IDE that highlights SQL syntax or keywords works.

For authoring configuration files please consult the XSD schema files as provided with the distribution.

For information on authoring event classes or event definitions in general please see Chapter 3, Event Representations or Section 5.15, “Declaring an Event Type: Create Schema”.

14.11. Testing Tools

We recommend testing modules using a test framework such as JUnit or TestNG. Please consult the regression test suite for extensive examples, which can be downloaded from the distribution site.

Esper's API provides test framework classes to simplify automated testing of statements. Please see Section 15.18, “Test and Assertion Support” for more information.

We recommend performing latency and throughput tests early in the development lifecycle. Please consider the performance tips in Chapter 22, Performance for optimal performance.

Consider runtime and statement metrics reporting for identifying slow-performing statements, for example. See Section 15.12, “Runtime and Statement Metrics Reporting”.

14.12. Debugging

Enterprise Edition includes a debugger for module execution.

One important tool for debugging without Enterprise Edition is the parameterized @Audit annotation. This annotation outputs, at the statement level, detailed information about many aspects of statement processing.

Another tool for logging runtime-level detail is execution path debug logging, as described in Section 16.6.2.1, “Execution Path Debug Logging”.

Please see Section 16.9, “Logging Configuration” for information on configuring logging in general.

14.12.1. @Audit Annotation

Use the @Audit annotation to have the runtime output detailed information about statement processing. The runtime reports, at INFO level, the information under log name com.espertech.esper.audit. You may define an output format for audit information via configuration.

You may provide a comma-separated list of category names to @Audit to output information related to specific categories only. The table below lists all available categories. If no parameter is provided, the runtime outputs information for all categories. Category names are not case-sensitive.

For the next statement the runtime produces detailed processing information (all categories) for the statement:

@Name('All Order Events') @Audit select * from OrderEvent

For the next statement the runtime provides information about new events and also about event property values (2 categories are listed):

@Name('All Order Events') @Audit('stream,property') select price from OrderEvent

Here is a more complete example that uses the API to create the schema, create the above statement and send an event:

try {
  String module =
    "@public @buseventtype create schema OrderEvent(price double);\n" +
    "@name('All-Order-Events') @Audit('stream,property') select price from OrderEvent;\n";
  EPCompiled compiled = EPCompilerProvider.getCompiler().compile(module, null);

  EPRuntime runtime = EPRuntimeProvider.getDefaultRuntime();
  EPDeployment deployment = runtime.getDeploymentService().deploy(compiled);
  deployment.getStatements()[0].addListener(new SupportUpdateListener());
  runtime.getEventService().sendEventMap(Collections.singletonMap("price", 100d), "OrderEvent");
} catch (Throwable t) {
  log.error(t.getMessage(), t);
}

The output is similar to the following:

INFO  [audit] Statement All-Order-Events stream OrderEvent inserted {price=100.0}
INFO  [audit] Statement All-Order-Events property price value 100.0

Table 14.3. @Audit Categories

Category	Description
ContextPartition	Each context partition allocation and de-allocation (only for statements that declare a context).
Dataflow-Source	Each data flow source operator providing an event.
Dataflow-Op	Each data flow operator processing an event.
Dataflow-Transition	Each data flow instance state transition.
Exprdef	Each expression declaration name and return value.
Expression	Each top-level expression and its return value.
Expression-nested	Each expression including child or nested expressions and their return value.
Insert	Each event inserted via insert-into.
Pattern	Each pattern sub-expression and its change in truth-value.
Pattern-instances	Each pattern sub-expression and its count of active instances.
Property	Each property name and the event's property value.
Schedule	Each schedule modification and trigger received by a statement.
Stream	Each new event received by a statement.
View	Each data window name and its insert and remove stream.

Note that the runtime only evaluates select-clause expressions if either a listener or subscriber is attached to the statement or if used with insert-into.

14.13. Ordering Multiple Modules

Since modules may have inter-dependencies as discussed under the uses declaration, there is a ModuleOrderUtil class that provides the getModuleOrder method to order a collection of modules before deployment.

Assuming your application reads multiple modules into a mymodules module list, this code snippet orders the modules for deployment and validates dependency declarations for each module:

List<Module> mymodules =  ... read modules...;  
ModuleOrder order = ModuleOrderUtil.getModuleOrder(mymodules, new ModuleOrderOptions());
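To give an intuition for what ordering modules by their uses-dependencies involves, the following plain-Java sketch performs a depth-first topological sort over module names. It is not the implementation of ModuleOrderUtil: the class ModuleOrderSketch, the map-based input (module name to the set of module names it uses) and the exceptions thrown are assumptions made for illustration only.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

/** Hypothetical sketch (not Esper code): orders module names so that dependencies come first. */
final class ModuleOrderSketch {

    static List<String> order(Map<String, Set<String>> usesByModule) {
        List<String> ordered = new ArrayList<>();
        Set<String> done = new HashSet<>();
        Set<String> inProgress = new HashSet<>();
        for (String module : usesByModule.keySet()) {
            visit(module, usesByModule, done, inProgress, ordered);
        }
        return ordered;
    }

    private static void visit(String module, Map<String, Set<String>> usesByModule,
                              Set<String> done, Set<String> inProgress, List<String> ordered) {
        if (done.contains(module)) {
            return;
        }
        // A used module that was never supplied is reported as an error in this sketch.
        if (!usesByModule.containsKey(module)) {
            throw new IllegalArgumentException("Module not found: " + module);
        }
        // Re-entering a module that is still being visited means there is a cycle.
        if (!inProgress.add(module)) {
            throw new IllegalStateException("Circular dependency involving: " + module);
        }
        for (String dependency : usesByModule.get(module)) {
            visit(dependency, usesByModule, done, inProgress, ordered);
        }
        inProgress.remove(module);
        done.add(module);
        ordered.add(module);   // appended only after everything it uses
    }

    public static void main(String[] args) {
        Map<String, Set<String>> uses = new LinkedHashMap<>();
        uses.put("reports", new HashSet<>(Arrays.asList("orders", "common")));
        uses.put("orders", Collections.singleton("common"));
        uses.put("common", Collections.<String>emptySet());
        System.out.println(order(uses));
    }
}
```

In this sketch a module is appended to the result only after all modules it uses, so deploying in the returned order never deploys a module before its dependencies.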

14.14. Logging

You can log generated classes at INFO log level by setting the configuration flag for code logging as described in Section 16.5.3.1, “Byte Code Generation Logging”.

14.15. Debugging Generated Code

The information herein is for developers and is specific to the Janino compiler at the version provided with the distribution.

Set the system property org.codehaus.janino.source_debugging.enable to true to have Janino compile code with debug symbols.

Set the system property org.codehaus.janino.source_debugging.dir to a file system directory to have Janino generate classes into a given directory.

The IDE can debug into generated classes and show the source code provided that the IDE can access the source code. For example:

-Dorg.codehaus.janino.source_debugging.dir=/path/to/directory
-Dorg.codehaus.janino.source_debugging.enable=true
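When passing -D flags on the command line is inconvenient, the same two system properties can presumably be set from code using the standard System.setProperty call. The sketch below assumes that the properties must be set before the compiler is first used; the class name JaninoDebugProperties and the directory chosen in main are placeholders.

```java
/** Hypothetical helper: sets the two Janino debugging system properties named above. */
final class JaninoDebugProperties {

    static void enable(String directory) {
        // Ask Janino to compile with debug symbols.
        System.setProperty("org.codehaus.janino.source_debugging.enable", "true");
        // Directory for the generated classes, so that an IDE can locate them.
        System.setProperty("org.codehaus.janino.source_debugging.dir", directory);
    }

    public static void main(String[] args) {
        // Assumption: called early, before any compilation takes place.
        enable(System.getProperty("java.io.tmpdir"));
        System.out.println(System.getProperty("org.codehaus.janino.source_debugging.dir"));
    }
}
```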

To include additional EPL-related comments in the generated code you can change the configuration as outlined in Section 16.5.1, “Compiler Settings Related to Byte Code Generation”.

Chapter 15. Runtime Reference

15.1. Introduction

15.2. Obtaining a Runtime From EPRuntimeProvider

15.3. The EPRuntime Runtime Interface

15.4. Deploying and Undeploying Using EPDeploymentService

15.4.1. Substitution Parameters
15.4.2. Atomic Deployment Management

15.5. Obtaining Results Using EPStatement

15.5.1. Receiving Statement Results
15.5.2. Setting a Subscriber Object
15.5.3. Adding Listeners
15.5.4. Using Iterators
15.5.5. Event and Event Type
15.5.6. Interrogating Annotations

15.6. Processing Events and Time Using EPEventService

15.6.1. Event Sender
15.6.2. Receiving Unmatched Events

15.7. Execute Fire-and-Forget Queries Using EPFireAndForgetService

15.7.1. Fire-and-forget Query Single Execution
15.7.2. Fire-and-forget Query Prepared Unparameterized Execution
15.7.3. Fire-and-forget Query Prepared Parameterized Execution

15.8. Runtime Threading and Concurrency

15.8.1. Advanced Threading
15.8.2. Processing Order

15.9. Controlling Time-Keeping

15.9.1. Controlling Time Using Time Span Events
15.9.2. Time Resolution and Time Unit
15.9.3. Internal Timer Based on JVM System Time

15.10. Exception Handling

15.11. Condition Handling

15.12. Runtime and Statement Metrics Reporting

15.12.1. Runtime Metrics
15.12.2. Statement Metrics

15.13. Monitoring and JMX

15.14. Event Rendering to XML and JSON

15.14.1. JSON Event Rendering Conventions and Options
15.14.2. XML Event Rendering Conventions and Options

15.15. Plug-In Loader

15.16. Context Partition Selection

15.16.1. Selectors

15.17. Context Partition Administration

15.18. Test and Assertion Support

15.18.1. EPAssertionUtil Summary
15.18.2. SupportUpdateListener Summary
15.18.3. Usage Example

15.19. OSGi, Class Loader, Class-For-Name

15.20. When Deploying with J2EE

15.20.1. J2EE Deployment Considerations
15.20.2. Servlet Context Listener

15.1. Introduction

The runtime takes on these functions:

Provide an environment to execute compiled modules.
Provide an environment to run compiled fire-and-forget queries.
Process incoming events and time against deployed modules.

Your application obtains a runtime from EPRuntimeProvider. You may pass an arbitrary string-type runtime URI that uniquely identifies the runtime instance.

A runtime is an instance of EPRuntime. Use the runtime as follows:

The runtime is a stateful service.
You may obtain and use any number of runtime instances in parallel, each runtime instance uniquely identified by the runtime URI.
You may share a runtime instance between threads.
All runtime methods are thread-safe.
Each runtime is completely independent of other runtimes.

15.2. Obtaining a Runtime From `EPRuntimeProvider`

The EPRuntimeProvider class provides static methods that return EPRuntime runtimes.

Each runtime has a unique runtime URI which can be any string value. If your application does not pass a runtime URI then the default URI is default (as defined by EPRuntimeProvider.DEFAULT_RUNTIME_URI).

For the getRuntime methods, your application can pass a runtime URI to obtain different runtimes. The EPRuntimeProvider determines whether the provided runtime URI matches any existing runtime URIs and returns the existing runtime, or allocates a new runtime if none was found.

The getExistingRuntime method takes a runtime URI and returns the existing runtime for that URI or null if there is none.

The code snippet below gets the default runtime. Subsequent calls to get the default runtime return the same runtime.

EPRuntime runtime = EPRuntimeProvider.getDefaultRuntime();

The next code gets a runtime for the runtime URI RFIDProcessor1. Subsequent calls to get a runtime with the same runtime URI return the same runtime instance.

EPRuntime runtime = EPRuntimeProvider.getRuntime("RFIDProcessor1");

Since the getRuntime methods return the same runtime for each URI there is no need to statically cache a runtime in your application.
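The lookup-or-allocate behavior described above resembles a map keyed by runtime URI. The following plain-Java sketch models that behavior only; it is not how EPRuntimeProvider is implemented, and the names RuntimeRegistrySketch and FakeRuntime are hypothetical stand-ins.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

/** Hypothetical model (not Esper code) of a provider that hands out one instance per URI. */
final class RuntimeRegistrySketch {
    static final String DEFAULT_URI = "default";

    /** Stand-in for a runtime; it only remembers its URI. */
    static final class FakeRuntime {
        final String uri;

        FakeRuntime(String uri) {
            this.uri = uri;
        }
    }

    private static final ConcurrentMap<String, FakeRuntime> RUNTIMES = new ConcurrentHashMap<>();

    /** Returns the existing instance for the URI, or allocates one if none exists. */
    static FakeRuntime getRuntime(String uri) {
        return RUNTIMES.computeIfAbsent(uri, FakeRuntime::new);
    }

    /** Same as getRuntime with the default URI. */
    static FakeRuntime getDefaultRuntime() {
        return getRuntime(DEFAULT_URI);
    }

    /** Returns the existing instance for the URI, or null if there is none. */
    static FakeRuntime getExistingRuntime(String uri) {
        return RUNTIMES.get(uri);
    }

    public static void main(String[] args) {
        FakeRuntime first = getRuntime("RFIDProcessor1");
        FakeRuntime second = getRuntime("RFIDProcessor1");
        System.out.println(first == second);
    }
}
```

Because the registry in this sketch already guarantees one instance per URI, callers gain nothing from keeping their own static reference, which mirrors the remark above about not caching a runtime.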

You may also pass an optional Configuration. The next code snippet outlines a typical sequence of use:

// Configure the runtime, this is optional
Configuration config = new Configuration();
config.configure("configuration.xml");	// load a configuration from file

// Optionally set additional configuration values like so:
// config.getCommon().add....(...);

// Obtain a runtime
EPRuntime runtime = EPRuntimeProvider.getDefaultRuntime(config);

// Optionally, use initialize if the same runtime has been used before to start clean
runtime.initialize();

// Destroy the runtime when no longer needed, frees up resources, releases the runtime URI
runtime.destroy();

15.3. The `EPRuntime` Runtime Interface

The EPRuntime interface represents a runtime. Only the static methods of the EPRuntimeProvider class allocate new runtimes. A runtime is uniquely identified by runtime URI. The runtime URI is an arbitrary string. The default runtime has a runtime URI of default.

A runtime provides these services:

Table 15.1. Runtime Services

Service	Runtime Method	Description
`EPDeploymentService`	`getDeploymentService`	For deploying and undeploying compiled modules.
`EPEventService`	`getEventService`	For processing events and advancing time.
`EPContextPartitionService`	`getContextPartitionService`	For information about context partitions.
`EPVariableService`	`getVariableService`	For access to variables.
`EPEventTypeService`	`getEventTypeService`	For obtaining event types.
`EPFireAndForgetService`	`getFireAndForgetService`	For executing fire-and-forget queries.
`EPDataFlowService`	`getDataFlowService`	For managing data flows.
`EPMetricsService`	`getMetricsService`	For control over metrics.
`EPRenderEventService`	`getRenderEventService`	For rendering events.

You can reset a runtime by calling the initialize method. This operation resets the runtime to the configuration last provided to EPRuntimeProvider. If no configuration is provided, an empty (default) configuration applies. Your application must obtain new services from the initialized runtime as initialize marks existing services as invalid.

A runtime can be destroyed via the destroy method. This frees all resources held by the runtime. After a call to destroy the runtime can no longer be used.

You may register callbacks to receive notifications about runtime state. The runtime invokes any EPRuntimeStateListener callbacks when a runtime instance is about to be destroyed and after a runtime has been initialized. Use the addRuntimeStateListener methods to register interest.

When destroying a runtime your application must make sure that threads that are sending events into the runtime have completed their work. More generally, the runtime should not be currently in use during or after the destroy operation.

All runtime instances are completely independent. Your application may not send EventBean instances obtained from one runtime into a second runtime since the event type space between two runtimes is not shared.

15.4. Deploying and Undeploying Using `EPDeploymentService`

Your application must first compile a module or obtain a compiled module before it can deploy. The object representation of a compiled module is EPCompiled.

Call the deploy method and pass the compiled module. The runtime loads the byte code and adds the information contained in the byte code, causing all the compiled module's statements to begin receiving events and time.

Deploying is an atomic operation. At deployment completion all statements of the deployment begin to see events arriving and time passing consistently. In case the deployment fails the runtime rolls back all deployment changes.

The runtime resolves dependencies of the compiled module upon its deployment. The runtime does not validate that the information about EPL-object dependencies that existed at compile-time matches the runtime EPL-objects.

For example, assume there is a compiled module by name compiledModuleM1. Deploy as follows:

EPDeployment deployment = runtime.getDeploymentService().deploy(compiledModuleM1);

The runtime returns an EPDeployment instance that contains the deployment id, the EPStatement statement instances, module name and module properties. The deployment id is an arbitrary string-type identifier that uniquely identifies the deployment in the runtime.

The undeploy method takes the deployment id and undeploys the deployment. The undeployAll method undeploys all deployments.

A compiled module may be deployed any number of times. Substitution parameters can be handy for parameterizing deployed modules.

Your application may deploy and undeploy using any thread and also within listener or subscriber code. If using Bean-style class-based events your application may not invoke deploy or undeploy methods as part of getter or setter code. Extension API code and plug-in single-row methods also may not invoke deploy or undeploy methods.

You may pass a DeploymentOptions instance. Deployment options provide deployment callbacks and other deploy-time parameters:

Provide a deployment id. If none is provided the runtime generates a unique deployment id.
Provide substitution parameter values for parameterized modules.
Provide or override statement names.
Provide a runtime statement user object that gets associated to the statement and that can be obtained from an EPStatement with getUserObjectRuntime.

Please consult the JavaDoc for more information.

15.4.1. Substitution Parameters

The compiled module may have substitution parameters as explained in the compiler documentation.

All substitution parameters must be replaced by actual values before a compiled module with substitution parameters can be deployed. A compiled module may be deployed multiple times. Substitution parameters can be set to new values for every deployment.

To set substitution parameter values, pass a DeploymentOptions object to the deploy method that provides a StatementSubstitutionParameterOption.

If not assigning a name to substitution parameters, replace the substitution parameter with an actual value using the setObject(int index, Object value) method for each index, starting from 1.

If assigning a name to each substitution parameter, replace the substitution parameter with an actual value using the setObject(String name, Object value) method for each name.

While the setObject method allows substitution parameters to assume any actual value including application Java objects or enumeration values, the application must provide the correct type of substitution parameter that matches the requirements of the expression the parameter resides in.

The below sample code compiles and deploys a parameterized module:

String stmt = "select * from PersonEvent(firstName=?::string)";
Configuration configuration = new Configuration();
configuration.getCommon().addEventType(PersonEvent.class);
CompilerArguments compilerArguments = new CompilerArguments(configuration);
EPCompiled compiled = EPCompilerProvider.getCompiler().compile(stmt, compilerArguments);

DeploymentOptions deploymentOptions = new DeploymentOptions();
deploymentOptions.setStatementSubstitutionParameter(prepared -> prepared.setObject(1, "Joe")); 
EPDeployment deployment = runtime.getDeploymentService().deploy(compiled, deploymentOptions);
EPStatement statement = deployment.getStatements()[0];

15.4.2. Atomic Deployment Management

Your application can concurrently send events into the runtime while deploying and undeploying statements and adding or removing listeners. It is safe to undeploy and deploy compiled modules while sending in events from other threads concurrently.

However in some cases your application may need more control over deployment, for example when deploying multiple modules or when attaching custom listener code.

Your application can use the API described below to obtain a lock and perform deployment actions as an atomic unit. For example, if your application would like to undeploy and re-deploy as a single atomic unit, while at the same time sending events into the runtime from different threads, it can obtain a lock to ensure that no events are concurrently processed while the operations take place.

Note

Deploying or undeploying a single compiled module is already an atomic operation by default and does not require taking an explicit lock. If your application would like to deploy multiple compiled modules or add custom listeners or subscribers during deployment it may obtain a lock as discussed below.

The below code sample obtains the runtime exclusive write lock to perform multiple management operations as a unit, excluding concurrent processing of events.

runtime.getRuntimeInstanceWideLock().writeLock().lock();
// Start atomic management unit. 
// Any events concurrently being processed by other threads must complete before the code completes obtaining the lock. 
// Any events sent in by other threads will await the release of the lock.
try {
  // Perform operations such as : 
  //   - deploy and/or undeploy multiple compiled modules  (deployment admin API)
  //   - set statement listeners and subscribers while deploying
  // There is no need to obtain this lock when deploying or undeploying a single module.
  // The lock is reentrant and can be safely taken multiple times by the same thread.
  // Make sure you use "try" and "finally" just like we have it here.
}
finally {
  // Complete atomic management unit. 
  // Any events sent in by other threads will now continue processing against the changed set of statements.
  runtime.getRuntimeInstanceWideLock().writeLock().unlock();
}

Note

There should always be a finally block in your code to ensure the lock is released in all cases.
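The effect of the exclusive write lock can be modelled with the JDK's ReentrantReadWriteLock. The sketch below assumes that event processing holds the shared read lock while management operations hold the write lock; that is a simplification for illustration and not a description of the runtime's internals. The class AtomicManagementSketch and its methods are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.locks.ReadWriteLock;
import java.util.concurrent.locks.ReentrantReadWriteLock;

/** Hypothetical model (not Esper code) of management operations excluding event processing. */
final class AtomicManagementSketch {
    private final ReadWriteLock lock = new ReentrantReadWriteLock();
    private final List<String> deployments = new ArrayList<>();

    void deploy(String id) {
        lock.writeLock().lock();
        try {
            deployments.add(id);
        } finally {
            lock.writeLock().unlock();
        }
    }

    void undeploy(String id) {
        lock.writeLock().lock();
        try {
            deployments.remove(id);
        } finally {
            lock.writeLock().unlock();
        }
    }

    /** Undeploys and deploys as one unit; no event can observe the state in between. */
    void redeploy(String oldId, String newId) {
        lock.writeLock().lock();
        try {
            undeploy(oldId);   // the write lock is reentrant, so nested locking is safe
            deploy(newId);
        } finally {
            lock.writeLock().unlock();
        }
    }

    /** Event path: shares the read lock and returns the deployments that saw the event. */
    List<String> sendEvent(String event) {
        lock.readLock().lock();
        try {
            return new ArrayList<>(deployments);
        } finally {
            lock.readLock().unlock();
        }
    }

    public static void main(String[] args) {
        AtomicManagementSketch sketch = new AtomicManagementSketch();
        sketch.deploy("v1");
        sketch.redeploy("v1", "v2");
        System.out.println(sketch.sendEvent("e1"));
    }
}
```

In this model a thread calling sendEvent during redeploy waits until the write lock is released, so it sees either the old deployment or the new one, never neither and never both.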

15.5. Obtaining Results Using `EPStatement`

A compiled module contains zero, one or multiple statements. You can attach callbacks (listeners, subscribers) to statements to receive results (also known as push, or the observer pattern). You can also iterate over a statement's current results (also known as poll).

Each statement is uniquely identified in the runtime by the combination of deployment id and statement name. The compiler or runtime always assigns a statement name if none was provided.

The EPStatement instance represents the statement. Your application receives statements when deploying a module by calling getStatements on EPDeployment.

Your application may also look up a statement by its deployment id and statement name using the getStatement method on EPDeploymentService.

15.5.1. Receiving Statement Results

For NEsper .NET also see Section I.14, “.NET API - Receiving Statement Results”.

Esper provides three choices for your application to receive statement results. Your application can use all three mechanisms alone or in any combination for each statement. The choices are:

Table 15.2. Choices For Receiving Statement Results

Name	Methods on EPStatement	Description
Listener Callbacks	addListener and removeListener	Your application provides implementations of the UpdateListener interface to the statement. Listeners receive EventBean instances containing statement results. The runtime continuously indicates results to all listeners.
Subscriber Object	setSubscriber	Requires setting the allowSubscriber option on the compiler. Your application provides a POJO (plain Java object) that exposes methods to receive statement results. The name of the method that a subscriber object provides to receive results is update, unless your call to setSubscriber provides another method name. The runtime continuously indicates results to the single subscriber. This is the fastest method to receive statement results, as the runtime delivers strongly-typed results directly to your application objects without the need for building an EventBean result set. There can be at most one subscriber object registered per statement. If you require more than one receiver, use listeners instead (or in addition). The subscriber object is bound to the statement with strongly-typed support, which ensures direct delivery of new events without type conversion. This optimization is possible because there can only be zero or one subscriber object per statement.
Pull API	safeIterator and iterator	Your application asks the statement for results and receives a set of events via java.util.Iterator<EventBean>. This is useful if your application does not need continuous indication of new results in real-time.

Tip

The runtime calls application-provided update listeners and subscribers for output. These commonly encapsulate the actions to take when there is output. This design decouples statements from actions and places actions outside of EPL. It allows actions to change independently from statements: A statement does not need to be updated when its associated action(s) change.

Although the code or script that takes action is not part of the EPL language, a few points are worth noting. Through EPL annotations you can attach information to EPL that applications can use to determine actions flexibly. The insert into clause can send results into a further stream, and input and output adapters or data flows can process output events from that stream. The data flow EPStatementSource operator can also be used to hook up actions declaratively. The DeploymentStateListener can inform your application of newly-deployed statements and of undeployed statements.

Your application may attach one or more listeners, zero or one single subscriber and in addition use the pull API on the same statement. There are no limitations to the use of iterator, subscriber or listener alone or in combination to receive statement results.

The best delivery performance can generally be achieved by attaching a subscriber and by not attaching listeners. The runtime is aware of the listeners and subscriber attached to a statement. The runtime uses this information internally to reduce statement overhead. For example, if your statement does not have listeners or a subscriber attached, the runtime does not need to continuously generate results for delivery.

If your application attaches both a subscriber and one or more listeners then the subscriber receives the result first before any of the listeners.

If your application attaches more than one listener then the UpdateListener listeners receive results in the order they were added to the statement. To change the order of delivery among listeners your application can add and remove listeners at runtime.

If you have configured outbound threading, a thread from the outbound thread pool delivers results to the subscriber and listeners instead of the processing or event-sending thread.

If outbound threading is turned on, we recommend turning off the runtime setting that preserves the order of events delivered to listeners, as described in Section 16.6.1.1, “Preserving the Order of Events Delivered to Listeners”. With outbound threading turned on, statement execution is not blocked for the configured time when a subscriber or listener takes too much time.

15.5.2. Setting a Subscriber Object

Note

The compiler option allowSubscriber must be set at compile-time.

A subscriber object is a direct binding of statement results to an object. The object receives statement results via method invocation. The subscriber class does not need to implement an interface or extend a superclass. Only one subscriber object may be set for a statement.

Subscriber objects have several advantages over listeners. First, they offer a substantial performance benefit: Statement results are delivered directly to your method(s) through Java virtual machine method calls, and there is no intermediate representation (EventBean). Second, as subscribers receive strongly-typed parameters, the subscriber code tends to be simpler.

This section describes the requirements for the methods provided by your subscriber class.

The runtime can deliver results to your subscriber in two ways:

Each event in the insert stream results in a method invocation, and each event in the remove stream results in further method invocations. This is termed row-by-row delivery.
A single method invocation that delivers all rows of the insert and remove stream. This is termed multi-row delivery.

15.5.2.1. Using the `EPStatement` Parameter

In the case that your subscriber object wishes to receive the EPStatement instance along with output data, please add EPStatement as the very first parameter of any of the delivery method footprints that are discussed next.

For example, your statement may be:

select count(*) from OrderEvent

Your subscriber class exposes the method:

public void update(EPStatement statement, long currentCount) {...}

15.5.2.2. Row-by-Row Delivery

Your subscriber class must provide a method by name update to receive insert stream events row-by-row. The number and types of parameters declared by the update method must match the number and types of columns as specified in the select clause, in the same order as in the select clause.

For example, if your statement is:

select orderId, price, count(*) from OrderEvent

Then your subscriber update method looks as follows:

public class MySubscriber {
  ...
  public void update(String orderId, double price, long count) {...}
  ...
}

Each method parameter declared by the update method must be assignable from the respective column type as listed in the select-clause, in the order selected. The assignability rules are:

Widening of types follows Java standards. For example, if your select clause selects an integer value, the method parameter for the same column can be typed int, long, float or double (or any equivalent boxed type).
Auto-boxing and unboxing follow Java standards. For example, if your select clause selects a java.lang.Integer value, the method parameter for the same column can be typed int. Note that if your select clause column may generate null values, an exception may occur at runtime when unboxing the null value.
Interfaces and super-classes are honored in the test for assignability. Therefore java.lang.Object can be used to accept any select clause column type.
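The assignability rules above can be illustrated with a small subscriber for the statement select orderId, price, count(*) from OrderEvent. This is only a sketch: the class names OrderRowSubscriber and OrderRowSubscriberDemo are hypothetical, and the direct method calls in main stand in for the runtime, which would invoke update after the object is registered via setSubscriber.

```java
import java.util.ArrayList;
import java.util.List;

// A plain subscriber class; no interface or superclass is required.
class OrderRowSubscriber {
    final List<String> rows = new ArrayList<>();

    // Parameters follow the select-clause columns in order: orderId, price, count(*).
    // Object accepts any column type (super-class rule), double accepts a double
    // or a narrower numeric column (widening), and the boxed Long can hold a
    // null value where a primitive long parameter could not.
    public void update(Object orderId, double price, Long count) {
        rows.add(orderId + "|" + price + "|" + count);
    }
}

class OrderRowSubscriberDemo {
    public static void main(String[] args) {
        OrderRowSubscriber subscriber = new OrderRowSubscriber();
        // Direct calls stand in for the runtime delivering two output rows.
        subscriber.update("order-1", 10.5, 1L);
        subscriber.update("order-2", 20.0, null);
        System.out.println(subscriber.rows);
    }
}
```

Choosing a boxed parameter type for any column that may be null avoids the unboxing exception mentioned in the list above.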

In the case that your subscriber class offers multiple update method footprints, the runtime selects the closest-matching footprint by comparing the output types and method parameter types. The runtime prefers the update method that is an exact match of types, followed by an update method that requires boxing or unboxing, followed by an update method that requires widening and finally any other allowable update method.

Within the above criteria, in the case that your subscriber class offers multiple update method footprints with same method parameter types, the runtime prefers the update method that has EPStatement as the first parameter.

15.5.2.2.1. Wildcards

If your select clause contains one or more wildcards (*), then the equivalent parameter type is the underlying event type of the stream selected from.

For example, your statement may be:

select *, count(*) from OrderEvent

Then your subscriber update method looks as follows:

public void update(OrderEvent orderEvent, long count) {...}

In a join, the wildcard expands to the underlying event type of each stream in the join in the order the streams occur in the from clause. An example statement for a join is:

select *, count(*) from OrderEvent order, OrderHistory hist

Then your subscriber update method should be:

public void update(OrderEvent orderEvent, OrderHistory orderHistory, long count) {...}

The stream wildcard syntax and the stream name itself can also be used:

select hist.*, order from OrderEvent order, OrderHistory hist

The matching update method is:

public void update(OrderHistory orderHistory, OrderEvent orderEvent) {...}

15.5.2.2.2. Row Delivery as Map and Object Array

Alternatively, your update method may simply choose to accept java.util.Map as a representation for each row. Each column in the select clause is then made an entry in the resulting Map. The Map keys are the column name if supplied, or the expression string itself for columns without a name.

The update method for Map delivery is:

public void update(Map row) {...}

The runtime also supports delivery of select clause columns as an object array. Each item in the object array represents a column in the select clause. The update method then looks as follows:

public void update(Object[] row) {...}
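As a rough sketch of Map delivery, the subscriber below reads two columns by name. It assumes a statement such as select orderId, count(*) as cnt from OrderEvent; the column alias cnt and the class names are made up for illustration. The parameter is written with generics, which erase to the plain Map footprint shown above, and main builds the row by hand in place of the runtime.

```java
import java.util.LinkedHashMap;
import java.util.Map;

class MapRowSubscriber {
    String lastOrderId;
    long lastCount;

    // One Map per output row, keyed by column name (assumed: "orderId" and "cnt").
    public void update(Map<String, Object> row) {
        lastOrderId = (String) row.get("orderId");
        Object count = row.get("cnt");
        // Guard against a missing or null column before converting.
        lastCount = (count instanceof Number) ? ((Number) count).longValue() : 0L;
    }
}

class MapRowSubscriberDemo {
    public static void main(String[] args) {
        Map<String, Object> row = new LinkedHashMap<>();
        row.put("orderId", "order-1");
        row.put("cnt", 3L);

        MapRowSubscriber subscriber = new MapRowSubscriber();
        subscriber.update(row); // stands in for the runtime's call
        System.out.println(subscriber.lastOrderId + " -> " + subscriber.lastCount);
    }
}
```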

15.5.2.2.3. Delivery of Remove Stream Events

Your subscriber receives remove stream events if it provides a method named updateRStream. The method must accept the same number and types of parameters as the update method (including EPStatement if present).

An example statement:

select orderId, count(*) from OrderEvent#time(20 sec) group by orderId

Then your subscriber update and updateRStream methods should be:

public void update(String orderId, long count) {...}
public void updateRStream(String orderId, long count) {...}

15.5.2.2.4. Delivery of Begin and End Indications

If your subscriber requires a notification for begin and end of event delivery, it can expose methods by name updateStart and updateEnd.

The updateStart method must take two integer parameters that indicate the number of events of the insert stream and remove stream to be delivered. The runtime invokes the updateStart method immediately prior to delivering events to the update and updateRStream methods.

The updateEnd method must take no parameters. The runtime invokes the updateEnd method immediately after delivering events to the update and updateRStream methods.

An example set of delivery methods:

// Called by the runtime before delivering events to update methods
public void updateStart(int insertStreamLength, int removeStreamLength) {...}

// To deliver insert stream events
public void update(String orderId, long count) {...}

// To deliver remove stream events
public void updateRStream(String orderId, long count) {...}

// Called by the runtime after delivering events
public void updateEnd() {...}
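One way to use the begin and end indications is to collect all rows of a single delivery into one batch. The sketch below assumes the four method footprints listed above; the class names and the call sequence in main are hypothetical and only imitate what the runtime would do. In particular, the relative order of update and updateRStream calls shown here is an illustration, not a statement about runtime behavior.

```java
import java.util.ArrayList;
import java.util.List;

class BatchTrackingSubscriber {
    private final List<String> current = new ArrayList<>();
    final List<List<String>> batches = new ArrayList<>();
    int announcedRows;

    // Invoked before any row of a delivery: reset the working batch.
    public void updateStart(int insertStreamLength, int removeStreamLength) {
        current.clear();
        announcedRows = insertStreamLength + removeStreamLength;
    }

    // Insert stream row.
    public void update(String orderId, long count) {
        current.add("+" + orderId + "=" + count);
    }

    // Remove stream row.
    public void updateRStream(String orderId, long count) {
        current.add("-" + orderId + "=" + count);
    }

    // Invoked after the last row of a delivery: publish a copy of the batch.
    public void updateEnd() {
        batches.add(new ArrayList<>(current));
    }
}

class BatchTrackingSubscriberDemo {
    public static void main(String[] args) {
        BatchTrackingSubscriber subscriber = new BatchTrackingSubscriber();
        // Hand-written call sequence imitating one delivery by the runtime.
        subscriber.updateStart(2, 1);
        subscriber.update("A", 2L);
        subscriber.update("B", 1L);
        subscriber.updateRStream("A", 1L);
        subscriber.updateEnd();
        System.out.println(subscriber.batches);
    }
}
```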

15.5.2.3. Multi-Row Delivery

In place of row-by-row delivery, your subscriber can receive all events in the insert and remove stream via a single method invocation. This is applicable when an EPL statement delivers multiple output rows for a given input event or advance of time, for example when multiple pattern matches occur for the same incoming event, when a join produces multiple output rows, or with output rate limiting.

The event delivery follows the scheme described earlier in Section 15.5.2.2.2, “Row Delivery as Map and Object Array”. The subscriber class must provide one of the following methods:

Table 15.3. Update Method for Multi-Row Delivery of Underlying Events

Method	Description
`update(Object[][] insertStream, Object[][] removeStream)`	The first dimension of each Object array is the event row, and the second dimension is the column matching the column order of the statement `select` clause
`update(Map[] insertStream, Map[] removeStream)`	Each map represents one event, and Map entries represent columns of the statement `select` clause
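The Object[][] footprint from the table might be used as in the following sketch. It assumes a statement such as select orderId, price from OrderEvent#time(30 sec), so that column 1 of each row is the price. The class names are hypothetical, and the null checks are a defensive assumption that either array may be absent when there is nothing to deliver for that stream.

```java
class MultiRowSubscriber {
    int insertRows;
    int removeRows;
    double insertedPriceTotal;

    // First dimension: output row. Second dimension: select-clause column.
    public void update(Object[][] insertStream, Object[][] removeStream) {
        if (insertStream != null) {
            for (Object[] row : insertStream) {
                insertRows++;
                insertedPriceTotal += ((Number) row[1]).doubleValue(); // column 1 = price
            }
        }
        if (removeStream != null) {
            removeRows += removeStream.length;
        }
    }
}

class MultiRowSubscriberDemo {
    public static void main(String[] args) {
        MultiRowSubscriber subscriber = new MultiRowSubscriber();
        Object[][] inserted = { {"order-1", 10.0}, {"order-2", 5.5} };
        subscriber.update(inserted, null); // stands in for the runtime's call
        System.out.println(subscriber.insertRows + " rows, total " + subscriber.insertedPriceTotal);
    }
}
```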

15.5.2.3.1. Wildcards

If your select clause contains a single wildcard (*) or wildcard stream selector, the subscriber object may also directly receive arrays of the underlying events. In this case, the subscriber class should provide a method update(Underlying[] insertStream, Underlying[] removeStream), such that Underlying represents the class of the underlying event.

For example, your statement may be:

select * from OrderEvent#time(30 sec)

Your subscriber class exposes the method:

public void update(OrderEvent[] insertStream, OrderEvent[] removeStream) {...}

15.5.2.4. No-Parameter Update Method

In the case that your subscriber object wishes to receive no data from a statement please follow the instructions here.

Your statement must select a single null value.

For example, your statement may be:

select null from OrderEvent(price > 100)

Your subscriber class exposes the method:

public void update() {...}

15.5.3. Adding Listeners

For NEsper .NET also see Section I.15, “.NET API - Adding Listeners”.

Your application can subscribe to updates posted by a statement via the addListener and removeListener methods on EPStatement. Your application must provide an implementation of the UpdateListener interface to the statement:

UpdateListener myListener = new MyUpdateListener();
countStmt.addListener(myListener);

Statements publish old data and new data to registered UpdateListener listeners. New data published by statements is the events representing the new values of derived data held by the statement. Old data published by statements consists of the events representing the prior values of derived data held by the statement.

Important

UpdateListener listeners receive multiple result rows in one invocation by the runtime: the new data and old data parameters to your listener are array parameters. For example, if your application uses one of the batch data windows, or your application creates a pattern that matches multiple times when a single event arrives, then the runtime indicates such multiple result rows in one invocation and your new data array carries two or more rows.

To indicate results the runtime invokes the following method on UpdateListener listeners: update(EventBean[] newEvents, EventBean[] oldEvents, EPStatement statement, EPRuntime runtime)

15.5.3.1. Subscription Snapshot and Atomic Delivery

The addListenerWithReplay method provided by EPStatement makes it possible to send a snapshot of current statement results to a listener when the listener is added.

When using the addListenerWithReplay method to register a listener, the listener receives current statement results as the first call to the update method of the listener, passing in the newEvents parameter the current statement results as an array of zero or more events. Subsequent calls to the update method of the listener deliver statement results as usual.

Current statement results are the events returned by the iterator or safeIterator methods.

Delivery is atomic: Events occurring during delivery of current results to the listener are guaranteed to be delivered in a separate call and not lost. The listener implementation should thus minimize long-running or blocking operations to reduce lock times held on statement-level resources.

15.5.4. Using Iterators

Subscribing to events posted by a statement follows a push model: the runtime pushes data to listeners when events are received that cause data to change or patterns to match. Alternatively, statements serve up data that your application can obtain via the safeIterator and iterator methods on EPStatement. This is called the pull API and is useful if your application is not interested in all new updates and only needs to poll, frequently or infrequently, for the latest data.

The safeIterator method on EPStatement returns a concurrency-safe iterator returning current statement results, even while concurrent threads may send events into the runtime for processing. The runtime employs a read-write lock per context partition and obtains a read lock for iteration. Thus safe iterator guarantees correct results even as events are being processed by other threads and other context partitions. The cost is that the iterator obtains and holds zero, one or multiple context partition locks for that statement that must be released via the close method on the SafeIterator instance.

The iterator method on EPStatement returns a concurrency-unsafe iterator. This iterator is only useful for applications that are single-threaded, or applications that themselves perform coordination between the iterating thread and the threads that send events into the runtime for processing. The advantage to this iterator is that it does not hold a lock.

When statements are used with contexts and context partitions, the APIs to identify, filter and select context partitions for statement iteration are described in Section 15.16, “Context Partition Selection”.

The next code snippet shows a short example of use of safe iterators:

EPStatement statement = epAdmin.createEPL("select avg(price) as avgPrice from MyTick");
// .. send events into the runtime
// then use the pull API...
SafeIterator<EventBean> safeIter = statement.safeIterator();
try {
  for (;safeIter.hasNext();) {
     // .. process event ..
     EventBean event = safeIter.next();
     System.out.println("avg:" + event.get("avgPrice"));
  }
}
finally {
  safeIter.close();	// Note: safe iterators must be closed
}
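The reason a safe iterator must be closed can be pictured with a plain Java read-write lock. The sketch below is not Esper's SafeIterator implementation; ReadLockedIterator and its demo class are hypothetical names, and a single ReentrantReadWriteLock stands in for the context partition lock described above. The iterator takes the read lock when created and releases it only in close, so a writer is blocked for as long as the iterator stays open.

```java
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;
import java.util.concurrent.locks.ReentrantReadWriteLock;

// Hypothetical illustration only; not the runtime's actual iterator.
class ReadLockedIterator<T> implements Iterator<T>, AutoCloseable {
    private final Iterator<T> delegate;
    private final ReentrantReadWriteLock lock;
    private boolean closed;

    ReadLockedIterator(List<T> results, ReentrantReadWriteLock lock) {
        this.lock = lock;
        lock.readLock().lock(); // held until close() is called
        this.delegate = results.iterator();
    }

    @Override
    public boolean hasNext() {
        return delegate.hasNext();
    }

    @Override
    public T next() {
        return delegate.next();
    }

    @Override
    public void close() {
        if (!closed) { // release the read lock exactly once
            closed = true;
            lock.readLock().unlock();
        }
    }
}

class ReadLockedIteratorDemo {
    public static void main(String[] args) {
        ReentrantReadWriteLock lock = new ReentrantReadWriteLock();
        List<Double> results = Arrays.asList(10.0, 12.5);

        try (ReadLockedIterator<Double> it = new ReadLockedIterator<>(results, lock)) {
            while (it.hasNext()) {
                System.out.println("avg:" + it.next());
            }
            // While the read lock is held, the write lock cannot be acquired.
            boolean acquired = lock.writeLock().tryLock();
            System.out.println("writer can proceed while open: " + acquired);
            if (acquired) {
                lock.writeLock().unlock();
            }
        }

        // After close() the write lock is available again.
        boolean acquired = lock.writeLock().tryLock();
        System.out.println("writer can proceed after close: " + acquired);
        if (acquired) {
            lock.writeLock().unlock();
        }
    }
}
```

The try-with-resources form works here only because the sketch class implements AutoCloseable. With the SafeIterator shown above, keep the explicit try and finally block.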

This is a short example of use of the regular iterator that is not safe for concurrent event processing:

double averagePrice = (Double) statement.iterator().next().get("avgPrice");

The safeIterator and iterator methods can be used to pull results out of all statements, including statements that join streams, contain aggregation functions, pattern statements, and statements that contain a where clause, group by clause, having clause or order by clause.

For statements without an order by clause, the iterator method returns events in the order maintained by the data window. For statements that contain an order by clause, the iterator method returns events in the order indicated by the order by clause.

Consider using the on-select clause and a named window if your application requires iterating over a partial result set or requires indexed access for fast iteration. Note that on-select requires that you send a trigger event, which may contain the key values for indexed access.

Esper places the following restrictions on the pull API and usage of the safeIterator and iterator methods:

In multithreaded applications, use the safeIterator method. Note: make sure your application closes the iterator via the close method when done, otherwise the iterated statement context partitions stay locked and event processing for statement context partitions does not resume.
In multithreaded applications, the iterator method does not hold any locks. The iterator returned by this method does not make any guarantees towards correctness of results and fail-behavior, if your application processes events into the runtime by multiple threads. Use the safeIterator method for concurrency-safe iteration instead.
Since the safeIterator and iterator methods return events to the application immediately, the iterator does not honor an output rate limiting clause, if present. That is, the iterator returns results as if there is no output-rate clause for the statement in statements without grouping or aggregation. For statements with grouping or aggregation, the iterator in combination with an output clause returns last output group and aggregation results. Use a separate statement and the insert into clause to control the output rate for iteration, if so required.
When iterating a statement that operates on an unbound stream (no data window declared), please note the following:
- When iterating a statement that groups and aggregates values from an unbound stream and that specifies output snapshot, the runtime retains groups and aggregations for output as iteration results or upon the output snapshot condition.
- When iterating a statement that groups and aggregates values from an unbound stream and that does not specify output snapshot, the runtime only retains the last aggregation values and the iterated result contains only the last updated group.
- When iterating a statement that operates on an unbound stream the iterator returns no rows. This behavior can be changed by specifying either the @IterableUnbound annotation or by changing the global view resources configuration.

15.5.5. Event and Event Type

An EventBean object represents a row (event) in your statement's result set. Each EventBean object has an associated EventType object providing event metadata.

An UpdateListener implementation receives one or more EventBean events with each invocation. Via the iterator method on EPStatement your application can poll or read data out of statements. Statement iterators also return EventBean instances.

Each statement provides the event type of the events it produces, available via the getEventType method on EPStatement.

15.5.5.1. Event Type Metadata

An EventType object encapsulates all the metadata about a certain type of events. As Esper supports an inheritance hierarchy for event types, it also provides information about super-types to an event type.

An EventType object provides the following information:

For each event property, it lists the property name and type as well as flags for indexed or mapped properties and whether a property is a fragment.
The direct and indirect super-types to the event type.
Value getters for property expressions.
Underlying class of the event representation.

For each property of an event type, there is an EventPropertyDescriptor object that describes the property. The EventPropertyDescriptor contains flags that indicate whether a property is an indexed (array) or a mapped property and whether access to property values require an integer index value (indexed properties only) or string key value (mapped properties only). The descriptor also contains a fragment flag that indicates whether a property value is available as a fragment.

The term fragment means an event property value that is itself an event, or a property value that can be represented as an event. The getFragmentType on EventType may be used to determine a fragment's event type in advance.

A fragment event type and thereby fragment events allow navigation over a statement's results even if the statement result contains nested events or a graph of events. There is no need to use the Java reflection API to navigate events, since fragments allow the querying of nested event properties or array values, including nested Java classes.

When using the Map or Object-array event representation, any named Map type or Object-array type nested within a Map or Object-array as a simple or array property is also available as a fragment. When using Java objects either directly or within Map or Object-array events, any object that is neither a primitive nor a boxed built-in type, that is not an enumeration, and that does not implement the Map interface is also available as a fragment.

The nested, indexed and mapped property syntax can be combined to a property expression that may query an event property graph. Most of the methods on the EventType interface allow a property expression to be passed.

Your application may use an EventType object to obtain special getter-objects. A getter-object is a fast accessor to a property value of an event of a given type. All getter objects implement the EventPropertyGetter interface. Getter-objects work only for events of the same type or sub-types as the EventType that provides the EventPropertyGetter. The performance section provides additional information and samples on using getter-objects.

15.5.5.2. Event Object

An event object is an EventBean that provides:

The property value for a property given a property name or property expression that may include nested, indexed or mapped properties in any combination.
The event type of the event.
Access to the underlying event object.
The EventBean fragment or array of EventBean fragments given a property name or property expression.

The getFragment method on EventBean and EventPropertyGetter returns the fragment EventBean or array of EventBean, if the property is itself an event or can be represented as an event. Your application may use EventPropertyDescriptor to determine which properties are also available as fragments.

The underlying event object of an EventBean can be obtained via the getUnderlying method. Please see Chapter 3, Event Representations for more information on different event representations.

From a threading perspective, it is safe to retain and query EventBean and EventType objects in multiple threads.

15.5.5.3. Query Example

Consider a statement that returns the symbol, count of events per symbol and average price per symbol for tick events. Our sample statement uses the event type: StockTickEvent. Assume that this event type was declared previously and exposes a symbol property of type String and a price property of type (Java primitive) double.

select symbol, avg(price) as avgprice, count(*) as mycount 
from StockTickEvent 
group by symbol

The next table summarizes the property names and types as posted by the statement above:

Table 15.4. Properties Offered by Sample Statement Aggregating Price

Name	Type	Description	Java code snippet
`symbol`	java.lang.String	Value of symbol event property	eventBean.get("symbol")
`avgprice`	java.lang.Double	Average price per symbol	eventBean.get("avgprice")
`mycount`	java.lang.Long	Number of events per symbol	eventBean.get("mycount")

A code snippet out of a possible UpdateListener implementation to this statement may look as below:

String symbol = (String) newEvents[0].get("symbol");
Double price = (Double) newEvents[0].get("avgprice");
Long count = (Long) newEvents[0].get("mycount");

The runtime supplies the boxed java.lang.Double and java.lang.Long types as property values rather than primitive Java types. This is because aggregated values can return a null value to indicate that no data is available for aggregation. Also, in a select statement that computes expressions, the underlying event objects to EventBean instances are either of type Object[] (object-array) or of type java.util.Map.
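Because an aggregated value may be null, it is safer to keep the boxed type and test it before unboxing. The sketch below uses a java.util.Map keyed by the property names of the sample statement as a stand-in for the values returned by eventBean.get. The class names and the fallback value are hypothetical.

```java
import java.util.HashMap;
import java.util.Map;

class AveragePriceReader {
    // Returns the average price, or the fallback when no data was aggregated.
    static double averageOr(Map<String, Object> row, double fallback) {
        Double average = (Double) row.get("avgprice"); // keep the boxed type
        // Unboxing happens only on the non-null branch.
        return average != null ? average : fallback;
    }
}

class AveragePriceReaderDemo {
    public static void main(String[] args) {
        Map<String, Object> row = new HashMap<>();
        row.put("symbol", "IBM");
        row.put("avgprice", null); // aggregation without data
        System.out.println(AveragePriceReader.averageOr(row, -1.0));

        row.put("avgprice", 42.5);
        System.out.println(AveragePriceReader.averageOr(row, -1.0));
    }
}
```

Assigning the null value directly to a primitive double would instead fail with a NullPointerException at the point of unboxing.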

Use statement.getEventType().getUnderlyingType() to inspect the underlying type for all events delivered to listeners. Whether the runtime delivers Map or Object-array events to listeners can be specified as follows. If the statement provides the @EventRepresentation(objectarray) annotation the runtime delivers the output events as object array. If the statement provides the @EventRepresentation(map) annotation the runtime delivers output events as a Map. If neither annotation is provided, the runtime delivers the configured default event representation as discussed in Section 16.4.8.1, “Default Event Representation”.

Consider the next statement that specifies a wildcard selecting the same type of event:

select * from StockTickEvent where price > 100

The property names and types provided by an EventBean query result row, as posted by the statement above are as follows:

Table 15.5. Properties Offered by Sample Wildcard-Select Statement

Name	Type	Description	Java code snippet
`symbol`	java.lang.String	Value of symbol event property	eventBean.get("symbol")
`price`	double	Value of price event property	eventBean.get("price")

As an alternative to querying individual event properties via the get methods, the getUnderlying method on EventBean returns the underlying object representing the statement result. In the sample statement that features a wildcard-select, the underlying event object is of type org.sample.StockTickEvent:

StockTickEvent tick = (StockTickEvent) newEvents[0].getUnderlying();

15.5.5.4. Pattern Example

Composite events are events that aggregate one or more other events. Composite events are typically created by the runtime for statements that join two event streams, and for event patterns in which the causal events are retained and reported in a composite event. The example below shows such an event pattern.

// Look for a pattern where BEvent follows AEvent
select * from pattern [a=AEvent -> b=BEvent]

// Example listener code
public class MyUpdateListener implements UpdateListener {
  public void update(EventBean[] newData, EventBean[] oldData, EPStatement statement, EPRuntime runtime) {
    System.out.println("a event=" + newData[0].get("a"));
    System.out.println("b event=" + newData[0].get("b"));
  }
}

Note that the update method can receive multiple events at once as it accepts an array of EventBean instances. For example, a time batch window may post multiple events to listeners representing a batch of events received during a given time period.

Pattern statements can also produce multiple events delivered to update listeners in one invocation. The pattern statement below, for instance, delivers an event for each A event that was not followed by a B event with the same id property within 60 seconds of the A event. The runtime may deliver all matching A events as an array of events in a single invocation of the update method of each listener to the statement:

select * from pattern[every a=A -> (timer:interval(60 sec) and not B(id=a.id))]

A code snippet out of a possible UpdateListener implementation to this statement that retrieves the events as fragments may look as below:

EventBean a = (EventBean) newEvents[0].getFragment("a");
// ... or using a nested property expression to get a value out of A event...
double value = (Double) newEvents[0].get("a.value");

Some pattern objects return an array of events. An example is the unbound repeat operator. Here is a sample pattern that collects all A events until a B event arrives:

select * from pattern [a=A until b=B]

A possible code to retrieve different fragments or property values:

EventBean[] a = (EventBean[]) newEvents[0].getFragment("a");
// ... or using a nested property expression to get a value out of A event...
double value = (Double) newEvents[0].get("a[0].value");

15.5.6. Interrogating Annotations

As discussed in Section 5.2.7, “Annotation”, an EPL annotation is an addition made to statement information. The API and examples to interrogate annotations are described here.

You may use the getAnnotations method of EPStatement to obtain annotations specified for a statement. When compiling an EPL expression to an EPStatementObjectModel statement object model, you may also query, change or add annotations.

The following example code demonstrates iterating over an EPStatement statement's annotations and retrieving values:

String exampleEPL = "@Tag(name='direct-output', value='sink 1') select * from RootEvent";
Configuration configuration = new Configuration();
configuration.getCommon().addEventType("RootEvent", Collections.emptyMap()); // add an event type without properties
CompilerArguments compilerArguments = new CompilerArguments(configuration);
EPCompiled compiled = EPCompilerProvider.getCompiler().compile(exampleEPL, compilerArguments);

EPDeployment deployment = runtime.getDeploymentService().deploy(compiled);
EPStatement stmt = deployment.getStatements()[0];
for (Annotation annotation : stmt.getAnnotations()) {
  if (annotation instanceof Tag) {
    Tag tag = (Tag) annotation;
    System.out.println("Tag name " + tag.name() + " value " + tag.value());
  }
}

The output of the sample code shown above is Tag name direct-output value sink 1.

15.6. Processing Events and Time Using `EPEventService`

The EPEventService interface is used to send events and advance time. Obtain the event service from a runtime by calling getEventService on EPRuntime.

This section focuses on processing events. For more information on controlling time using the event service please skip forward to Section 15.9, “Controlling Time-Keeping”.

Your application invokes any of the sendEventType methods listed below and must provide an event type name along with the actual event object:

Table 15.6. Send-Event Methods

Method	Description
sendEventBean(Object event, String eventTypeName)	Call when the event is a Bean-style event. The event type name should be associated to a class event representation.
sendEventMap(Map<String, Object> event, String eventTypeName)	Call when the event is a map. The event type name should be associated to a map event representation.
sendEventObjectArray(Object[] event, String eventTypeName)	Call when the event is an object-array. The event type name should be associated to an object-array event representation.
sendEventXMLDOM(Node node, String eventTypeName)	Call when the event is a DOM-Node. The event type name should be associated to an XML event representation.
sendEventAvro(Object avroGenericDataDotRecord, String avroEventTypeName)	Call when the event is an Avro object. The event type name should be associated to an Avro event representation.

Chapter 3, Event Representations explains the types of event representations.

The below sample code assumes that the event type name MarketDataBean refers to a class event representation that matches the class MarketDataBean:

EPRuntime runtime = EPRuntimeProvider.getDefaultRuntime();
EPEventService eventService = runtime.getEventService();

// Send an example event containing stock market data
eventService.sendEventBean(new MarketDataBean("IBM", 75.0), "MarketDataBean");

Tip

Events, in theoretical terms, are observations of a state change that occurred in the past. Since you cannot change an event that happened in the past, events are best modelled as immutable objects.

Caution

The runtime relies on events that are sent into the runtime not changing their state. Typically, applications create a new event object for every new event, to represent that new event. Applications should not modify an existing event that was sent into the runtime.
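As an illustration of the tip and caution above, the following is a minimal sketch of an immutable event class. The property names symbol and price are assumptions made for this example; the original text only shows the constructor call new MarketDataBean("IBM", 75.0).

```java
import java.util.Objects;

/**
 * Hypothetical immutable event class. The property names "symbol" and
 * "price" are assumptions for this sketch and are not taken from the manual.
 */
final class MarketDataBean {
    private final String symbol;
    private final double price;

    public MarketDataBean(String symbol, double price) {
        this.symbol = Objects.requireNonNull(symbol, "symbol");
        this.price = price;
    }

    // JavaBean-style getters expose the event properties; there are no setters.
    public String getSymbol() {
        return symbol;
    }

    public double getPrice() {
        return price;
    }

    // A price change is represented by a new event object, not by mutating this one.
    public MarketDataBean withPrice(double newPrice) {
        return new MarketDataBean(symbol, newPrice);
    }
}
```

With a class of this shape, an application that observes a new price creates a fresh object (for example via withPrice) and sends that object, leaving the previously sent event untouched.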

Important

Another important group of methods are the routeEventType methods. These methods are designed for use by UpdateListener and subscriber implementations, as well as by runtime extensions that need to send events into a runtime. Using them avoids the possibility of a stack overflow due to nested calls to sendEvent and ensures correct processing of the current and routed event. Note that if outbound-threading is enabled, listeners and subscribers should use sendEvent and not routeEvent.

15.6.1. Event Sender

The EventSender interface processes event objects that are of a known type. This facility can reduce the overhead of event object reflection and type lookup as an event sender is always associated to a single concrete event type.

Use the method getEventSender(String eventTypeName) to obtain an event sender for processing events of the named type:

EventSender sender = runtime.getEventService().getEventSender("MyEvent");
sender.sendEvent(myEvent);

For events backed by a Java class (JavaBean events), the event sender ensures that the event object equals the underlying class, or implements or extends the underlying class for the given event type name.

For events backed by a java.util.Map (Map events), the event sender does not perform any checking other than checking that the event object implements Map.

For events backed by an Object[] (Object-array events), the event sender does not perform any checking other than checking that the event object is an Object[]. The array elements must be in the exact same order of properties as declared and array length must always be at least the number of properties declared.

For events backed by an Apache Avro GenericData.Record, the event sender does not perform any checking other than checking that the event object is a GenericData.Record. The schema associated to the record should match the event type's Avro schema.

For events backed by an org.w3c.dom.Node (XML DOM events), the event sender checks that the root element name equals the root element name for the event type.
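Because object-array events are not checked beyond their type, keeping array elements in the declared property order is the application's responsibility. The sketch below shows one possible way to do that; the class ObjectArrayRowBuilder and its methods are hypothetical helpers and not part of the Esper API.

```java
import java.util.Map;

/**
 * Hypothetical helper that lays out property values in a fixed, declared
 * order, as object-array events require. Not part of the Esper API.
 */
final class ObjectArrayRowBuilder {
    private final String[] propertyNames;

    // The names must be listed in the same order as the event type declares them.
    ObjectArrayRowBuilder(String... propertyNames) {
        this.propertyNames = propertyNames.clone();
    }

    // Places each value at the index of its property; missing properties stay null.
    Object[] toRow(Map<String, Object> values) {
        Object[] row = new Object[propertyNames.length];
        for (int i = 0; i < propertyNames.length; i++) {
            row[i] = values.get(propertyNames[i]);
        }
        return row;
    }
}
```

An array built this way could then be handed to sendEventObjectArray or to an event sender for the object-array event type.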

15.6.2. Receiving Unmatched Events

Your application can register an implementation of the UnmatchedListener interface with the event service by calling the setUnmatchedListener method to receive events that were not matched by any statement.

Events that can be unmatched are all events that your application sends into the runtime via one of the sendEvent or routeEvent methods, or that have been generated via an insert into clause.

For an event to become unmatched by any statement, the event must not match any statement's event stream filter criteria. Note that the EPL where clause or having clause are not considered part of the filter criteria for a stream, as explained by example below.

In the following statement a MyEvent event with a 'quantity' property value of 5 or less does not match this statement's event stream filter criteria. The runtime delivers such an event to the registered UnmatchedListener instance provided no other statement matches on the event:

select * from MyEvent(quantity > 5)

For patterns, if no pattern sub-expression is active for an event type, an event of that type also counts as unmatched in regards to the pattern statement.

15.7. Execute Fire-and-Forget Queries Using EPFireAndForgetService

The EPFireAndForgetService interface offers methods to execute fire-and-forget queries. Obtain the fire-and-forget service from a runtime by calling getFireAndForgetService on EPRuntime.

As your application may not require streaming results and may not know each statement in advance, the fire-and-forget query facility provides for ad-hoc on-demand execution of an EPL query.

Fire-and-forget queries are not continuous in nature: The fire-and-forget query runtime executes the query once and returns all result rows to the application. Fire-and-forget query execution is very lightweight as the runtime performs no statement deployment and the query leaves no traces within the runtime.

Esper provides the facility to explicitly index named windows and tables to speed up fire-and-forget queries and statements. Please consult Section 6.9, “Explicitly Indexing Named Windows and Tables” for more information.

When named windows and tables are used with contexts and context partitions, the APIs to identify, filter and select context partitions for fire-and-forget queries can be found in Section 15.16, “Context Partition Selection”.

There are three ways to run fire-and-forget queries:

Use the executeQuery method to execute a given fire-and-forget query exactly once, see Section 15.7.1, “Fire-and-forget Query Single Execution”.
Use the prepareQuery method to prepare a given fire-and-forget query such that the same query can be executed multiple times, see Section 15.7.2, “Fire-and-forget Query Prepared Unparameterized Execution”.
Use the prepareQueryWithParameters method to prepare a given fire-and-forget query that may have substitution parameters such that the same query can be parameterized and executed multiple times without repeated parsing, see Section 15.7.3, “Fire-and-forget Query Prepared Parameterized Execution”.

If your application must execute the same fire-and-forget query multiple times with different parameters use prepareQueryWithParameters.

If your application must execute the same fire-and-forget query multiple times without parameters use either prepareQuery or prepareQueryWithParameters and specify no substitution parameters.

By using any of the prepare... methods the runtime can load the byte code for the query once and reuse the byte code and thereby speed up repeated execution.

The following limitations apply:

A fire-and-forget query only evaluates against the named windows and tables that your application creates. Fire-and-forget queries may not specify any other streams or application event types.
The following clauses are not allowed in fire-and-forget EPL queries: insert into and output.
Data windows and patterns are not allowed to appear in fire-and-forget queries.
Fire-and-forget EPL may not perform subqueries.
The previous and prior functions may not be used.

15.7.1. Fire-and-forget Query Single Execution

Use the executeQuery method for executing a fire-and-forget query once. For repeated execution, please consider any of the prepare... methods instead.

The next program listing runs a fire-and-forget query against a named window MyNamedWindow and prints a column of each row result of the query (this sample uses the compiler runtime-path):

String query = "select * from MyNamedWindow";
CompilerArguments compilerArguments = new CompilerArguments();
compilerArguments.getPath().add(runtime.getRuntimePath());
EPCompiled compiled = EPCompilerProvider.getCompiler().compileQuery(query, compilerArguments);

EPFireAndForgetQueryResult result = runtime.getFireAndForgetService().executeQuery(compiled);
for (EventBean row : result.getArray()) {
  System.out.println("name=" + row.get("name"));
}

To execute a fire-and-forget query against a table, put the table name into the from-clause instead.

15.7.2. Fire-and-forget Query Prepared Unparameterized Execution

Prepared fire-and-forget queries are designed for repeated execution and may perform better than the dynamic single-execution method if running the same query multiple times. For use with parameter placeholders please see Section 15.7.3, “Fire-and-forget Query Prepared Parameterized Execution”.

The next code snippet demonstrates prepared fire-and-forget queries without parameter placeholder:

String query = "select * from MyNamedWindow where orderId = '123'";
CompilerArguments compilerArguments = new CompilerArguments();
compilerArguments.getPath().add(runtime.getRuntimePath());
EPCompiled compiled = EPCompilerProvider.getCompiler().compileQuery(query, compilerArguments);
						
EPFireAndForgetPreparedQuery prepared = runtime.getFireAndForgetService().prepareQuery(compiled);
EPFireAndForgetQueryResult result = prepared.execute();

// ...later on execute once more ...
prepared.execute();	// execute a second time

15.7.3. Fire-and-forget Query Prepared Parameterized Execution

Please see the compiler documentation for specifying substitution parameters.

All substitution parameters must be replaced by actual values before a fire-and-forget query with substitution parameters can be executed. Substitution parameters can be replaced with an actual value using the setObject method for each index or name. Substitution parameters can be set to new values and the query executed more than once.

While the setObject method allows substitution parameters to assume any actual value including application Java objects or enumeration values, the application must provide the correct type of substitution parameter that matches the type that was specified, if any, and the requirements of the expression the parameter resides in.

The next program listing runs a prepared and parameterized fire-and-forget query against a named window MyNamedWindow (this example does not assign names to substitution parameters):

String query = "select * from MyNamedWindow where orderId = ?::string";
CompilerArguments compilerArguments = new CompilerArguments();
compilerArguments.getPath().add(runtime.getRuntimePath());
EPCompiled compiled = EPCompilerProvider.getCompiler().compileQuery(query, compilerArguments);

EPFireAndForgetPreparedQueryParameterized prepared = runtime.getFireAndForgetService().prepareQueryWithParameters(compiled);

// Set the required parameter values before each execution
prepared.setObject(1, "123");
EPFireAndForgetQueryResult result = runtime.getFireAndForgetService().executeQuery(prepared);

// ...execute a second time with new parameter values...
prepared.setObject(1, "456");
result = runtime.getFireAndForgetService().executeQuery(prepared);

This second example uses the in operator and has multiple parameters:

String query = "select * from MyNamedWindow where orderId in (?::string[]) and price > ?::double";
CompilerArguments compilerArguments = new CompilerArguments();
compilerArguments.getPath().add(runtime.getRuntimePath());
EPCompiled compiled = EPCompilerProvider.getCompiler().compileQuery(query, compilerArguments);

EPFireAndForgetPreparedQueryParameterized prepared = runtime.getFireAndForgetService().prepareQueryWithParameters(compiled);
prepared.setObject(1, new String[] {"123", "456"});
prepared.setObject(2, 1000.0);

15.8. Runtime Threading and Concurrency

For NEsper .NET also see Section I.16, “.NET API - Runtime Threading and Concurrency”.

The runtime is designed from the ground up to operate as a component to multi-threaded, highly-concurrent applications that require efficient use of Java VM resources. In addition, multi-threaded execution requires guarantees in predictability of results and deterministic processing. This section discusses these concerns in detail.

In Esper, a runtime instance is a unit of separation. Applications can obtain and discard (initialize) one or more runtime instances within the same Java VM and can provide the same or different configurations to each instance. A runtime instance shares resources between statements by means of named windows, tables and variables.

Applications can use Esper APIs to concurrently, by multiple threads of execution, perform such functions as deploying modules, or sending events into the runtime for processing. Applications can use application-managed threads or thread pools or any set of same or different threads of execution with any of the public runtime APIs. There are no restrictions towards threading other than those noted in specific sections of this document.

The runtime does not prescribe a specific threading model. Applications using Esper retain full control over threading, allowing a runtime to be easily embedded and used as a component or library in your favorite Java container or process.

In the default configuration it is up to the application code to use multiple threads for processing events by the runtime, if so desired. All event processing takes place within your application thread call stack. The exception is timer-based processing if your runtime relies on the internal timer (default). If your application relies on external timer events instead of the internal timer then there need not be any runtime-managed internal threads.

The fact that event processing can take place within your application thread's call stack makes developing applications with the Esper runtime easier: Any common Java integrated development environment (IDE) can host a compiler and runtime instance. This allows developers to easily set up test cases, debug through listener code and inspect input or output events, or trace their call stack.

In the default configuration, each runtime maintains a single timer thread (internal timer) providing for time or schedule-based processing within the runtime. The default resolution at which the internal timer operates is 100 milliseconds. The internal timer thread can be disabled and applications can instead advance time to perform timer or scheduled processing at the resolution required by an application.

A runtime performs minimal locking to enable high levels of concurrency. A runtime locks on the combination of query and context partition to protect context partition resources. For example, two statements with three partitions each have a total of six locks. For stateless EPL select-statements the runtime does not use a context-partition lock and operates lock-free for the context partition. For stateful statements, the maximum (theoretical) degree of parallelism is 2^31-1 (2,147,483,647) parallel threads working to process a single statement under a hash segmented context.
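The lock count in the example above can be pictured with a small model that keeps one lock per combination of statement and context partition. This is only an illustration of the arithmetic, written with hypothetical names; it does not reflect how the runtime actually stores or manages its locks.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.locks.ReentrantLock;

/**
 * Hypothetical model: one lock per (statement, context partition) pair.
 * Illustrative only; this is not the runtime's implementation.
 */
final class PartitionLockModel {
    private final Map<String, ReentrantLock> locks = new ConcurrentHashMap<>();

    // Returns the lock for the pair, creating it on first use.
    ReentrantLock lockFor(String statementName, int partitionId) {
        String key = statementName + "#" + partitionId;
        return locks.computeIfAbsent(key, k -> new ReentrantLock());
    }

    // Number of distinct (statement, partition) pairs seen so far.
    int lockCount() {
        return locks.size();
    }
}
```

In this model, threads working on different partitions of the same statement obtain different locks and therefore do not contend with each other.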

For named windows and tables, on-select, on-merge, on-update and on-delete all execute under the same lock as the partition of the named window or table. Any insert into produces a new event for the work queue and does not lock the target of the insert-into.

You may turn off context partition locking runtime-wide (also read the caution notice) as described in Section 16.6.10.3, “Disable Locking”. You may disable context partition locking for a given statement by providing the @NoLock annotation as part of your EPL. Note, Esper provides the @NoLock annotation for the purpose of identifying locking overhead, or when your application is single-threaded, or when using an external mechanism for concurrency control or for example with virtual data windows or plug-in data windows to allow customizing concurrency for a given statement or named window. Using this annotation may have unpredictable results unless your application is taking concurrency under consideration.

For a runtime to produce predictable results from the viewpoint of listeners to statements, a runtime by default ensures that it dispatches statement result events to listeners in the order in which a statement produced result events. For applications that require the highest possible concurrency and do not require a predictable order of delivery of events to listeners, this feature can be turned off via configuration, see Section 16.6.1.1, “Preserving the Order of Events Delivered to Listeners”. For example, assume thread T1 processes an event applied to statement S producing output event O1. Assume thread T2 processes another event applied to statement S and produces output event O2. The runtime employs a configurable latch system to ensure that listeners to statement S receive and may complete processing of O1 before receiving O2. When using outbound threading (advanced threading options) or changing the configuration this guarantee is weakened or removed.

In multithreaded environments, when one or more statements make result events available via the insert into clause to further statements, the runtime preserves the order of events inserted into the generated insert-into stream, allowing statements that consume other statements' events to behave deterministically. This feature can also be turned off via configuration, see Section 16.6.1.2, “Preserving the Order of Events for Insert-Into Streams”. For example, assume thread T1 processes an event applied to statement S and thread T2 processes another event applied to statement S. Assume statement S inserts into stream ST. T1 produces an output event O1 for processing by consumers of ST and T2 produces an output event O2 for processing by consumers of ST. The runtime employs a configurable latch system such that O1 is processed before O2 by consumers of ST. When using route execution threading (advanced threading options) or changing the configuration this guarantee is weakened or removed.

We generally recommend that listener implementations block minimally or do not block at all. By implementing listener code as non-blocking code, execution threads can often achieve higher levels of concurrency.

We recommend that, when using a single listener or subscriber instance to receive output from multiple statements, the listener or subscriber code is multithread-safe. If your application has shared state between listener or subscriber instances then such shared state should be thread-safe.
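The following sketch shows one way to keep state that is shared by listener callbacks thread-safe without a coarse lock. The class SharedOutputCounter and its methods are hypothetical; a real listener would call record from its update callback.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.LongAdder;

/**
 * Hypothetical shared state for a listener instance that is attached to
 * several statements. Names are illustrative and not part of the Esper API.
 */
final class SharedOutputCounter {
    private final Map<String, LongAdder> countsByStatement = new ConcurrentHashMap<>();

    // Safe to call from many threads; relies on concurrent data structures
    // instead of one synchronized block around all shared state.
    void record(String statementName, int newEventCount) {
        countsByStatement.computeIfAbsent(statementName, k -> new LongAdder()).add(newEventCount);
    }

    long count(String statementName) {
        LongAdder adder = countsByStatement.get(statementName);
        return adder == null ? 0L : adder.sum();
    }

    // Demonstration helper: several threads record concurrently; returns the total.
    static long runConcurrently(int threads, int callsPerThread) {
        SharedOutputCounter counter = new SharedOutputCounter();
        Thread[] workers = new Thread[threads];
        for (int i = 0; i < threads; i++) {
            workers[i] = new Thread(() -> {
                for (int j = 0; j < callsPerThread; j++) {
                    counter.record("stmt", 1);
                }
            });
            workers[i].start();
        }
        for (Thread worker : workers) {
            try {
                worker.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                throw new IllegalStateException(e);
            }
        }
        return counter.count("stmt");
    }
}
```

The callback does a small amount of work and returns, which keeps the thread that delivers results free to continue processing.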

15.8.1. Advanced Threading

In the default configuration the same application thread that invokes any of the sendEvent methods will process the event fully and also deliver output events to listeners and subscribers. By default the single internal timer thread based on system time performs time-based processing and delivery of time-based results.

This default configuration reduces the processing overhead associated with thread context switching, is lightweight and fast and works well in many environments such as J2EE, server or client. Latency and throughput requirements are largely use case dependent, and Esper provides runtime-level facilities for controlling concurrency that are described next.

Inbound Threading queues all incoming events: A pool of runtime-managed threads performs the event processing. The application thread that sends an event via any of the sendEvent methods returns without blocking.

Outbound Threading queues events for delivery to listeners and subscribers, such that slow or blocking listeners or subscribers do not block event processing.

Timer Execution Threading means time-based event processing is performed by a pool of runtime-managed threads. With this option the internal timer thread (or external timer event) serves only as a metronome, providing units-of-work to the runtime-managed threads in the timer execution pool, pushing threading to the level of each statement for time-based execution.

Route Execution Threading means that the thread sending in an event via any of the sendEvent methods (or the inbound threading pooled thread if inbound threading is enabled) only identifies and pre-processes an event, and a pool of runtime-managed threads handles the actual processing of the event for each statement, pushing threading to the level of each statement for event-arrival-based execution.

The runtime starts runtime-managed threads as daemon threads when the runtime instance is first obtained. The runtime stops runtime-managed threads when the runtime instance is destroyed via the destroy method. When the runtime is initialized via the initialize method the existing runtime-managed threads are stopped and new threads are created. When shutting down your application, use the destroy method to stop runtime-managed threads.

Note that the options discussed herein may introduce additional processing overhead into your system, as each option involves work queue management and thread context switching.

Note

If your use cases require ordered processing of events or do not tolerate disorder, the threading options described herein are not the right choice.

To enforce a processing order for events that share given criteria, your application must enforce that order itself. Esper does not enforce order of processing if you enable inbound or route threading. Your application code could, for example, utilize a thread per group of criteria keys, a latch per criteria key, or a queue per criteria key, or use Java's completion service, all depending on your ordering requirements.
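One of the options named above, a thread per group of criteria keys, can be sketched with plain JDK executors. The class KeyedSerialDispatcher is a hypothetical application-side helper, not an Esper facility: it always routes the same key to the same single-threaded executor so that work for one key runs in submission order.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

/**
 * Hypothetical helper that preserves order per criteria key by mapping each
 * key to one of several single-threaded executors ("lanes").
 */
final class KeyedSerialDispatcher {
    private final List<ExecutorService> lanes = new ArrayList<>();

    KeyedSerialDispatcher(int laneCount) {
        for (int i = 0; i < laneCount; i++) {
            lanes.add(Executors.newSingleThreadExecutor());
        }
    }

    // Tasks for the same key always land on the same lane and run in submission order.
    void dispatch(Object key, Runnable task) {
        int lane = Math.floorMod(key.hashCode(), lanes.size());
        lanes.get(lane).execute(task);
    }

    // Stops accepting work and waits for queued tasks; returns false on timeout or interrupt.
    boolean shutdownAndWait(long timeoutMillis) {
        for (ExecutorService lane : lanes) {
            lane.shutdown();
        }
        try {
            for (ExecutorService lane : lanes) {
                if (!lane.awaitTermination(timeoutMillis, TimeUnit.MILLISECONDS)) {
                    return false;
                }
            }
            return true;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
    }
}
```

In an application, the task passed to dispatch would typically be a call that sends one event into the runtime, with the key taken from the event property that defines the required ordering. Events with different keys may still interleave in any order.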

If your use cases require loss-less processing of events, wherein the threading options mean that events are held in an in-memory queue, the threading options described herein may not be the right choice.

Care should be taken to consider arrival rates and queue depth. Threading options utilize unbounded queues or capacity-bound queues with blocking-put, depending on your configuration, and may therefore introduce an overload or blocking situation to your application. You may use the service provider interface as outlined below to manage queue sizes, if required, and to help tune the runtime to your application needs. Consider throttling down the event send rate when the API (see below) indicates that events are getting queued.

All threading options are on the level of a runtime. If you require different threading behavior for certain statements then consider using multiple runtimes, consider using the routeEvent method or consider using application threads instead.

Please consult Section 16.6.1, “Runtime Settings Related to Concurrency and Threading” for instructions on how to configure threading options. Threading options take effect at runtime initialization time.

15.8.1.1. Inbound Threading

With inbound threading a runtime places inbound events in a queue for processing by one or more runtime-managed threads other than the delivering application threads.

The delivering application thread uses one of the sendEventType methods on EPEventService to deliver events or may also use the sendEvent method on a EventSender. The runtime receives the event and places the event into a queue, allowing the delivering thread to continue and not block while the event is being processed and results are delivered.

Events that are sent into the runtime via one of the routeEvent methods are not placed into queue but processed by the same thread invoking the routeEvent operation.

15.8.1.2. Outbound Threading

With outbound threading a runtime places outbound events in a queue for delivery by one or more runtime-managed threads other than the processing thread originating the result.

With outbound threading your listener or subscriber class receives statement results from one of the runtime-managed threads in the outbound pool of threads. This is useful when you expect your listener or subscriber code to perform significantly blocking operations and you do not want to hold up event processing.

Note

If outbound-threading is enabled, listeners and subscribers that send events back into the runtime should use the sendEvent method and not the routeEvent method.

15.8.1.3. Timer Execution Threading

With timer execution threading a runtime places time-based work units into a queue for processing by one or more runtime-managed threads other than the internal timer thread or the application thread that sends an external timer event.

Using timer execution threading the internal timer thread (or thread delivering an external timer event) serves to evaluate which time-based work units must be processed. A pool of runtime-managed threads performs the actual processing of time-based work units and thereby offloads the work from the internal timer thread (or thread delivering an external timer event).

Enable this option as a tuning parameter when your statements utilize time-based patterns or data windows. Timer execution threading is fine grained and works on the level of a time-based schedule in combination with a statement.

15.8.1.4. Route Execution Threading

With route execution threading a runtime identifies event-processing work units based on the event and statement combination. It places such work units into a queue for processing by one or more runtime-managed threads other than the thread that originated the event.

While inbound threading works on the level of an event, route execution threading is fine grained and works on the level of an event in combination with a statement.

15.8.1.5. Threading Service Provider Interface

The service-provider interface EPRuntimeSPI is an extension API that allows you to manage runtime-level queues and thread pools (extension APIs are subject to change between release versions).

The following code snippet shows how to obtain the BlockingQueue<Runnable> and the ThreadPoolExecutor for managing the queue and thread pool responsible for inbound threading:

EPRuntimeSPI spi = (EPRuntimeSPI) runtime;
int queueSize = spi.getThreadingService().getInboundQueue().size();
ThreadPoolExecutor threadpool = spi.getThreadingService().getInboundThreadPool();
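A queue size obtained this way can drive the send-rate throttling suggested earlier in this section. The sketch below is one possible approach under stated assumptions: QueueDepthThrottle is a hypothetical application class, and the depth is supplied through an IntSupplier so that the example does not depend on the extension API.

```java
import java.util.function.IntSupplier;

/**
 * Hypothetical sender-side throttle. The depth supplier is an assumption of
 * this sketch; an application might back it with the inbound queue size.
 */
final class QueueDepthThrottle {
    private final IntSupplier queueDepth;
    private final int highWaterMark;
    private final long pauseMillis;

    QueueDepthThrottle(IntSupplier queueDepth, int highWaterMark, long pauseMillis) {
        this.queueDepth = queueDepth;
        this.highWaterMark = highWaterMark;
        this.pauseMillis = pauseMillis;
    }

    /**
     * Call before sending an event. Pauses while the reported depth is at or
     * above the high-water mark, at most maxPauses times, and returns the
     * number of pauses taken.
     */
    int awaitCapacity(int maxPauses) {
        int pauses = 0;
        while (pauses < maxPauses && queueDepth.getAsInt() >= highWaterMark) {
            try {
                Thread.sleep(pauseMillis);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt(); // preserve the interrupt and stop waiting
                break;
            }
            pauses++;
        }
        return pauses;
    }
}
```

The maxPauses bound keeps a sender from waiting indefinitely when the queue does not drain; what to do in that case (drop, buffer or fail) is an application decision.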

15.8.2. Processing Order

15.8.2.1. Competing Statements

This section discusses the order in which N competing statements that all react to the same arriving event execute.

The runtime, by default, does not guarantee to execute competing statements in any particular order unless using @Priority. We therefore recommend that an application does not rely on the order of execution of statements by the runtime, since that best shields the behavior of an application from changes in the order that statements may get created by your application or by threading configurations that your application may change at will.

If your application requires a defined order of execution of competing statements, use the @Priority EPL syntax to make the order of execution between statements well-defined (requires that you set the prioritized-execution configuration setting). The @Drop annotation can make a statement preempt all lower-priority statements, which then are not executed for any matching events.

15.8.2.2. Competing Events in a Work Queue

This section discusses the order of event evaluation when multiple events must be processed, for example when multiple statements use insert-into to generate further events upon arrival of an event.

The runtime processes an arriving event completely before indicating output events to listeners and subscribers, and before considering output events generated by insert-into or routed events inserted by listeners or subscribers.

For example, assume three statements: (1) select * from MyEvent, (2) insert into ABCStream select * from MyEvent, and (3) select * from ABCStream. When a MyEvent event arrives, the listeners to statements (1) and (2) execute first (default threading model). Listeners to statement (3), which receive the inserted-into stream events, are always executed after delivery of the triggering event.

Among all events generated by insert-into of statements and the events routed into the runtime via the routeEvent method, all events that insert-into a named window are processed first in the order generated. All other events are processed thereafter in the order they were generated.
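The ordering rule above can be modelled with two queues, one for events inserted into a named window and one for everything else. The class below is a simplified, hypothetical model for illustration only; events are plain strings and the class does not represent the runtime's actual work queue implementation.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

/**
 * Simplified model of the ordering rule: events destined for a named window
 * are processed first, all other generated events afterwards, and each group
 * keeps the order in which its events were generated.
 */
final class WorkQueueModel {
    private final Deque<String> namedWindowInserts = new ArrayDeque<>();
    private final Deque<String> otherEvents = new ArrayDeque<>();

    void addNamedWindowInsert(String event) {
        namedWindowInserts.addLast(event);
    }

    void addOther(String event) {
        otherEvents.addLast(event);
    }

    // Returns the events in the order in which this model would process them.
    List<String> drain() {
        List<String> order = new ArrayList<>();
        while (!namedWindowInserts.isEmpty() || !otherEvents.isEmpty()) {
            Deque<String> next = namedWindowInserts.isEmpty() ? otherEvents : namedWindowInserts;
            order.add(next.pollFirst());
        }
        return order;
    }
}
```

For instance, if a routed event is generated before an event that inserts into a named window, the model still processes the named window insert first.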

When enabling timer or route execution threading as explained under advanced threading options, the runtime does not make any guarantee as to the processing order, except that it will prioritize events inserted into a named window.

15.9. Controlling Time-Keeping

There are two modes for a runtime to keep track of time: the internal timer based on JVM system time (the default), and externally-controlled time (also known as event time), which gives your application full control over the concept of time within a runtime.

By default the internal timer provides time and evaluates schedules. External clocking i.e. event time can be used to supply time ticks to the runtime instead. The latter is useful for when events themselves provide the time to advance. External clocking also helps in testing time-based event sequences or for synchronizing the runtime with an external time source.

The internal timer relies on the java.util.concurrent.ScheduledThreadPoolExecutor class for time tick events. The next section describes timer resolution for the internal timer, which is set to 100 milliseconds by default and is configurable via the threading options. When using externally-controlled time the timer resolution is in your control.

To disable the internal timer and use externally-provided time instead, there are two options. The first option is to use the configuration API at runtime initialization time. The second option toggles on and off the internal timer at runtime, via special timer control events that are sent into the runtime like any other event.

If using a timer execution thread pool as discussed above, the internal timer or external time event provides the schedule evaluation but does not actually perform the time-based processing. The time-based processing is performed by the threads in the timer execution thread pool.

Tip

The runtime treats external time and internal (system) time the same way internally, so the runtime behaves the same whether using the external or the internal timer.

This code snippet shows the use of the configuration API to disable the internal timer and thereby turn on externally-provided time (see the Configuration section for configuration via XML file):

Configuration config = new Configuration();
config.getRuntime().getThreading().setInternalTimerEnabled(false);
EPRuntime runtime = EPRuntimeProvider.getDefaultRuntime(config);

After disabling the internal timer, it is wise to set a defined time so that any statements created thereafter start relative to the time defined. Use the advanceTime method to indicate current time to the runtime and to move time forward for the runtime (a.k.a application-time model).

This code snippet obtains the current time and advances time:

long timeInMillis = System.currentTimeMillis();
runtime.getEventService().advanceTime(timeInMillis);

To enable or disable the internal timer by API call use the clockExternal and clockInternal methods of EPEventService.

The next code snippet demonstrates toggling to external time:

EPRuntime runtime = EPRuntimeProvider.getDefaultRuntime();
EPEventService eventService = runtime.getEventService();
// switch to external clocking
eventService.clockExternal();

The advanceTime method moves the time forward. All aspects of runtime current time related to statements and patterns are driven by the time that your application advances to.

The next example sequence of instructions sets time to zero, then creates a statement, then moves time forward to 1 second later and then to 6 seconds later:

// Set start time at zero.
runtime.getEventService().advanceTime(0);

// deploy a module here
// sample EPL: select current_timestamp() as ct from pattern[every timer:interval(1 minute)]
runtime.getDeploymentService().deploy(compiled); // compiled is a module you compiled earlier

// move time forward 1 second
runtime.getEventService().advanceTime(1000);

// move time forward another 5 seconds, to 6 seconds after start
runtime.getEventService().advanceTime(6000);

When advancing time your application should make sure values are ascending. That is, each time value should be either the same value or a larger value than the prior value provided.
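An application that derives time from event timestamps may want a small guard so that late or out-of-order timestamps never move time backwards. The sketch below shows one possible guard; AscendingTimeGuard is a hypothetical helper, and the LongConsumer it wraps stands in for whatever call your application uses to advance time.

```java
import java.util.function.LongConsumer;

/**
 * Hypothetical guard that forwards only strictly later time values to the
 * wrapped consumer. Skipping an equal value is harmless because time would
 * not move anyway.
 */
final class AscendingTimeGuard {
    private final LongConsumer advanceTime;
    private long lastTime = Long.MIN_VALUE;

    AscendingTimeGuard(LongConsumer advanceTime) {
        this.advanceTime = advanceTime;
    }

    // Returns true if time moved forward, false if the value was not later than the last one.
    synchronized boolean advanceTo(long timeInMillis) {
        if (timeInMillis <= lastTime) {
            return false;
        }
        lastTime = timeInMillis;
        advanceTime.accept(timeInMillis);
        return true;
    }
}
```

With event-time processing, the application would call advanceTo with each event's timestamp before sending the event, so that an event carrying an older timestamp is still processed but does not change the current time.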

Your application may use the getNextScheduledTime method in EPEventService to determine the earliest time a schedule for any statement requires evaluation.

The following code snippet sets the current time, creates a statement and prints the next scheduled time, which is 1 minute later than the current time:

// set start time to the current time.
runtime.getEventService().advanceTime(System.currentTimeMillis());

// deploy a module
// sample EPL: select current_timestamp() as ct from pattern[every timer:interval(1 minute)]
runtime.getDeploymentService().deploy(compiled); // compiled is a module you compiled earlier

// print next schedule time
System.out.println("Next schedule at " + new Date(runtime.getEventService().getNextScheduledTime()));

15.9.1. Controlling Time Using Time Span Events

The advanceTime method allows your application to advance runtime time to a given point in time. In addition, the getNextScheduledTime method in EPEventService returns the next scheduled time according to started statements. You would typically use advanceTime to advance time at a relatively high resolution i.e. milliseconds or microseconds.

To advance time for a span of time without individual calls to advanceTime the API provides the method advanceTimeSpan. The advanceTimeSpan method can accept a resolution parameter.

If your application provides the target end time of a time span to the advanceTimeSpan method and does not provide a resolution, the runtime advances time up to the target time by stepping through all relevant times according to started statements.

If your application provides the target end time of a time span and in addition a long-typed resolution to the advanceTimeSpan method the runtime advances time up to the target time by incrementing time according to the resolution (regardless of next scheduled time according to started statements).

Consider the following example:

// Set start time to Jan.1, 2010, 00:00 am for this example
SimpleDateFormat format = new SimpleDateFormat("yyyy MM dd HH:mm:ss SSS");
Date startTime = format.parse("2010 01 01 00:00:00 000");
runtime.getEventService().advanceTime(startTime.getTime());

// deploy a module
// sample EPL: select current_timestamp() as ct from pattern[every timer:interval(1 minute)]
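EPDeployment deployment = runtime.getDeploymentService().deploy(compiled); // compiled is a module you compiled earlier
EPStatement stmt = deployment.getStatements()[0]; // the single statement of the sample module
stmt.addListener(...);	// add a listener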

// Advance time to 10 minutes after start time
runtime.getEventService().advanceTimeSpan(startTime.getTime() + 10*60*1000);

The above example advances time to 10 minutes after the start time using the advanceTimeSpan method. As the example does not pass a resolution, the runtime advances time according to statement schedules. Upon calling the advanceTimeSpan method the listener sees 10 invocations for minute 1 to minute 10.

To advance time according to a given resolution, you may provide the resolution as shown below:

// Advance time to 10 minutes after start time at 100 msec resolution
runtime.getEventService().advanceTimeSpan(startTime.getTime() + 10*60*1000, 100);
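
The difference between the two stepping modes can be modeled in plain Java. The sketch below is a simplified model under stated assumptions, not the runtime's implementation: TimeSpanModel and its methods are hypothetical names, and the schedule is reduced to a single repeating interval such as the 1-minute timer of the sample EPL.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical, simplified model of time-span stepping. */
final class TimeSpanModel {

    /** Times visited when stepping from start to target in fixed increments. */
    static List<Long> stepsWithResolution(long start, long target, long resolution) {
        if (resolution <= 0) {
            throw new IllegalArgumentException("resolution must be positive");
        }
        List<Long> visited = new ArrayList<>();
        long current = start;
        while (current < target) {
            current = Math.min(current + resolution, target); // never overshoot the target
            visited.add(current);
        }
        return visited;
    }

    /** Times visited when stepping only to the fire times of one repeating interval. */
    static List<Long> stepsBySchedule(long start, long target, long interval) {
        if (interval <= 0) {
            throw new IllegalArgumentException("interval must be positive");
        }
        List<Long> visited = new ArrayList<>();
        for (long next = start + interval; next <= target; next += interval) {
            visited.add(next);
        }
        return visited;
    }
}
```

For a 10-minute span in milliseconds, stepsBySchedule(0, 600000, 60000) yields 10 times, matching the 10 listener invocations described above, while stepsWithResolution(0, 600000, 100) yields 6000 times.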

15.9.2. Time Resolution and Time Unit

Time can have a resolution of either milliseconds or microseconds.

The default time resolution is milliseconds. To configure the runtime for microsecond resolution, please see Section 16.4.13.1, “Time Unit”.

Table 15.7. Time Resolution

	Millisecond	Microsecond
Smallest unit for advancing time	1 millisecond	1 microsecond
Equivalent `java.util.concurrent.TimeUnit`	TimeUnit.MILLISECONDS	TimeUnit.MICROSECONDS
Default?	Default	Requires configuration change, see Section 16.4.13.1, “Time Unit”
Long-type runtime time represents	Milliseconds since Epoch	Microseconds since Epoch
Example: the date Tue, 01 Jan 1980 00:00:00 GMT	315532800000	315532800000000
Support for Internal System Time	Yes	No, requires external time (aka. event time) via `advanceTimeSpan` or `advanceTime`

A few notes on usage of microsecond time unit for time resolution:

The runtime automatically computes time periods into microseconds. For example 1 minute 2 seconds is 62000000 microseconds (62 * 1000000).
The runtime automatically computes time-in-second parameters into microseconds. For example 5.02 seconds is 5020000 microseconds.
The runtime automatically computes ISO schedules, crontabs and hints related to runtime time into microseconds.
The CurrentTimeSpanEvent or CurrentTimeEvent events must provide microsecond values.
Date-time methods with long-type input values assume microsecond values.
Date-time methods or other functions that take millisecond parameters or produce millisecond values still consume/produce millisecond values, such as the date-time method toMillisec.
The internal timer must be disabled (setInternalTimerEnabled(false)) and TimerControlEvent.ClockType.CLOCK_INTERNAL cannot be used.
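
The conversions mentioned in the notes above can be reproduced with the JDK's TimeUnit class. This is a minimal sketch for checking values in application code; MicrosecondTime is a hypothetical helper name and the runtime performs its own conversions internally.

```java
import java.util.concurrent.TimeUnit;

/** Hypothetical helper for converting application values to microseconds. */
final class MicrosecondTime {

    /** Converts a time period given in minutes and seconds to microseconds. */
    static long periodToMicros(long minutes, long seconds) {
        return TimeUnit.MINUTES.toMicros(minutes) + TimeUnit.SECONDS.toMicros(seconds);
    }

    /** Converts fractional seconds to microseconds, rounding to the nearest microsecond. */
    static long secondsToMicros(double seconds) {
        return Math.round(seconds * 1_000_000d);
    }

    /** Converts milliseconds since Epoch to microseconds since Epoch. */
    static long epochMillisToMicros(long epochMillis) {
        return TimeUnit.MILLISECONDS.toMicros(epochMillis);
    }
}
```

Here periodToMicros(1, 2) returns 62000000, secondsToMicros(5.02) returns 5020000, and epochMillisToMicros(315532800000L) returns 315532800000000, the values given in the notes and in Table 15.7.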

15.9.3. Internal Timer Based on JVM System Time

By default the internal timer is enabled and tracks VM system time. For many use cases your application may want to use event time or external time instead, as discussed above.

The internal timer thread, by default, uses the call System.currentTimeMillis() to obtain system time. Please see the JIRA issue ESPER-191 Support nano/microsecond resolution for more information on Java system time-call performance, accuracy and drift.

The internal timer thread can be configured to use nanosecond time as returned by System.nanoTime(). If configured for nanosecond time, the runtime computes an offset of the nanosecond ticks to wall clock time upon startup to present back an accurate millisecond wall clock time. Please see Section 16.6.6, “Runtime Settings Related to Time Source” to configure the internal timer thread to use System.nanoTime().
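
The idea of an offset between nanosecond ticks and wall clock time can be sketched as follows. NanoWallClock is a hypothetical class for illustration only; it is not part of the Esper API and the runtime's own computation may differ.

```java
/** Hypothetical sketch: derives wall clock milliseconds from System.nanoTime(). */
final class NanoWallClock {
    // Wall clock time in nanoseconds minus the nanoTime tick value, fixed at construction.
    private final long offsetNanos;

    NanoWallClock() {
        this.offsetNanos = System.currentTimeMillis() * 1_000_000L - System.nanoTime();
    }

    /** Current wall clock time in milliseconds, computed from nanosecond ticks plus the offset. */
    long currentTimeMillis() {
        return (System.nanoTime() + offsetNanos) / 1_000_000L;
    }
}
```

Because the offset is computed only once, a clock of this kind does not follow later adjustments of the system clock, which is one source of the drift mentioned above.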

The internal timer is based on java.util.concurrent.ScheduledThreadPoolExecutor, which generally provides high-accuracy VM time (java.util.Timer does not support high-accuracy VM time).
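
For readers unfamiliar with ScheduledThreadPoolExecutor, the following minimal sketch shows a fixed-rate timer thread that delivers system time to a callback. It illustrates the JDK class only: TickingTimer is a hypothetical name, the tick period is chosen by the caller, and the code is not the runtime's internal timer.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ScheduledThreadPoolExecutor;
import java.util.concurrent.TimeUnit;
import java.util.function.LongConsumer;

/** Hypothetical sketch of a fixed-rate timer built on ScheduledThreadPoolExecutor. */
final class TickingTimer {

    /**
     * Delivers System.currentTimeMillis() to the callback at a fixed rate until the
     * requested number of ticks was delivered. Returns true if all ticks arrived in time.
     */
    static boolean runTicks(int ticks, long periodMillis, LongConsumer callback) {
        ScheduledThreadPoolExecutor executor = new ScheduledThreadPoolExecutor(1);
        CountDownLatch remaining = new CountDownLatch(ticks);
        try {
            executor.scheduleAtFixedRate(() -> {
                if (remaining.getCount() > 0) {
                    callback.accept(System.currentTimeMillis());
                    remaining.countDown();
                }
            }, 0, periodMillis, TimeUnit.MILLISECONDS);
            // Wait for the expected duration plus a generous safety margin.
            return remaining.await(ticks * periodMillis + 5_000, TimeUnit.MILLISECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt(); // preserve the interrupt status
            return false;
        } finally {
            executor.shutdownNow(); // stop the timer thread
        }
    }
}
```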

15.10. Exception Handling

You may register one or more exception handlers for the runtime to invoke in the case it encounters an exception processing a continuously-executing statement. By default and without exception handlers the runtime cancels execution of the current statement that encountered the exception, logs the exception and continues to the next statement, if any. The configuration is described in Section 16.6.11, “Runtime Settings Related to Exception Handling”.

If your application registers exception handlers as part of runtime configuration, the runtime invokes the exception handlers in the order they are registered passing relevant exception information such as statement name, expression and the exception itself.

Exception handlers receive any statement unchecked exception such as internal exceptions or exceptions thrown by plug-in aggregation functions or plug-in data windows. The runtime does not provide to exception handlers any exceptions thrown by static method invocations for function calls, method invocations in joins, methods on variables and event classes and listeners or subscriber exceptions.

An exception handler can itself throw a runtime exception to cancel execution of the current event against any further statements.

Note

Exceptions are meant to indicate an actual unexpected problem.

We do not recommend explicitly throwing exceptions for the purpose of flow control, preempting execution or other normal situations.

The runtime does not guarantee that throwing an exception has no other side effect and the runtime may not roll back changes that are already made to state.

For fire-and-forget queries the API indicates any exception directly back to the caller without the exception handlers being invoked, as exception handlers apply to statements only. The same applies to any API calls other than sendEvent and the EventSender methods.

As the configuration section describes, your application registers one or more classes that implement the ExceptionHandlerFactory interface in the runtime configuration. Upon runtime initialization the runtime obtains a factory instance from the class name that then provides the exception handler instance. The exception handler class must implement the ExceptionHandler interface.

15.11. Condition Handling

You may register one or more condition handlers for the runtime to invoke in the case it encounters certain conditions, as outlined below, when executing a statement. By default and without condition handlers the runtime logs the condition at informational level and continues processing. The configuration is described in Section 16.6.12, “Runtime Settings Related to Condition Handling”.

If your application registers condition handlers as part of runtime configuration, the runtime invokes the condition handlers in the order they are registered passing relevant condition information such as statement name, expression and the condition information itself.

Currently the only conditions indicated by this facility are raised by the pattern followed-by operator, see Section 7.5.8.1, “Limiting Sub-Expression Count” and see Section 7.5.8.2, “Limiting Runtime-Wide Sub-Expression Count”.

A condition handler may not itself throw a runtime exception or return any value.

As the configuration section describes, your application registers one or more classes that implement the ConditionHandlerFactory interface in the runtime configuration. Upon runtime initialization the runtime obtains a factory instance from the class name that then provides the condition handler instance. The condition handler class must implement the ConditionHandler interface.

15.12. Runtime and Statement Metrics Reporting

The runtime can report key processing metrics through the JMX platform mbean server by setting a single configuration flag described in Section 16.6.7, “Runtime Settings Related to JMX Metrics”. For additional detailed reporting and metrics events, please read on.

Metrics reporting is a feature that allows an application to receive ongoing reports about key runtime-level and statement-level metrics. Examples are the number of incoming events, the CPU time and wall time taken by statement executions or the number of output events per statement.

Metrics reporting is, by default, disabled. To enable reporting, please follow the steps as outlined in Section 16.6.8, “Runtime Settings Related to Metrics Reporting”. Metrics reporting must be enabled at runtime initialization time. Reporting intervals can be controlled at runtime via the EPMetricsService interface available from the runtime API.

Your application can receive metrics at configurable intervals via a statement. A metric datapoint is simply a well-defined event. The events are RuntimeMetric and StatementMetric, and the Java classes representing the events can be found in the client API in package com.espertech.esper.common.client.metric.

Since metric events are processed by the runtime the same as application events, your EPL may use any construct on such events. For example, your application may select, filter, aggregate properties, sort or insert into a stream, named window or table all metric events the same as application events.

This example statement selects all runtime metric events:

select * from RuntimeMetric

The next statement selects all statement metric events:

select * from StatementMetric

Make sure to have metrics reporting enabled since only then do listeners or subscribers to a statement such as above receive metric events.

The runtime provides metric events after the configured interval of time has passed. By default, only started statements that have activity within an interval (in the form of event or timer processing) are reported upon.

The default configuration performs the publishing of metric events in an Esper daemon thread under the control of the runtime instance. Metrics reporting honors externally-supplied time, if using external timer events.

Via runtime configuration options provided by EPMetricsService, your application may enable and disable metrics reporting globally, provided that metrics reporting was enabled at initialization time. Your application may also enable and disable metrics reporting for individual statements by statement name.

Statement groups are a configuration feature that allows assigning reporting intervals to statements. Statement groups are described further in Section 16.6.8, “Runtime Settings Related to Metrics Reporting”. Statement groups cannot be added or removed at runtime.

The following limitations apply:

If your Java VM version does not report current thread CPU time (most JVMs do), then CPU time is reported as zero (use ManagementFactory.getThreadMXBean().isCurrentThreadCpuTimeSupported() to determine if your JVM supports this feature).
Note: In some JVMs the accuracy of the CPU time returned is very low (in the order of 10 milliseconds off), which can impact the usefulness of the CPU metrics returned. As an alternative, consider measuring CPU time in your application thread, external to the runtime, after sending a number of events in the same thread.
Your Java VM may not provide high resolution time via System.nanoTime. In such a case wall time may be inaccurate and imprecise.
CPU time and wall time have nanosecond precision but not necessarily nanosecond accuracy; please check with your Java VM provider.
There is a performance cost to collecting and reporting metrics.
Not all statements may report metrics: The runtime performs certain runtime optimizations sharing resources between similar statements, thereby not reporting on certain statements.
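
The alternative mentioned in the note above, measuring CPU time in the application thread, can be sketched with the JDK management API. CpuTimeProbe is a hypothetical helper name; the task passed in would typically send a number of events to the runtime in the calling thread.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadMXBean;

/** Hypothetical helper that measures CPU time consumed by the current thread. */
final class CpuTimeProbe {

    /** Returns CPU nanoseconds used by the current thread while running the task, or 0 if unavailable. */
    static long measureCpuNanos(Runnable task) {
        ThreadMXBean threads = ManagementFactory.getThreadMXBean();
        boolean available = threads.isCurrentThreadCpuTimeSupported()
                && threads.isThreadCpuTimeEnabled();
        if (!available) {
            task.run();
            return 0L; // the JVM cannot report per-thread CPU time
        }
        long before = threads.getCurrentThreadCpuTime();
        task.run();
        long after = threads.getCurrentThreadCpuTime();
        return Math.max(0L, after - before);
    }
}
```

Measuring across many events in one call reduces the impact of a coarse CPU clock on the result.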

15.12.1. Runtime Metrics

Runtime metrics are properties of RuntimeMetric events:

Table 15.8. RuntimeMetric Properties

Name	Description
runtimeURI	The URI of the runtime.
timestamp	The current runtime time.
inputCount	Cumulative number of input events since runtime initialization time. Input events are defined as events sent in via application threads as well as `insert into` events.
inputCountDelta	Number of input events since last reporting period.
scheduleDepth	Number of outstanding schedules.

15.12.2. Statement Metrics

Statement metrics are properties of StatementMetric. The properties are:

Table 15.9. StatementMetric Properties

Name	Description
runtimeURI	The URI of the runtime.
timestamp	The current runtime time.
statementName	Statement name.
cpuTime	Statement processing CPU time (system and user) in nanoseconds (if available by Java VM, obtained from `ThreadMXBean.getCurrentThreadCpuTime`).
wallTime	Statement processing wall time in nanoseconds (based on `System.nanoTime`).
numInput	Number of input events to the statement.
numOutputIStream	Number of insert stream rows output to listeners or the subscriber, if any.
numOutputRStream	Number of remove stream rows output to listeners or the subscriber, if any.

The totals reported are cumulative relative to the last metric report.

15.13. Monitoring and JMX

Enterprise Edition has a library for measuring and reporting memory use for a runtime.

The runtime can report key processing metrics through the JMX platform mbean server by setting a single configuration flag described in Section 16.6.7, “Runtime Settings Related to JMX Metrics”.

Runtime and statement-level metrics reporting is described in Section 15.12, “Runtime and Statement Metrics Reporting”.

The easiest way to see thread contentions is by using VisualVM when Esper is under load and looking at the Threads tab. In the worst case you will see a lot of red color in VisualVM. The red line in VisualVM shows the threads that are either in a monitor region or waiting in an entry set for the monitor. The monitor is the mechanism that Java uses to support synchronization. When a statement is stateful the runtime manages the state using a monitor (lock) per context partition.

A JVM profiler can be handy to see how much CPU is spent in Esper by the sendEvent method.

The jconsole tool can provide information on the JVM heap. If memory gets tight, performance can drop significantly.

15.14. Event Rendering to XML and JSON

The EPRenderEventService interface offers methods to render events as XML or JSON. Obtain the service from a runtime by calling getRenderEventService on EPRuntime.

Your application may use the built-in XML and JSON formatters to render output events into a readable textual format, such as for integration or debugging purposes. This section introduces the utility classes in the client util package for rendering events to strings. Further API information can be found in the JavaDocs.

For repeated rendering of events of the same event type or subtypes, it is recommended to obtain a JSONEventRenderer or XMLEventRenderer instance and use the render method provided by the interface. This allows the renderer implementations to cache event type metadata for fast rendering.

This example shows how to obtain a renderer for repeated rendering of events of the same type, assuming that statement is an instance of EPStatement:

JSONEventRenderer jsonRenderer = runtime.getRenderEventService().getJSONRenderer(statement.getEventType());

Assuming that event is an instance of EventBean, this code snippet renders an event into the JSON format:

String jsonEventText = jsonRenderer.render("MyEvent", event);

The XML renderer works the same:

XMLEventRenderer xmlRenderer = runtime.getRenderEventService().getXMLRenderer(statement.getEventType());

...and...

String xmlEventText = xmlRenderer.render("MyEvent", event);

If the event type is not known in advance or if your application does not want to obtain a renderer instance per event type for fast rendering, your application can use one of the following methods to render an event to an XML or JSON textual format:

String json = runtime.getRenderEventService().renderJSON(event);
String xml = runtime.getRenderEventService().renderXML(event);

Use the JSONRenderingOptions or XMLRenderingOptions classes to control how events are rendered. To render specific event properties using a custom event property renderer, specify an EventPropertyRenderer as part of the options that renders event property values to strings. Please see the JavaDoc documentation for more information.

15.14.1. JSON Event Rendering Conventions and Options

The JSON renderer produces JSON text according to the standard documented at http://www.json.org.

The renderer formats simple properties as well as nested properties and indexed properties according to the JSON string encoding, array encoding and nested object encoding requirements.

The renderer renders indexed properties, but it does not render indexed properties that require an index. For example, if your event representation is backed by POJO objects and your getter method is getValue(int index), the indexed property values are not part of the JSON text. This is because the implementation has no way to determine how many index keys there are. A workaround is to provide a method such as Object[] getValue() instead.

The same is true for mapped properties that the renderer also renders. If a property requires a Map key for access, i.e. your getter method is getValue(String key), such property values are not part of the result text as there is no way for the implementation to determine the key set.
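
The distinction can be illustrated with plain Java reflection. The sketch below is an assumption-laden illustration and not the renderer's implementation: GetterProbe and Sample are hypothetical classes, and the probe simply lists the properties whose getters can be called without supplying an index or key.

```java
import java.lang.reflect.Method;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

/** Hypothetical probe that finds properties readable without an index or key. */
final class GetterProbe {

    /** Sample POJO with one array getter, one index getter and one key getter. */
    public static final class Sample {
        public Object[] getItems() { return new Object[] {"a", "b"}; }
        public Object getValue(int index) { return "v" + index; }
        public Object getLookup(String key) { return key; }
    }

    /** Returns the sorted names of properties whose getter takes no parameter. */
    static List<String> readableWithoutArgument(Class<?> type) {
        List<String> names = new ArrayList<>();
        for (Method method : type.getMethods()) {
            String name = method.getName();
            boolean getter = name.startsWith("get") && name.length() > 3 && !name.equals("getClass");
            if (getter && method.getParameterCount() == 0) {
                // Derive the property name, for example getItems becomes items.
                names.add(Character.toLowerCase(name.charAt(3)) + name.substring(4));
            }
        }
        Collections.sort(names);
        return names;
    }
}
```

For the Sample class only items is found: the array getter exposes all values at once, whereas getValue(int index) and getLookup(String key) give no way to discover which indexes or keys exist.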

15.14.2. XML Event Rendering Conventions and Options

The XML renderer produces well-formed XML text according to the XML standard.

The renderer can be configured to format simple properties as attributes or as elements. Nested properties and indexed properties are always represented as XML sub-elements to the root or parent element.

The root element name provided to the XML renderer must be the element name of the root in the XML document and may include namespace instructions.

The renderer renders indexed properties, but it does not render indexed properties that require an index. For example, if your event representation is backed by POJO objects and your getter method is getValue(int index), the indexed property values are not part of the XML text. This is because the implementation has no way to determine how many index keys there are. A workaround is to provide a method such as Object[] getValue() instead.

15.15. Plug-In Loader

A plug-in loader is for general use with input adapters, output adapters, EPL code deployment or any other task that can benefit from being part of an Esper configuration file and that follows the runtime lifecycle.

A plug-in loader implements the com.espertech.esper.runtime.client.plugin.PluginLoader interface and can be listed in the configuration.

Each configured plug-in loader follows the runtime instance lifecycle: When a runtime instance initializes, it instantiates each PluginLoader implementation class listed in the configuration. The runtime then invokes the lifecycle methods of the PluginLoader implementation class before and after the runtime is fully initialized and before a runtime instance is destroyed.

Declare a plug-in loader in your configuration XML as follows:

...
  <plugin-loader name="MyLoader" class-name="org.mypackage.MyLoader">
    <init-arg name="property1" value="val1"/>
  </plugin-loader>
...

Alternatively, add the plug-in loader via the configuration API:

Configuration config = new Configuration();
Properties props = new Properties();
props.put("property1", "value1");
config.getRuntime().addPluginLoader("MyLoader", "org.mypackage.MyLoader", props);

Implement the init method of your PluginLoader implementation to receive initialization parameters. The runtime invokes this method before the runtime is fully initialized, therefore your implementation should not yet rely on the runtime instance within the method body:

public class MyPluginLoader implements PluginLoader {
  public void init(String loaderName, Properties properties, EPRuntime runtime) {
     // save the configuration for later, perform checking
  }
  ...

The runtime calls the postInitialize method once the runtime has completed initialization, to indicate that the runtime is ready for traffic.

public void postInitialize() {
  // Start the actual interaction with external feeds or the runtime here
}
...

The runtime calls the destroy method once the runtime is destroyed or initialized for a second time.

public void destroy() {
  // Destroy resources allocated as the runtime instance is being destroyed
}

To access the plug-in at runtime, the getContext method provides access under the name plugin-loader/name:

runtime.getContext().getEnvironment().get("plugin-loader/MyLoader");

15.16. Context Partition Selection

This section discusses how to select context partitions. Contexts are discussed in Chapter 4, Context and Context Partitions and the reasons for context partition selection are introduced in Section 4.9, “Operations on Specific Context Partitions”.

The section is only relevant when you declare a context. It applies to all different types of hash, partitioned, category, overlapping or other temporal contexts. The section uses a category context for the purpose of illustration. The API discussed herein is general and handles all different types of contexts including nested contexts.

Consider a category context that separates bank transactions into small, medium and large:

// declare category context
create context TxnCategoryContext 
  group by amount < 100 as small, 
  group by amount between 100 and 1000 as medium, 
  group by amount > 1000 as large from BankTxn

// retain 1 minute of events of each category separately
context TxnCategoryContext select * from BankTxn#time(1 minute)

In order for your application to iterate one or more specific categories it is necessary to identify which category, i.e. which context partition, to iterate. Similarly for fire-and-forget queries, to execute fire-and-forget queries against one or more specific categories, it is necessary to identify which context partition to execute the fire-and-forget query against.

Your application may iterate one or more specific context partitions using either the iterator or safeIterator method of EPStatement by providing an implementation of the ContextPartitionSelector interface.

For example, assume your application must obtain all bank transactions for small amounts. It may use the API to identify the category and iterate the associated context partition:

ContextPartitionSelectorCategory categorySmall = new ContextPartitionSelectorCategory() {
    public Set<String> getLabels() {
      return Collections.singleton("small");
    }
  };
Iterator<EventBean> it = stmt.iterator(categorySmall);

Your application may execute fire-and-forget queries against one or more specific context partitions by using the executeQuery method on EPFireAndForgetService or the execute method on EPFireAndForgetPreparedQuery and by providing an implementation of ContextPartitionSelector.

Fire-and-forget queries execute against named windows and tables. Therefore the statement below creates a named window, which the runtime manages separately for small, medium and large transactions according to the context declared earlier:

// Named window per category
context TxnCategoryContext create window BankTxnWindow#time(1 min) as BankTxn

The following code demonstrates how to fire a fire-and-forget query against the small and the medium category:

ContextPartitionSelectorCategory categorySmallMed = new ContextPartitionSelectorCategory() {
    public Set<String> getLabels() {
      return new HashSet<String>(Arrays.asList("small", "medium"));
    }
  };
runtime.getFireAndForgetService().executeQuery(
   "select count(*) from BankTxnWindow", 
   new ContextPartitionSelector[] {categorySmallMed});

The following limitations apply:

Fire-and-forget queries may not join named windows or tables that declare a context.

15.16.1. Selectors

This section summarizes the selector interfaces that are available for use to identify and interrogate context partitions. Please refer to the JavaDoc documentation for package com.espertech.esper.common.client.context and classes therein for additional information.

Use an implementation of ContextPartitionSelectorAll or the ContextPartitionSelectorAll.INSTANCE object to instruct the runtime to consider all context partitions.

Use an implementation of ContextPartitionSelectorById if your application knows the context partition ids to query. This selector instructs the runtime to consider only those provided context partitions based on their integer id value. The runtime outputs the context partition id in the built-in property context.id.

Use an implementation of ContextPartitionSelectorFiltered to receive and interrogate context partitions. Use the filter method that receives a ContextPartitionIdentifier to return a boolean indicator whether to include the context partition or not. The ContextPartitionIdentifier provides information about each context partition. Your application may not retain ContextPartitionIdentifier instances between filter method invocations as the runtime reuses the same instance. This selector is not supported with nested contexts.

Use an implementation of ContextPartitionSelectorCategory with category contexts.

Use an implementation of ContextPartitionSelectorSegmented with keyed segmented contexts.

Use an implementation of ContextPartitionSelectorHash with hash segmented contexts.

Use an implementation of ContextPartitionSelectorNested in combination with the selectors described above with nested contexts.

15.17. Context Partition Administration

This section briefly discusses the API to manage context partitions. Contexts are discussed in Chapter 4, Context and Context Partitions.

The section is only relevant when you declare a context. It applies to all different types of hash, partitioned, category, overlapping or other temporal contexts.

The EPContextPartitionService interface offers methods to manage context partitions. Obtain the service from a runtime by calling getContextPartitionService on EPRuntime.

The context partition admin API allows an application to:

Interrogate the state and identifiers for existing context partitions.
Determine statements associated to a context and context nesting level.
Receive a callback when new contexts get created and destroyed or when context partitions are allocated and de-allocated.
Obtain context properties.

Please see the JavaDoc documentation for more information.

15.18. Test and Assertion Support

Esper offers a listener and an assertions class to facilitate automated testing of EPL rules, for example when using a test framework such as JUnit or TestNG.

Esper does not require any specific test framework. If your application has the JUnit test framework on the classpath, Esper uses junit.framework.AssertionFailedError to indicate assertion errors, so as to integrate with continuous integration tools.

For detailed method-level information, please consult the JavaDoc of the package com.espertech.esper.common.client.scopetest and com.espertech.esper.runtime.client.scopetest.

The class com.espertech.esper.common.client.scopetest.EPAssertionUtil provides methods to assert or compare event property values as well as to perform various array arithmetic, sort events and convert events or iterators to arrays.

The class com.espertech.esper.runtime.client.scopetest.SupportUpdateListener provides an UpdateListener implementation that collects events and returns event data for assertion.

The class com.espertech.esper.runtime.client.scopetest.SupportSubscriber provides a subscriber implementation that collects events and returns event data for assertion. The SupportSubscriberMRD is a subscriber that accepts events via multi-row delivery. The SupportSubscriber and SupportSubscriberMRD work similarly to SupportUpdateListener, which is introduced in more detail below.

15.18.1. `EPAssertionUtil` Summary

The table below summarizes only the most relevant assertion methods offered by EPAssertionUtil. Most methods have multiple footprints (overloads) that are not listed in detail below. Please consult the JavaDoc for additional method-level information.

Table 15.10. Method Summary for EPAssertionUtil

Name	Description
`assertProps`	Methods that assert that the property values of a single `EventBean`, POJO or Map match the expected values.
`assertPropsPerRow`	Methods that assert that the property values of multiple `EventBean` instances, POJOs or Maps match the expected values.
`assertPropsPerRowAnyOrder`	Same as above, but any row may match. Useful for unordered result sets.
`assertEqualsExactOrder`	Methods that compare arrays, allowing `null` as parameters.
`assertEqualsAnyOrder`	Same as above, but any row may match. Useful for unordered result sets.

15.18.2. `SupportUpdateListener` Summary

The below table only summarizes the most relevant methods offered by SupportUpdateListener. Please consult the JavaDoc for additional information.

Table 15.11. Method Summary for SupportUpdateListener

Name	Description
`reset`	Initializes listener clearing current events and resetting the invoked flag.
`getAndClearIsInvoked`	Returns the "invoked" flag indicating the listener has been invoked, and clears the flag.
`getLastNewData`	Returns the last events received by the listener.
`getAndResetDataListsFlattened`	Returns all events received by the listener as a pair.
`assertOneGetNewAndReset`	Asserts that exactly one new event was received and no removed events, returns the event and resets the listener.
`assertOneGetNew`	Asserts that exactly one new event was received and returns the event.

15.18.3. Usage Example

The next code block is a short but complete programming example that asserts that the properties received from output events match the expected values.

String epl = "select personName, count(*) as cnt from PersonEvent#length(3) group by personName";
Configuration configuration = new Configuration();
configuration.getCommon().addEventType(PersonEvent.class);
CompilerArguments compilerArguments = new CompilerArguments(configuration);
EPCompiled compiled = EPCompilerProvider.getCompiler().compile(epl, compilerArguments);

EPRuntime runtime = EPRuntimeProvider.getDefaultRuntime(configuration);
EPStatement stmt = runtime.getDeploymentService().deploy(compiled).getStatements()[0];

SupportUpdateListener listener = new SupportUpdateListener();
stmt.addListener(listener);

runtime.getEventService().sendEventBean(new PersonEvent("Joe"), "PersonEvent");
EPAssertionUtil.assertProps(listener.assertOneGetNewAndReset(), "personName,cnt".split(","),
    new Object[]{"Joe", 1L});

A few additional examples are shown below:

String[] fields = new String[] {"property"};			
EPAssertionUtil.assertPropsPerRow(listener.getAndResetDataListsFlattened(), fields, 
    new Object[][]{{"E2"}}, new Object[][]{{"E1"}});

EPAssertionUtil.assertPropsPerRow(listener.getAndResetLastNewData(), fields, 
    new Object[][]{{"E1"}, {"E2"}, {"E3"}});

assertTrue(listener.getAndClearIsInvoked());

Please refer to the Esper codebase test sources for more examples using the assertion class and the listener class.

15.19. OSGi, Class Loader, Class-For-Name

When deploying compiled modules the runtime may use a class loader to find resources. Your application has full control over class-for-name and classloader use. OSGi environments can provide a specific class-for-name and class loader. Please refer to Section 16.7, “Passing Services or Transient Objects”.

15.20. When Deploying with J2EE

The compiler and runtime can be deployed as part of a J2EE web or enterprise application archive to a web application server. When designing for deployment into a J2EE web application server, please consider the items discussed here.

We provide a sample servlet context listener in this section that uses the deployment API to deploy and undeploy modules as part of the servlet lifecycle.

The distribution provides a message-driven bean (MDB) example that you may find useful.

Esper does not have a dependency on any J2EE or Servlet APIs to allow the runtime to run in any environment or container.

15.20.1. J2EE Deployment Considerations

As multiple web applications deployed to a J2EE web application server typically have a separate classloader per application, you should consider whether runtime instances need to be shared between applications or can remain separate runtime instances. Consider the EPRuntimeProvider a Singleton. When deploying multiple web applications, your J2EE container classloader may provide a separate instance of the Singleton EPRuntimeProvider to each web application resulting in multiple independent runtime instances.

To share EPRuntime instances between web applications, one approach is to add the runtime jar files to the system classpath. A second approach can be to have multiple web applications share the same servlet context and have your application place the EPRuntime instance into a servlet context attribute for sharing. Architecturally you may also consider a single archived application (such as a message-driven bean) that all your web applications communicate with via the JMS broker provided by your application server or an external JMS broker.

As per J2EE standards there are restrictions in regards to starting new threads in J2EE application code. Esper adheres to these restrictions: it can be driven entirely by external events. To remove all Esper threads, set the internal timer off and leave the advanced threading options turned off. To provide timer events when the internal timer is turned off, you should check with your J2EE application container for support of the Java system timer or for support of batch or work loading to send timer events to a runtime instance.

As per J2EE standards there are restrictions in regards to input and output by J2EE application code. Esper adheres to these restrictions: by itself it does not start socket listeners or perform any file IO.

15.20.2. Servlet Context Listener

When deploying a J2EE archive that contains EPL module files, the sample code below reads and deploys the EPL module files packaged with the enterprise or web application archive when the servlet context initializes. The sample undeploys the EPL modules when the servlet context gets destroyed.

A sample web.xml configuration extract is:

<?xml version="1.0" encoding="UTF-8"?>
<web-app>
  <listener>
    <listener-class>SampleServletListener</listener-class>
  </listener>
  <context-param>
    <param-name>eplmodules</param-name>
    <param-value>switchmonitor.epl</param-value>
  </context-param>
</web-app>

A sample servlet listener that deploys EPL module files packaged into the archive on context initialization and that undeploys when the application server destroys the context is shown here:

public class SampleServletListener implements ServletContextListener {

  private List<String> deploymentIds = new ArrayList<String>();
  
  public void contextInitialized(ServletContextEvent servletContextEvent) {
    try {
      String modulesList = servletContextEvent.getServletContext().getInitParameter("eplmodules");
      List<Module> modules = new ArrayList<Module>();
      if (modulesList != null) {
        String[] split = modulesList.split(",");
        for (int i = 0; i < split.length; i++) {
          String resourceName = split[i].trim();
          if (resourceName.length() == 0) {
            continue;
          }
          String realPath = servletContextEvent.getServletContext().getRealPath(resourceName);
          Module module = EPCompilerProvider.getCompiler().readModule(new File(realPath));
          modules.add(module);
        }
      }
    
      // Determine deployment order
      ModuleOrder order = ModuleOrderUtil.getModuleOrder(modules, null);
  
      // Deploy
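      // One possible approach (an assumption of this sample): compile each module against
      // the default runtime's configuration and path, then deploy and keep the deployment id
      EPRuntime runtime = EPRuntimeProvider.getDefaultRuntime();
      for (Module module : order.getOrdered()) {
        CompilerArguments args = new CompilerArguments(runtime.getConfigurationDeepCopy());
        args.getPath().add(runtime.getRuntimePath());
        EPCompiled compiled = EPCompilerProvider.getCompiler().compile(module, args);
        EPDeployment deployment = runtime.getDeploymentService().deploy(compiled);
        deploymentIds.add(deployment.getDeploymentId());
      }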
    }
    catch (Exception ex) {
      ex.printStackTrace();
    }
  }
  
  public void contextDestroyed(ServletContextEvent servletContextEvent) {
    EPRuntime runtime = EPRuntimeProvider.getDefaultRuntime();
    for (String deploymentId : deploymentIds) {
       runtime.getDeploymentService().undeploy(deploymentId);
    }
  }
}
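
Because the eplmodules context parameter is split on commas, a single parameter value can name several module files. The following is a minimal standalone sketch of just that parsing step, using only the JDK; the class name ModuleListParser and the file name alarms.epl are hypothetical and not part of Esper or the sample above.

```java
import java.util.ArrayList;
import java.util.List;

class ModuleListParser {

    /** Splits a comma-separated parameter value into trimmed, non-empty resource names. */
    static List<String> parse(String modulesList) {
        List<String> names = new ArrayList<>();
        if (modulesList == null) {
            return names; // parameter not configured: nothing to deploy
        }
        for (String part : modulesList.split(",")) {
            String name = part.trim();
            if (!name.isEmpty()) {
                names.add(name);
            }
        }
        return names;
    }

    public static void main(String[] args) {
        // prints [switchmonitor.epl, alarms.epl]
        System.out.println(parse("switchmonitor.epl, alarms.epl,,"));
    }
}
```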

Chapter 16. Configuration

16.1. Overview

16.2. Programmatic Configuration

16.3. Configuration via XML File

16.4. Configuration Common

16.4.1. Annotation Class and Package Imports
16.4.2. Class and Package Imports
16.4.3. Events Represented by Classes
16.4.4. Events Represented by java.util.Map
16.4.5. Events Represented by Object[] (Object-array)
16.4.6. Events Represented by Avro GenericData.Record
16.4.7. Events Represented by org.w3c.dom.Node
16.4.8. Event Type Defaults
16.4.9. Event Type Import Package (Event Type Auto-Name)
16.4.10. From-Clause Method Invocation
16.4.11. Relational Database Access
16.4.12. Common Settings Related to Logging
16.4.13. Common Settings Related to Time Source
16.4.14. Variables
16.4.15. Variant Stream

16.5. Configuration Compiler

16.5.1. Compiler Settings Related to Byte Code Generation
16.5.2. Compiler Settings Related to View Resources
16.5.3. Compiler Settings Related to Logging
16.5.4. Compiler Settings Related to Stream Selection
16.5.5. Compiler Settings Related to Language and Locale
16.5.6. Compiler Settings Related to Expression Evaluation
16.5.7. Compiler Settings Related to Scripts
16.5.8. Compiler Settings Related to Execution of Statements

16.6. Configuration Runtime

16.6.1. Runtime Settings Related to Concurrency and Threading
16.6.2. Runtime Settings Related to Logging
16.6.3. Runtime Settings Related to Variables
16.6.4. Runtime Settings Related to Patterns
16.6.5. Runtime Settings Related to Match-Recognize
16.6.6. Runtime Settings Related to Time Source
16.6.7. Runtime Settings Related to JMX Metrics
16.6.8. Runtime Settings Related to Metrics Reporting
16.6.9. Runtime Settings Related to Expression Evaluation
16.6.10. Runtime Settings Related to Execution of Statements
16.6.11. Runtime Settings Related to Exception Handling
16.6.12. Runtime Settings Related to Condition Handling

16.7. Passing Services or Transient Objects

16.7.1. Service Example
16.7.2. Class-for-Name
16.7.3. Class Loader

16.8. Type Names

16.9. Logging Configuration

16.9.1. Log4j Logging Configuration

16.1. Overview

Compile-time and runtime configuration is entirely optional. The compiler and runtime work out-of-the-box without configuration.

All configuration lives in the Configuration class (com.espertech.esper.common.client.configuration.Configuration).

The configuration class has configure methods that can read configuration XML and that add the information contained in the XML to the configuration. You can read multiple XML sources additively.

A configuration has three sections:

The common section with configuration that both the compiler and the runtime may use, represented by the ConfigurationCommon class.
The compiler section, which provides configuration for use only by the compiler, represented by the ConfigurationCompiler class.
The runtime section, which provides configuration for use only by the runtime, represented by the ConfigurationRuntime class.

Configuration is an initialization-time object. The compiler does not retain any association back to configuration. The runtime makes a deep copy of the configuration object available (see getConfigurationDeepCopy on EPRuntime) but the configuration object cannot be changed once provided to the runtime.

16.2. Programmatic Configuration

You may obtain a Configuration instance by instantiating it directly and adding or setting values on it.

The following example code adds a preconfigured event type and adds an import to the common section of the configuration.

Configuration configuration = new Configuration();
configuration.getCommon().addEventType("PriceLimit", PriceLimit.class.getName());
configuration.getCommon().addImport("org.mycompany.mypackage.MyUtility");

The above example adds a preconfigured event type. For adding an event type at runtime please use create schema.

16.3. Configuration via XML File

In addition to programmatic configuration, or as an alternative approach, you may specify configuration items in XML files.

The default name for the XML configuration file is esper.cfg.xml. The configuration class reads this file from the root of the CLASSPATH as an application resource via the configure method.

Configuration configuration = new Configuration();		
configuration.configure();

The Configuration class can read the XML configuration file from other sources as well. The configure method accepts URL, File and String filename parameters.

Configuration configuration = new Configuration();		
configuration.configure("myconfigfile.esper.cfg.xml");

The schema for the configuration file can be found in the etc folder and is named esper-configuration-majorversion-0.xsd. The schema is available online at http://www.espertech.com/schema/esper/esper-configuration-majorversion-0.xsd so that a tool can fetch it automatically. The namespace is http://www.espertech.com/schema/esper.

You can use the XML schema file to validate that your XML configuration file is valid.
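
Such a validation can be done with the JDK's own javax.xml.validation API. The sketch below is only an illustration: the class name ConfigSchemaCheck and the tiny inline schema are hypothetical stand-ins so that the example runs on its own. In an application you would presumably build the StreamSource objects from the schema file in the etc folder and from your configuration file instead.

```java
import java.io.IOException;
import java.io.StringReader;
import javax.xml.XMLConstants;
import javax.xml.transform.stream.StreamSource;
import javax.xml.validation.Schema;
import javax.xml.validation.SchemaFactory;
import org.xml.sax.SAXException;

class ConfigSchemaCheck {

    // Stand-in schema for illustration only; this is NOT the Esper configuration schema.
    static final String SAMPLE_XSD =
        "<xs:schema xmlns:xs='http://www.w3.org/2001/XMLSchema'>"
      + "<xs:element name='sample-configuration'>"
      + "<xs:complexType>"
      + "<xs:attribute name='name' type='xs:string' use='required'/>"
      + "</xs:complexType>"
      + "</xs:element>"
      + "</xs:schema>";

    /** Returns true if the XML text is well-formed and valid against the XSD text. */
    static boolean isValid(String xsdText, String xmlText) {
        try {
            SchemaFactory factory = SchemaFactory.newInstance(XMLConstants.W3C_XML_SCHEMA_NS_URI);
            Schema schema = factory.newSchema(new StreamSource(new StringReader(xsdText)));
            // Without a custom error handler, validate() throws on the first validation error
            schema.newValidator().validate(new StreamSource(new StringReader(xmlText)));
            return true;
        } catch (SAXException | IOException ex) {
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println(isValid(SAMPLE_XSD, "<sample-configuration name='a'/>")); // prints true
        System.out.println(isValid(SAMPLE_XSD, "<sample-configuration/>"));          // prints false
    }
}
```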

Here is an example configuration file.

<?xml version="1.0" encoding="UTF-8"?>
<esper-configuration xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
    xmlns="http://www.espertech.com/schema/esper"
    xsi:schemaLocation="
http://www.espertech.com/schema/esper
http://www.espertech.com/schema/esper/esper-configuration-8-0.xsd">
  <common>
    <event-type name="PriceLimit" class="com.espertech.esper.example.stockticker.event.PriceLimit"/>
    <auto-import import-name="org.mycompany.mypackage.MyUtility"/>
  </common>
</esper-configuration>

16.4. Configuration Common

The common section of the configuration applies to the compiler and also applies to the runtime.

16.4.1. Annotation Class and Package Imports

If your application has certain classes or packages that should only be visible within an @-annotation, you may add these to the annotation imports list. Such classes are only visible when used in an annotation and not elsewhere.

In an XML configuration file the auto-import-annotations configuration may look as below:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <auto-import-annotations import-name="com.mycompany.mypackage.myannotations.*"/>
  </common>
</esper-configuration>

Here is an example of providing annotation-only imports via the API:

Configuration config = new Configuration();
// package import, only visible for annotation use
config.getCommon().addAnnotationImport("com.mycompany.mypackage.myannotations.*");

16.4.2. Class and Package Imports

EPL allows invocations of static Java library functions in expressions, as outlined in Section 10.1, “Single-Row Function Reference”. This configuration item can be set to allow a partial rather than a fully qualified class name in such invocations. The imports work in the same way as in Java files, so both packages and classes can be imported.

select Math.max(priceOne, priceTwo)
// via configuration equivalent to
select java.lang.Math.max(priceOne, priceTwo)

EPL auto-imports the following Java library packages. Any additional imports that are specified in configuration files or through the API are added to the configuration in addition to the imports below.

java.lang.*
java.math.*
java.text.*
java.util.*

In an XML configuration file the auto-import configuration may look as below:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <auto-import import-name="com.mycompany.mypackage.*"/>
    <auto-import import-name="com.mycompany.myapp.MyUtilityClass"/>  
  </common>
</esper-configuration>

Here is an example of providing imports via the API:

Configuration config = new Configuration();
config.getCommon().addImport("com.mycompany.mypackage.*");	// package import
config.getCommon().addImport("com.mycompany.mypackage.MyLib");   // class import
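
To illustrate what "working in the same way as in Java files" means for a partial class name, here is a small standalone sketch that resolves a name against a list of package and class imports. It is a simplified model written for this text, not Esper's actual resolution code; the class name ImportResolutionSketch and its resolve method are hypothetical.

```java
import java.util.Arrays;
import java.util.List;

class ImportResolutionSketch {

    /** Resolves a class name against imports; returns null when nothing matches. */
    static Class<?> resolve(String name, List<String> imports) {
        // A fully-qualified name resolves without consulting the imports
        Class<?> direct = tryLoad(name);
        if (direct != null) {
            return direct;
        }
        for (String imp : imports) {
            String candidate;
            if (imp.endsWith(".*")) {
                // package import: "java.lang.*" + "Math" -> "java.lang.Math"
                candidate = imp.substring(0, imp.length() - 1) + name;
            } else if (imp.endsWith("." + name)) {
                // class import: "com.mycompany.myapp.MyUtilityClass"
                candidate = imp;
            } else {
                continue;
            }
            Class<?> found = tryLoad(candidate);
            if (found != null) {
                return found;
            }
        }
        return null;
    }

    private static Class<?> tryLoad(String className) {
        try {
            return Class.forName(className);
        } catch (ClassNotFoundException ex) {
            return null;
        }
    }

    public static void main(String[] args) {
        List<String> imports = Arrays.asList("java.lang.*", "java.math.*", "java.text.*", "java.util.*");
        System.out.println(resolve("Math", imports));       // prints class java.lang.Math
        System.out.println(resolve("BigDecimal", imports)); // prints class java.math.BigDecimal
    }
}
```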

16.4.3. Events Represented by Classes

This section is relevant if you want to use regular classes to represent events.

The runtime can process event objects via the sendEventBean(Object event, String eventTypeName) method on the EPEventService interface.

16.4.3.1. Bean-Style Classes

For JavaBean-style classes that have getter methods please specify an event type name and the class name or class. Interfaces and abstract classes are also supported.

The below sample XML configures an event type named StockTick and provides the fully-qualified class name.

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <event-type name="StockTick" class="com.espertech.esper.example.stockticker.event.StockTick"/>
  </common>
</esper-configuration>

The sample code for the configuration is:

Configuration configuration = new Configuration();
configuration.getCommon().addEventType("StockTick", StockTick.class.getName());

16.4.3.2. Non-JavaBean and Legacy Java Event Classes

Use this setting when method and member variable names in your Java class do not adhere to the JavaBean convention. Any public methods and public member variables can be exposed as event properties via the configuration below.

A Java class can optionally be configured with an accessor style attribute. This attribute instructs the compiler how it should expose methods and fields for use as event properties in statements.

Table 16.1. Accessor Styles

Style Name	Description
`javabean`	As the default setting, the compiler exposes an event property for each public method following the JavaBean getter-method conventions
`public`	The compiler exposes an event property for each public method and public member variable of the given class
`explicit`	The compiler exposes an event property only for the explicitly configured public methods and public member variables

For NEsper .NET accessor styles are NATIVE, PUBLIC and EXPLICIT.

Using the public setting for the accessor-style attribute instructs the compiler to expose an event property for each public method and public member variable of a Java class. The compiler assigns event property names of the same name as the name of the method or member variable in the Java class.

For example, assuming the class MyLegacyEvent exposes a method named readValue and a member variable named myField, you can then use properties as shown.

select readValue, myField from MyLegacyEvent

Using the explicit setting for the accessor-style attribute requires that event properties are declared via configuration. This is outlined in the next section.

When configuring a compiler or runtime from an XML configuration file, the XML snippet below demonstrates the use of the legacy-type element and the accessor-style attribute.

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <event-type name="MyLegacyEvent" class="com.mycompany.mypackage.MyLegacyEventClass">
      <legacy-type accessor-style="public"/>
    </event-type>
  </common>
</esper-configuration>

When configuring a compiler or runtime via the Configuration API, the sample code below shows how to set the accessor style.

Configuration configuration = new Configuration();
ConfigurationCommonEventTypeBean legacyDef = new ConfigurationCommonEventTypeBean();
legacyDef.setAccessorStyle(AccessorStyle.PUBLIC);
configuration.getCommon().addEventType("MyLegacyEvent", MyLegacyEventClass.class.getName(), legacyDef);
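
The difference between the javabean and public accessor styles can be pictured with plain Java reflection. The sketch below is a simplified model, not Esper's implementation: the class AccessorStyleSketch and the nested MyLegacyEventClass are hypothetical, only get-prefixed getters are considered for the javabean style, and the public style uses method and field names verbatim as described above.

```java
import java.lang.reflect.Field;
import java.lang.reflect.Method;
import java.lang.reflect.Modifier;
import java.util.Set;
import java.util.TreeSet;

class AccessorStyleSketch {

    // Hypothetical legacy class, loosely following the MyLegacyEvent example.
    public static class MyLegacyEventClass {
        public int myField = 1;
        public String readValue() { return "value"; }
        public double getPrice() { return 10.0; }
    }

    private static boolean isReadableMethod(Method m) {
        return Modifier.isPublic(m.getModifiers()) && !m.isSynthetic()
            && m.getParameterCount() == 0 && m.getReturnType() != void.class;
    }

    /** Property names under a simplified JavaBean convention (getXyz -> xyz). */
    static Set<String> javabeanStyle(Class<?> clazz) {
        Set<String> names = new TreeSet<>();
        for (Method m : clazz.getDeclaredMethods()) {
            String n = m.getName();
            if (isReadableMethod(m) && n.startsWith("get") && n.length() > 3) {
                names.add(Character.toLowerCase(n.charAt(3)) + n.substring(4));
            }
        }
        return names;
    }

    /** Property names when every public method and public field is exposed by its own name. */
    static Set<String> publicStyle(Class<?> clazz) {
        Set<String> names = new TreeSet<>();
        for (Method m : clazz.getDeclaredMethods()) {
            if (isReadableMethod(m)) {
                names.add(m.getName());
            }
        }
        for (Field f : clazz.getDeclaredFields()) {
            if (Modifier.isPublic(f.getModifiers()) && !f.isSynthetic()) {
                names.add(f.getName());
            }
        }
        return names;
    }

    public static void main(String[] args) {
        System.out.println(javabeanStyle(MyLegacyEventClass.class)); // prints [price]
        System.out.println(publicStyle(MyLegacyEventClass.class));   // prints [getPrice, myField, readValue]
    }
}
```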

16.4.3.3. Specifying Event Properties for Java Classes

Sometimes it may be convenient to use event property names in patterns and statements that are backed by a given public method or member variable (field) in a Java class. It can also be useful to declare multiple event properties that each map to the same method or member variable.

We can configure properties of events via method-property and field-property elements, as the next example shows.

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <event-type name="StockTick" class="com.espertech.esper.example.stockticker.event.StockTickEvent">
      <legacy-type accessor-style="javabean" code-generation="enabled">
        <method-property name="price" accessor-method="getCurrentPrice" />
        <field-property name="volume" accessor-field="volumeField" />
      </legacy-type>
    </event-type>
  </common>
</esper-configuration>

The XML configuration snippet above declared an event property named price backed by a getter-method named getCurrentPrice, and a second event property named volume that is backed by a public member variable named volumeField. Thus the price and volume properties can be used in a statement:

select avg(price * volume) from StockTick

As with all configuration options, the API can also be used:

Configuration configuration = new Configuration();
ConfigurationCommonEventTypeBean legacyDef = new ConfigurationCommonEventTypeBean();
legacyDef.addMethodProperty("price", "getCurrentPrice");
legacyDef.addFieldProperty("volume", "volumeField");
configuration.getCommon().addEventType("StockTick", StockTickEvent.class.getName(), legacyDef);

16.4.3.4. Case Sensitivity and Property Names

By default the compiler resolves Java event properties case-sensitively. That is, property names in statements must match JavaBean-convention property names in name and case. This option controls case sensitivity per Java class.

In the configuration XML, the optional property-resolution-style attribute in the legacy-type element can be set to any of these values:

Table 16.2. Property Resolution Case Sensitivity Styles

Style Name	Description
`case_sensitive` (default)	As the default setting, the compiler matches property names for the exact name and case only.
`case_insensitive`	Properties are matched ignoring case. The search chooses the first property that matches the name exactly or, should no exact match be found, the first property that matches case-insensitively.
`distinct_case_insensitive`	Properties are matched ignoring case. The search chooses the property whose name matches case-insensitively. If more than one name can be mapped to the property, an exception is thrown.

The sample below shows this option in XML configuration, however the setting can also be changed via API:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
      <event-type name="MyLegacyEvent" class="com.mycompany.package.MyLegacyEventClass">
        <legacy-type property-resolution-style="case_insensitive"/>
      </event-type>
  </common>
</esper-configuration>
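
The three resolution styles can be modeled in a few lines of plain Java. This is a sketch of the behavior as described in Table 16.2, not Esper's implementation; the class PropertyResolutionSketch, its Style enum and the resolve method are hypothetical names.

```java
import java.util.Arrays;
import java.util.List;

class PropertyResolutionSketch {

    enum Style { CASE_SENSITIVE, CASE_INSENSITIVE, DISTINCT_CASE_INSENSITIVE }

    /** Returns the declared property name chosen for the requested name, or null if none matches. */
    static String resolve(String requested, List<String> declared, Style style) {
        switch (style) {
            case CASE_SENSITIVE:
                return declared.contains(requested) ? requested : null;
            case CASE_INSENSITIVE:
                // exact match wins; otherwise the first case-insensitive match
                if (declared.contains(requested)) {
                    return requested;
                }
                for (String name : declared) {
                    if (name.equalsIgnoreCase(requested)) {
                        return name;
                    }
                }
                return null;
            default: // DISTINCT_CASE_INSENSITIVE
                String match = null;
                for (String name : declared) {
                    if (name.equalsIgnoreCase(requested)) {
                        if (match != null) {
                            throw new IllegalStateException("Ambiguous property name: " + requested);
                        }
                        match = name;
                    }
                }
                return match;
        }
    }

    public static void main(String[] args) {
        List<String> declared = Arrays.asList("price", "Price", "volume");
        System.out.println(resolve("PRICE", declared, Style.CASE_SENSITIVE));             // prints null
        System.out.println(resolve("PRICE", declared, Style.CASE_INSENSITIVE));           // prints price
        System.out.println(resolve("VOLUME", declared, Style.DISTINCT_CASE_INSENSITIVE)); // prints volume
    }
}
```

In this model, resolving PRICE with the distinct style throws an exception because both price and Price match.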

16.4.3.5. Factory and Copy Method

The insert into clause can directly instantiate and populate your event object. By default the runtime invokes the default constructor to instantiate an event object. To change this behavior, you may configure a factory method. The factory method is a method name or a class name plus a method name (in the format class.method) that returns an instance of the class.

The update clause can change event properties on an event object. For the purpose of maintaining consistency, the runtime may have to copy your event object via serialization (implement the java.io.Serializable interface). If instead you do not want any copy operations to occur, or your application needs to control the copy operation, you may configure a copy method. The copy method is the name of a method on the event object that copies the event object.

The sample below shows this option in XML configuration, however the setting can also be changed via ConfigurationCommonEventTypeBean:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <event-type name="MyLegacyEvent" class="com.mycompany.package.MyLegacyEventClass">
      <legacy-type factory-method="com.mycompany.myapp.MySampleEventFactory.createMyLegacyTypeEvent" copy-method="myCopyMethod"/>
    </event-type>
  </common>
</esper-configuration>

The copy method must be a public, non-static method that takes no parameters and returns a new event object (it may not return this).
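
As an illustration, an event class and factory that fit the XML sample above might look like the following sketch. The property names symbol and price are invented for the example, the classes are nested in a hypothetical CopyMethodSketch holder only to keep the example in one file, and the factory method is assumed to be static because it is referenced in class.method format.

```java
class CopyMethodSketch {

    // Hypothetical event class; method names mirror the XML sample.
    public static class MyLegacyEventClass {
        private String symbol;
        private double price;

        public String getSymbol() { return symbol; }
        public void setSymbol(String symbol) { this.symbol = symbol; }
        public double getPrice() { return price; }
        public void setPrice(double price) { this.price = price; }

        // Copy method: public, non-static, no parameters, returns a NEW object (never this).
        public MyLegacyEventClass myCopyMethod() {
            MyLegacyEventClass copy = new MyLegacyEventClass();
            copy.symbol = this.symbol;
            copy.price = this.price;
            return copy;
        }
    }

    // Hypothetical factory; assumed static.
    public static class MySampleEventFactory {
        public static MyLegacyEventClass createMyLegacyTypeEvent() {
            return new MyLegacyEventClass();
        }
    }

    public static void main(String[] args) {
        MyLegacyEventClass original = MySampleEventFactory.createMyLegacyTypeEvent();
        original.setSymbol("IBM");
        original.setPrice(100.0);

        MyLegacyEventClass copy = original.myCopyMethod();
        copy.setPrice(101.0); // changing the copy leaves the original untouched

        System.out.println(original.getPrice()); // prints 100.0
        System.out.println(copy.getPrice());     // prints 101.0
    }
}
```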

The Beacon data flow operator in connection with the Sun JVM can use sun.reflect.ReflectionFactory if the class has no default no-argument constructor.

16.4.3.6. Start and End Timestamp

For use with date-time interval methods, for example, you may let the compiler know which property of your class carries the start and end timestamp value.

The sample below shows this option in XML configuration, however the setting can also be changed via API. The sample sets the name of the property providing the start timestamp to startts and the name of the property providing the end timestamp to endts:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <event-type name="MyLegacyEvent" class="com.mycompany.package.MyLegacyEventClass">
      <legacy-type start-timestamp-property-name="startts" end-timestamp-property-name="endts"/>
    </event-type>
  </common>
</esper-configuration>

16.4.4. Events Represented by `java.util.Map`

The runtime can process java.util.Map events via the sendEventMap(Map map, String eventTypeName) method on the EPEventService interface. Entries in the Map represent event properties. Please see the Appendix E, Event Representation: java.util.Map Events section for details on how to use Map events with the runtime.

You can provide an event type name for Map events.

The below snippet of XML configuration configures an event type named MyMapEvent.

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <event-type name="MyMapEvent">
      <java-util-map>
        <map-property name="carId" class="int"/>
        <map-property name="carType" class="string"/>
        <map-property name="assembly" class="com.mycompany.Assembly"/>    
      </java-util-map>
    </event-type>
  </common>
</esper-configuration>

For NEsper .NET use util-map instead of java-util-map.

This configuration defines the carId property of MyMapEvent events to be of type int, and the carType property to be of type java.lang.String. The assembly property of the Map event will contain instances of com.mycompany.Assembly for the runtime to query.

The valid types for the class attribute are listed in Section 16.8, “Type Names”. In addition, any fully-qualified Java class name that can be resolved via Class.forName is allowed.

You can also use the configuration API to configure Map event types, as the short code snippet below demonstrates:

Map<String, Object> properties = new LinkedHashMap<String, Object>();
properties.put("carId", "int");
properties.put("carType", "string");
properties.put("assembly", Assembly.class.getName());

Configuration configuration = new Configuration();
configuration.getCommon().addEventType("MyMapEvent", properties);

A Map event type may also become a subtype of one or more supertypes that must also be Map event types. The java-util-map element provides the optional attribute supertype-names that accepts a comma-separated list of names of Map event types that are supertypes to the type:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <event-type name="AccountUpdate">
      <java-util-map supertype-names="BaseUpdate, AccountEvent">
      </java-util-map>
    </event-type>
  </common>
</esper-configuration>

A Map event type may declare a start and end timestamp property name. The XML shown next instructs the compiler that the startts property carries the event start timestamp and the endts property carries the event end timestamp:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <event-type name="AccountUpdate">
      <java-util-map start-timestamp-property-name="startts" end-timestamp-property-name="endts">
      </java-util-map>
    </event-type>
  </common>
</esper-configuration>

For adding a type at runtime please use create map schema.

16.4.5. Events Represented by `Object[]` (Object-array)

The runtime can process Object-array (Object[]) events via the sendEventObjectArray(Object[] array, String eventTypeName) method on the EPEventService interface. Elements in the Object array represent event properties. Please see the Appendix F, Event Representation: Object-Array (Object[]) Events section for details on how to use Object[] events with the runtime.

The below snippet of XML configuration configures an event type named MyObjectArrayEvent.

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <event-type name="MyObjectArrayEvent">
      <objectarray>
        <objectarray-property name="carId" class="int"/>
        <objectarray-property name="carType" class="string"/>
        <objectarray-property name="assembly" class="com.mycompany.Assembly"/>    
      </objectarray>
    </event-type>
  </common>
</esper-configuration>

This configuration defines the carId property of MyObjectArrayEvent events to be of type int, expected in the first array element ([0]). The carType property, of type java.lang.String, is expected in the second array element ([1]). The assembly property of the object array event will contain instances of com.mycompany.Assembly for the runtime to query and is expected in the third array element ([2]).

Note that the runtime does not verify the length and property values of object array events when your application sends object-array events into the runtime. For the example above, the proper object array would look as follows: new Object[] {carId, carType, assembly}.
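
Since the runtime performs no such verification, an application may want its own check before sending object-array events. The following is a minimal sketch under stated assumptions: the class ObjectArrayEventCheck and its matches method are hypothetical, the nested Assembly class stands in for com.mycompany.Assembly, and null property values are tolerated.

```java
class ObjectArrayEventCheck {

    // Hypothetical application class standing in for com.mycompany.Assembly.
    public static class Assembly { }

    // Types in declaration order: [0] carId (boxed int), [1] carType, [2] assembly.
    static final Class<?>[] TYPES = {Integer.class, String.class, Assembly.class};

    /** Returns true if the array has the declared length and each non-null value has the declared type. */
    static boolean matches(Object[] event) {
        if (event == null || event.length != TYPES.length) {
            return false;
        }
        for (int i = 0; i < TYPES.length; i++) {
            if (event[i] != null && !TYPES[i].isInstance(event[i])) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        System.out.println(matches(new Object[] {1, "sedan", new Assembly()})); // prints true
        System.out.println(matches(new Object[] {"sedan", 1, new Assembly()})); // prints false
        System.out.println(matches(new Object[] {1, "sedan"}));                 // prints false
    }
}
```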

The valid types for the class attribute are listed in Section 16.8, “Type Names”. In addition, any fully-qualified Java class name that can be resolved via Class.forName is allowed.

You can also use the configuration API to configure Object[] event types, as the short code snippet below demonstrates:

String[] propertyNames = {"carId", "carType", "assembly"};
Object[] propertyTypes = {int.class, String.class, Assembly.class};

Configuration configuration = new Configuration();
configuration.getCommon().addEventType("MyObjectArrayEvent", propertyNames, propertyTypes);

An Object-array event type may also become a subtype of one supertype that must also be an Object-array event type. The objectarray element provides the optional attribute supertype-names that accepts a single name of an Object-array event type that is the supertype to the type:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <event-type name="AccountUpdate">
      <objectarray supertype-names="BaseUpdate">
      </objectarray>
    </event-type>
  </common>
</esper-configuration>

An Object-array event type may declare a start and end timestamp property name. The XML shown next instructs the compiler that the startts property carries the event start timestamp and the endts property carries the event end timestamp:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <event-type name="AccountUpdate">
      <objectarray start-timestamp-property-name="startts" end-timestamp-property-name="endts"/>
    </event-type>
  </common>
</esper-configuration>

For adding a type at runtime please use create objectarray schema.

16.4.6. Events Represented by Avro `GenericData.Record`

The runtime can process Avro GenericData.Record events via the sendEventAvro(GenericData.Record event, String eventTypeName) method on the EPEventService interface. Please see the Appendix G, Event Representation: Avro Events (org.apache.avro.generic.GenericData.Record) section for details on how to use Avro events with the compiler and runtime.

The below snippet of XML configuration configures an event type named MyAvroEvent.

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <event-type name="MyAvroEvent">
      <avro schema-text='{"type":"record","name":"MyAvroEvent","fields":[{"name":"carId","type":"int"},{"name":"carType","type":{"type":"string","avro.java.string":"String"}}]}'/>
    </event-type>  
  </common>
</esper-configuration>

The sample Avro schema above in pretty-print is:

{
  "type" : "record",
  "name" : "MyAvroEvent",
  "fields" : [ {
    "name" : "carId",
    "type" : "int"
  }, {
    "name" : "carType",
    "type" : {
      "type" : "string",
      "avro.java.string" : "String"
    }
  } ]
}

This schema defines:

A carId property of type int.
A carType property of type string. Note: Use the Avro-provided avro.java.string property to ensure it is a java.lang.String instance and not a java.lang.CharSequence instance.

Note that the runtime does not verify that Avro events are valid or that they actually match the schema provided for the Avro event type.

You can also use the configuration API to configure Avro event types, as the short code snippet below demonstrates:

Configuration configuration = new Configuration();
ConfigurationCommonEventTypeAvro avroType = new ConfigurationCommonEventTypeAvro();
// schema is the Avro schema of the event type, assumed here to be provided by your application
avroType.setAvroSchema(schema);
configuration.getCommon().addEventTypeAvro("MyAvroType", avroType);

For adding a type at runtime please use create avro schema.

An Avro event type may also become a subtype of one supertype that must also be an Avro event type. The avro element provides the optional attribute supertype-names that accepts a single name of an Avro event type that is the supertype to the type:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <event-type name="MyAvroEvent">
      <avro supertype-names="BaseUpdate"/>
    </event-type>  
  </common>
</esper-configuration>

An Avro event type may declare a start and end timestamp property name. The XML shown next instructs the compiler that the startts property carries the event start timestamp and the endts property carries the event end timestamp:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <event-type name="MyAvroEvent">
      <avro start-timestamp-property-name="startts" end-timestamp-property-name="endts"/>
    </event-type>  
  </common>
</esper-configuration>

16.4.7. Events Represented by `org.w3c.dom.Node`

Via this configuration item the runtime can natively process org.w3c.dom.Node instances, i.e. XML document object model (DOM) nodes. Please see the Appendix H, Event Representation: org.w3c.dom.Node XML Events section for details on how to use Node events with the compiler and runtime.

EPL allows configuring XPath expressions as event properties. You can specify arbitrary XPath functions or expressions and provide a property name by which their result values will be available for use in expressions.

For XML documents that follow an XML schema, the compiler and runtime can load and interrogate your schema and validate event property names and types against the schema information.

Nested, mapped and indexed event properties are also supported in expressions against org.w3c.dom.Node events. Thus XML trees can conveniently be interrogated using the existing event property syntax for querying JavaBean objects, JavaBean object graphs or java.util.Map events.

In the simplest form, the compiler only requires a configuration entry containing the root element name and the event type name in order to process org.w3c.dom.Node events:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <event-type name="MyXMLNodeEvent">
      <xml-dom root-element-name="myevent" />
    </event-type>
  </common>
</esper-configuration>

You can also use the configuration API to configure XML event types, as the short example below demonstrates. In fact, all configuration options available through XML configuration can also be provided via setter methods on the ConfigurationCommonEventTypeXMLDOM class.

Configuration configuration = new Configuration();
ConfigurationCommonEventTypeXMLDOM desc = new ConfigurationCommonEventTypeXMLDOM();
desc.setRootElementName("myevent");
desc.addXPathProperty("name1", "/element/@attribute", XPathConstants.STRING);
desc.addXPathProperty("name2", "/element/subelement", XPathConstants.NUMBER);
configuration.getCommon().addEventType("MyXMLNodeEvent", desc);

The next example presents configuration options in a sample configuration entry.

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <event-type name="AutoIdRFIDEvent">
      <xml-dom root-element-name="Sensor" schema-resource="data/AutoIdPmlCore.xsd" 
         default-namespace="urn:autoid:specification:interchange:PMLCore:xml:schema:1">
        <namespace-prefix prefix="pmlcore" 
           namespace="urn:autoid:specification:interchange:PMLCore:xml:schema:1"/>
        <xpath-property property-name="countTags" 
           xpath="count(/pmlcore:Sensor/pmlcore:Observation/pmlcore:Tag)" type="number"/>
      </xml-dom>
    </event-type>
  </common>
</esper-configuration>

This example configures an event property named countTags whose value is computed by an XPath expression. The namespace prefixes and default namespace are for use with XPath expressions and must also be made known to the compiler and runtime in order for the compiler/runtime to compile XPath expressions. Via the schema-resource attribute you can instruct the compiler/runtime to load a schema file. You may also use schema-text instead to provide the actual text of the schema.
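To illustrate why the namespace prefix must be declared before the countTags expression can compile, the following is a minimal sketch that evaluates the same XPath with the JDK's javax.xml.xpath API. It is not the Esper implementation; the class name CountTagsSketch and the sample documents are assumptions made for illustration.

```java
import java.io.StringReader;
import java.util.Collections;
import java.util.Iterator;
import javax.xml.XMLConstants;
import javax.xml.namespace.NamespaceContext;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

// Hypothetical helper, not part of Esper: evaluates the countTags XPath with plain JDK classes.
final class CountTagsSketch {
    static final String PML_NS = "urn:autoid:specification:interchange:PMLCore:xml:schema:1";

    static double countTags(String xml) {
        try {
            DocumentBuilderFactory dbf = DocumentBuilderFactory.newInstance();
            dbf.setNamespaceAware(true); // prefixed XPath steps only match namespace-aware DOM nodes
            Document doc = dbf.newDocumentBuilder().parse(new InputSource(new StringReader(xml)));

            XPath xpath = XPathFactory.newInstance().newXPath();
            // Plays the role of the namespace-prefix element: maps "pmlcore" to the namespace URI.
            xpath.setNamespaceContext(new NamespaceContext() {
                @Override public String getNamespaceURI(String prefix) {
                    return "pmlcore".equals(prefix) ? PML_NS : XMLConstants.NULL_NS_URI;
                }
                @Override public String getPrefix(String uri) {
                    return PML_NS.equals(uri) ? "pmlcore" : null;
                }
                @Override public Iterator<String> getPrefixes(String uri) {
                    return PML_NS.equals(uri)
                        ? Collections.singletonList("pmlcore").iterator()
                        : Collections.<String>emptyIterator();
                }
            });
            // type="number" corresponds to XPathConstants.NUMBER, which yields a Double.
            return (Double) xpath.evaluate(
                "count(/pmlcore:Sensor/pmlcore:Observation/pmlcore:Tag)", doc, XPathConstants.NUMBER);
        } catch (Exception ex) {
            throw new IllegalStateException("Failed to evaluate XPath", ex);
        }
    }

    public static void main(String[] args) {
        String xml = "<Sensor xmlns=\"" + PML_NS + "\"><Observation><Tag/><Tag/></Observation></Sensor>";
        System.out.println(countTags(xml));
    }
}
```

Without the namespace context the prefixed expression cannot be compiled by the XPath implementation, and a document whose elements are not in the PMLCore namespace produces a count of zero.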

16.4.7.1. Schema Resource

The schema-resource attribute takes a schema resource URL or classpath-relative filename. The compiler and runtime attempt to resolve the schema resource as a URL. If the schema resource name is not a valid URL, the compiler and runtime attempt to resolve the resource from classpath via the ClassLoader.getResource method using the thread context class loader. If the name could not be resolved, the compiler and runtime use the Configuration class classloader. Use the schema-text attribute instead when it is more practical to provide the actual text of the schema.
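The resolution order described above can be sketched with JDK classes only. This is an illustration under stated assumptions, not the actual Esper code: the class SchemaResourceResolverSketch and its resolve method are hypothetical, and the fallbackOwner parameter stands in for the Configuration class.

```java
import java.net.MalformedURLException;
import java.net.URI;
import java.net.URISyntaxException;
import java.net.URL;

// Hypothetical helper that mirrors the documented lookup order; not part of Esper.
final class SchemaResourceResolverSketch {

    // Returns the resolved location, or null when the name cannot be resolved.
    static URL resolve(String resource, Class<?> fallbackOwner) {
        // Step 1: treat the name as a URL if it parses as an absolute one.
        try {
            return new URI(resource).toURL();
        } catch (URISyntaxException | MalformedURLException | IllegalArgumentException ex) {
            // Not a valid absolute URL: continue with classpath lookup.
        }
        // Step 2: classpath lookup via the thread context class loader.
        ClassLoader contextLoader = Thread.currentThread().getContextClassLoader();
        if (contextLoader != null) {
            URL url = contextLoader.getResource(resource);
            if (url != null) {
                return url;
            }
        }
        // Step 3: fall back to the class loader of a given class.
        ClassLoader ownerLoader = fallbackOwner.getClassLoader();
        return ownerLoader == null
            ? ClassLoader.getSystemResource(resource)
            : ownerLoader.getResource(resource);
    }

    public static void main(String[] args) {
        System.out.println(resolve("file:/tmp/AutoIdPmlCore.xsd", SchemaResourceResolverSketch.class));
        System.out.println(resolve("data/AutoIdPmlCore.xsd", SchemaResourceResolverSketch.class));
    }
}
```

Note that the URL step only checks that the name parses as a URL; it does not check that the target exists.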

By configuring a schema file for the compiler or runtime to load, the compiler performs these additional services:

Validates the event properties in a statement, ensuring the event property name matches an attribute or element in the XML
Determines the type of the event property allowing event properties to be used in type-sensitive expressions such as expressions involving arithmetic (Note: XPath properties are also typed)
Matches event property names to either element names or attributes

If no schema resource is specified, none of the event properties specified in statements are validated at compile-time and their type defaults to java.lang.String. Also, attributes are not supported if no schema resource is specified and must thus be declared via XPath expression.

16.4.7.2. Explicit XPath Property

The xpath-property element adds explicitly named event properties to the event type that are computed via an XPath expression. In order for the XPath expression to compile, be sure to specify the default-namespace attribute and use the namespace-prefix element to declare namespace prefixes.

XPath expression properties are strongly typed. The type attribute allows the following values. These values correspond to those declared by javax.xml.xpath.XPathConstants.

number (Note: resolves to a double)
string
boolean
node
nodeset

In case you need your XPath expression to return a type other than the types listed above, an optional cast-to type can be specified. If specified, the operation first obtains the result of the XPath expression as the defined type (number, string, boolean) and then casts or parses the returned type to the specified cast-to type. At runtime, a warning message is logged if the XPath expression returns a result object that cannot be cast or parsed.

The next line shows how to return a long-type property for an XPath expression that returns a string:

desc.addXPathProperty("name", "/element/sub", XPathConstants.STRING, "long");

The equivalent configuration XML is:

<xpath-property property-name="name"  xpath="/element/sub" type="string" cast="long"/>

See Section 16.8, “Type Names” for a list of cast-to type names.
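The two-step behavior of a cast-to type (evaluate as the declared XPath type, then parse to the target type) can be illustrated with the JDK alone. The sketch below is not Esper code; XPathCastSketch is a hypothetical name, and returning null after printing a warning is an assumption about how an unparsable value might be handled.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

// Hypothetical illustration of type="string" combined with cast="long".
final class XPathCastSketch {

    static Long evaluateAsLong(String xml, String expression) {
        String raw;
        try {
            Document doc = DocumentBuilderFactory.newInstance().newDocumentBuilder()
                .parse(new InputSource(new StringReader(xml)));
            XPath xpath = XPathFactory.newInstance().newXPath();
            // Step 1: obtain the result as the declared XPath type (here: string).
            raw = (String) xpath.evaluate(expression, doc, XPathConstants.STRING);
        } catch (Exception ex) {
            throw new IllegalStateException("Failed to evaluate XPath", ex);
        }
        // Step 2: parse the string to the cast-to type.
        try {
            return Long.parseLong(raw.trim());
        } catch (NumberFormatException ex) {
            // Assumption: warn and yield null when the value cannot be parsed.
            System.err.println("Warning: cannot parse '" + raw + "' as long");
            return null;
        }
    }

    public static void main(String[] args) {
        System.out.println(evaluateAsLong("<element><sub>42</sub></element>", "/element/sub"));
    }
}
```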

16.4.7.3. Absolute or Deep Property Resolution

This setting indicates whether, when properties are compiled to XPath expressions, the compilation generates an absolute XPath expression or a deep (find element) XPath expression.

For example, consider the following statement against an event type that is represented by an XML DOM document, assuming the event type GetQuote has been configured with the compiler as an XML DOM event type:

select request, request.symbol from GetQuote

By default, the compiler compiles the "request" property name to an XPath expression "/GetQuote/request". It compiles the nested property named "request.symbol" to an XPath expression "/GetQuote/request/symbol", wherein the root element node is "GetQuote".

By setting absolute property resolution to false, the compiler compiles the "request" property name to an XPath expression "//request". It compiles the nested property named "request.symbol" to an XPath expression "//request/symbol". This enables these elements to be located anywhere in the XML document.

The setting is available in XML via the attribute resolve-properties-absolute.

The configuration API provides the above settings as shown here in a sample code:

Configuration configuration = new Configuration();
ConfigurationCommonEventTypeXMLDOM desc = new ConfigurationCommonEventTypeXMLDOM();
desc.setRootElementName("GetQuote");
desc.setDefaultNamespace("http://services.samples/xsd");
desc.setRootElementNamespace("http://services.samples/xsd");
desc.addNamespacePrefix("m0", "http://services.samples/xsd");
desc.setXPathResolvePropertiesAbsolute(false);
configuration.getCommon().addEventType("GetQuote", desc);
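The practical difference between the absolute and the deep form shows up when the element of interest is not directly under the document root. The following sketch evaluates both XPath forms from this section with the JDK's XPath API; the class name AbsoluteVsDeepSketch and the Envelope wrapper document are assumptions made for illustration, and namespaces are left out for brevity.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

// Hypothetical comparison of absolute and deep XPath expressions; not Esper code.
final class AbsoluteVsDeepSketch {

    // Evaluates the expression and returns its string value ("" when nothing matches).
    static String eval(String xml, String expression) {
        try {
            Document doc = DocumentBuilderFactory.newInstance().newDocumentBuilder()
                .parse(new InputSource(new StringReader(xml)));
            return XPathFactory.newInstance().newXPath().evaluate(expression, doc);
        } catch (Exception ex) {
            throw new IllegalStateException("Failed to evaluate XPath", ex);
        }
    }

    public static void main(String[] args) {
        // GetQuote is wrapped in another element, so it is not the document root.
        String wrapped =
            "<Envelope><GetQuote><request><symbol>IBM</symbol></request></GetQuote></Envelope>";
        // Absolute form: requires GetQuote to be the root element, so nothing matches here.
        System.out.println("absolute: '" + eval(wrapped, "/GetQuote/request/symbol") + "'");
        // Deep form: finds request/symbol anywhere in the document.
        System.out.println("deep:     '" + eval(wrapped, "//request/symbol") + "'");
    }
}
```

The deep form is more tolerant of document structure, while the absolute form pins the path to the configured root element.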

16.4.7.4. XPath Variable and Function Resolver

If your XPath expressions require variables or functions, your application may provide the class name of an XPathVariableResolver and XPathFunctionResolver. At type initialization time the compiler and runtime instantiate the resolver instances and provide these to the XPathFactory.

This example shows the API to set this configuration.

ConfigurationCommonEventTypeXMLDOM desc = new ConfigurationCommonEventTypeXMLDOM();
desc.setXPathFunctionResolver(MyXPathFunctionResolver.class.getName());
desc.setXPathVariableResolver(MyXPathVariableResolver.class.getName());
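For readers unfamiliar with the javax.xml.xpath resolver interfaces, here is a minimal sketch of a variable resolver and of how an XPathFactory uses it. The names VariableResolverSketch, MinPriceVariableResolver and the $minPrice variable are hypothetical. A resolver that is configured by class name presumably also needs to be instantiable by the runtime (for example a public class with a no-argument constructor); that requirement is an assumption here.

```java
import java.io.StringReader;
import javax.xml.namespace.QName;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathFactory;
import javax.xml.xpath.XPathVariableResolver;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

// Hypothetical demonstration of an XPathVariableResolver with the JDK XPath API.
final class VariableResolverSketch {

    // Supplies the value of the XPath variable $minPrice.
    static final class MinPriceVariableResolver implements XPathVariableResolver {
        @Override
        public Object resolveVariable(QName variableName) {
            return "minPrice".equals(variableName.getLocalPart()) ? Double.valueOf(10.0) : null;
        }
    }

    // Returns the id of the first item priced above $minPrice ("" when none qualifies).
    static String firstItemAbove(String xml) {
        try {
            Document doc = DocumentBuilderFactory.newInstance().newDocumentBuilder()
                .parse(new InputSource(new StringReader(xml)));
            XPathFactory factory = XPathFactory.newInstance();
            // XPath objects created from this factory use the resolver by default.
            factory.setXPathVariableResolver(new MinPriceVariableResolver());
            XPath xpath = factory.newXPath();
            return xpath.evaluate("/order/item[@price > $minPrice]/@id", doc);
        } catch (Exception ex) {
            throw new IllegalStateException("Failed to evaluate XPath", ex);
        }
    }

    public static void main(String[] args) {
        String xml = "<order><item id=\"a\" price=\"5\"/><item id=\"b\" price=\"25\"/></order>";
        System.out.println(firstItemAbove(xml));
    }
}
```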

16.4.7.5. Auto Fragment

This option is for use when an XSD schema is provided and determines whether the compiler automatically creates an event type when a property expression transposes a property that is a complex type according to the schema.

An example:

ConfigurationCommonEventTypeXMLDOM desc = new ConfigurationCommonEventTypeXMLDOM();
desc.setAutoFragment(false);

16.4.7.6. XPath Property Expression

By default the compiler and runtime employ the built-in DOM walker implementation to evaluate XPath expressions, which is not namespace-aware.

This configuration setting, when set to true, instructs the compiler to rewrite property expressions into XPath.

An example:

ConfigurationCommonEventTypeXMLDOM desc = new ConfigurationCommonEventTypeXMLDOM();
desc.setXPathPropertyExpr(true);

16.4.7.7. Event Sender Setting

By default an EventSender for a given XML event type validates the root element name for which the type has been declared against the one provided by the org.w3c.dom.Node sent into the runtime.

This configuration setting, when set to false, instructs an EventSender to not validate.

An example:

ConfigurationCommonEventTypeXMLDOM desc = new ConfigurationCommonEventTypeXMLDOM();
desc.setEventSenderValidatesRoot(false);

16.4.7.8. Start and End Timestamp

You may configure the names of the properties that provide the event start timestamp and the event end timestamp as part of the configuration.

An example that configures startts as the property name providing the start timestamp and endts as the property name providing the end timestamp:

ConfigurationCommonEventTypeXMLDOM desc = new ConfigurationCommonEventTypeXMLDOM();
desc.setStartTimestampPropertyName("startts");
desc.setEndTimestampPropertyName("endts");

16.4.8. Event Type Defaults

16.4.8.1. Default Event Representation

The default event representation is the Map event representation.

The default event representation is relevant when your query outputs individual properties to a listener and it does not specify a specific event representation in an annotation. The default event representation is also relevant for create schema and create window.

Note that the compiler may still use the Map representation for certain types of statements even when the default event representation is object array.

For example, consider the following statement:

select propertyOne, propertyTwo from MyEvent

Listeners to the statement above currently receive a Map-type event. By setting the configuration flag to object-array or Avro as described herein, listeners to the statement receive an Object-array-type event or an Avro-type event instead.

The XML snippet below is an example of setting the default event representation to Object-array:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <event-meta>
      <event-representation type="objectarray"/> <!-- use "avro" for Avro -->
    </event-meta>
  </common>
</esper-configuration>

The code snippet shown next sets the default event representation to Object-array in the configuration object:

Configuration configuration = new Configuration();
configuration.getCommon().getEventMeta().
    setDefaultEventRepresentation(EventUnderlyingType.OBJECTARRAY);

16.4.8.2. Avro Settings

This configuration controls compiler settings in respect to Avro.

The enable-avro setting is boolean-typed and is true by default. It controls whether Avro is enabled or disabled. If disabled the compiler and runtime disallow registering Avro event types or using an Avro event representation.

The enable-native-string setting is boolean-typed and is true by default. It controls whether, when the compiler generates an Avro schema, the field schema for String-type values adds the property avro.java.string with the value String.

The enable-schema-default-nonnull setting is boolean-typed and is true by default. It controls whether the compiler assembles non-null Avro schemas (true) or nullable (union) Avro schemas (false).

The objectvalue-typewidener-factory-class setting is a fully-qualified class name of the class implementing the com.espertech.esper.common.client.hook.type.ObjectValueTypeWidenerFactory interface and is null by default. If specified the factory can provide a type widener for widening, coercing or transforming any object value to an Avro field value.

The type-representation-mapper-class setting is a fully-qualified class name of the class implementing the com.espertech.esper.common.client.hook.type.TypeRepresentationMapper interface and is null by default. If specified the implementation can provide for a given class the Avro schema for the field.

The XML snippet below is an example of Avro settings that configures the same as the default values:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <event-meta>
      <avro-settings enable-avro="true" enable-native-string="true" enable-schema-default-nonnull="true"
          objectvalue-typewidener-factory-class=""
          type-representation-mapper-class=""/>
    </event-meta>
  </common>
</esper-configuration>

The code snippet shown next applies the same Avro settings through the configuration API:

Configuration configuration = new Configuration();
configuration.getCommon().getEventMeta().getAvroSettings().setEnableAvro(true);
configuration.getCommon().getEventMeta().getAvroSettings().setEnableNativeString(true);
configuration.getCommon().getEventMeta().getAvroSettings().setEnableSchemaDefaultNonNull(true);
configuration.getCommon().getEventMeta().getAvroSettings().setObjectValueTypeWidenerFactoryClass(null);
configuration.getCommon().getEventMeta().getAvroSettings().setTypeRepresentationMapperClass(null);

16.4.8.3. Java Class Property Names, Case Sensitivity and Accessor Style

These settings control case sensitivity and accessor style for all event classes as a default. The two settings are found under class-property-resolution under event-meta in the XML common configuration.

To control the case sensitivity as discussed in Section 16.4.3.4, “Case Sensitivity and Property Names”, add the style attribute in the XML configuration to set a default case sensitivity applicable to all event classes unless specifically overridden by class-specific configuration. The default case sensitivity is case_sensitive (case sensitivity turned on).

To control the accessor style as discussed in Section 16.4.3.2, “Non-JavaBean and Legacy Java Event Classes”, add the accessor-style attribute in the XML configuration to set a default accessor style applicable to all event classes unless specifically overridden by class-specific configuration. The default accessor style is javabean (JavaBean accessor style).

The next code snippet shows how to control this feature via the API:

Configuration config = new Configuration();
config.getCommon().getEventMeta().setClassPropertyResolutionStyle(
    PropertyResolutionStyle.CASE_INSENSITIVE);
config.getCommon().getEventMeta().setDefaultAccessorStyle(
    AccessorStyle.PUBLIC);

16.4.9. Event Type Import Package (Event Type Auto-Name)

Via this configuration an application can make the Java package or packages that contain an application's Java event classes known. Thereby an application can use create schema name as simple-classname and the compiler can find the class.

The XML configuration for defining the Java packages that contain Java event classes is:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <event-type-auto-name package-name="com.mycompany.order.event"/>
  </common>
</esper-configuration>

The same configuration but using the Configuration class:

Configuration config = new Configuration();
config.getCommon().addEventTypeAutoName("com.mycompany.order.event");
// ... or ...
config.getCommon().addEventTypeAutoName(MyEvent.class.getPackage().getName());

16.4.10. From-Clause Method Invocation

Method invocations are allowed in the from clause in EPL, such that your application may join event streams to the data returned by a web service, or to data read from a distributed cache or object-oriented database, or obtain data by other means. A local cache may be placed in front of such method invocations through the configuration settings described herein.

The LRU cache is described in detail in Section 16.4.11.6.1, “LRU Cache”. The expiry-time cache documentation can be found in Section 16.4.11.6.2, “Expiry-Time Cache”.

The next XML snippet is a sample cache configuration that applies to methods provided by the classes 'MyFromClauseLookupLib' and 'MyFromClauseWebServiceLib'. The XML and API configuration understand both the fully-qualified Java class name, as well as the simple class name:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <method-reference class-name="com.mycompany.MyFromClauseLookupLib">
      <expiry-time-cache max-age-seconds="10" purge-interval-seconds="10" ref-type="weak"/>
    </method-reference> 	
    <method-reference class-name="MyFromClauseWebServiceLib">
      <lru-cache size="1000"/>
    </method-reference>
  </common>
</esper-configuration>

16.4.11. Relational Database Access

For NEsper .NET also see Section I.17, “.NET Configurations - Relational Database Access”.

EPL has the capability to join event streams against historical data sources, such as a relational database. This section describes the configuration entries that the compiler or runtime require to access data stored in your database. Please see Section 5.13, “Accessing Relational Data via SQL” for information on the use of EPL queries that include historical data sources.

EPL queries that poll data from a relational database specify the name of the database as part of the statement. The compiler and runtime use the configuration information described here to resolve the database name in the statement to database settings. The required and optional database settings are summarized below.

Database connections can be obtained via a JDBC javax.sql.DataSource, via java.sql.DriverManager, or via a data source factory. Configuring one of these methods of obtaining database connections is required.
Optionally, JDBC connection-level settings such as auto-commit, transaction isolation level, read-only and the catalog name can be defined.
Optionally, a connection lifecycle can be set to indicate to the runtime whether the runtime must retain connections or must obtain a new connection for each lookup and close the connection when the lookup is done (pooled).
Optionally, define a cache policy to allow the runtime to retrieve data from a query cache, reducing the number of query executions.

Some of the settings can have important performance implications that need to be carefully considered in relationship to your database software, JDBC driver and runtime environment. This section attempts to outline such implications where appropriate.

The sample XML configuration file in the "etc" folder can be used as a template for configuring database settings. All settings are also available by means of the configuration API through the classes Configuration and ConfigurationCommonDBRef.

16.4.11.1. Connections Obtained via DataSource

This configuration causes the compiler or runtime to obtain a database connection from a javax.sql.DataSource available from your JNDI provider.

The setting is most useful when running within an application server or when a JNDI directory is otherwise present in your Java VM. If your application environment does not provide an available DataSource, the next section outlines how to use Apache DBCP as a DataSource implementation with connection pooling options and outlines how to use a custom factory for DataSource implementations.

If your DataSource provides connections out of a connection pool, your configuration should set the connection lifecycle setting to pooled.

The snippet of XML below configures a database named mydb1 to obtain connections via a javax.sql.DataSource. The datasource-connection element instructs the runtime to obtain new connections to the database mydb1 by performing a lookup via javax.naming.InitialContext for the given object lookup name. Optional environment properties for the InitialContext are also shown in the example.

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <database-reference name="mydb1">
      <datasource-connection context-lookup-name="java:comp/env/jdbc/mydb">
        <env-property name="java.naming.factory.initial" value ="com.myclass.CtxFactory"/>
        <env-property name="java.naming.provider.url" value ="iiop://localhost:1050"/>
      </datasource-connection>
    </database-reference>
  </common>
</esper-configuration>

To help you better understand how the runtime uses this information to obtain connections, please look at the logic below.

if (envProperties.size() > 0) {
  initialContext = new InitialContext(envProperties);
}
else {
  initialContext = new InitialContext();
}
DataSource dataSource = (DataSource) initialContext.lookup(lookupName);
Connection connection = dataSource.getConnection();

In order to plug in your own implementation of the DataSource interface, your application may use an existing JNDI provider as provided by an application server if running in a J2EE environment.

In case your application does not have an existing JNDI implementation to register a DataSource to provide connections, you may set the java.naming.factory.initial property in the configuration to point to your application's own implementation of the javax.naming.spi.InitialContextFactory interface that can return your application DataSource through the javax.naming.Context provided by the factory implementation. Please see the Java Naming and Directory Interface (JNDI) API documentation for further information.

16.4.11.2. Connections Obtained via DataSource Factory

This section describes how to use Apache Commons Database Connection Pooling (Apache DBCP). It explains how to provide a custom application-specific DataSource factory if not using Apache DBCP.

If your DataSource provides connections out of a connection pool, your configuration should set the connection lifecycle setting to pooled.

Apache DBCP provides comprehensive means to test for dead connections or grow and shrink a connection pool. Configuration properties for Apache DBCP can be found at Apache DBCP configuration. The listed properties are passed to Apache DBCP via the properties list provided as part of the configuration.

The snippet of XML below is an example that configures a database named mydb3 to obtain connections via the pooling DataSource provided by Apache DBCP BasicDataSourceFactory.

The listed properties are passed to DBCP to instruct DBCP how to manage the connection pool. The settings below initialize the connection pool to 2 connections and provide the validation query select 1 from dual for DBCP to validate a connection before providing a connection from the pool:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <database-reference name="mydb3">
      <!-- For a complete list of properties see Apache DBCP. -->
      <!-- NOTE: "dbcp2" applies to api-2.0 of DBCP, use "dbcp" otherwise. -->
      <datasourcefactory-connection class-name="org.apache.commons.dbcp2.BasicDataSourceFactory">	
        <env-property name="username" value ="myusername"/>
        <env-property name="password" value ="mypassword"/>
        <env-property name="driverClassName" value ="com.mysql.jdbc.Driver"/>
        <env-property name="url" value ="jdbc:mysql://localhost/test"/>
        <env-property name="initialSize" value ="2"/>
        <env-property name="validationQuery" value ="select 1 from dual"/>
      </datasourcefactory-connection>
      <connection-lifecycle value="pooled"/>
    </database-reference>
  </common>
</esper-configuration>

The same configuration options provided through the API:

Properties props = new Properties();
props.put("username", "myusername");
props.put("password", "mypassword");
props.put("driverClassName", "com.mysql.jdbc.Driver");
props.put("url", "jdbc:mysql://localhost/test");
props.put("initialSize", "2");  // string value so that Properties.getProperty can return it
props.put("validationQuery", "select 1 from dual");

ConfigurationCommonDBRef configDB = new ConfigurationCommonDBRef();
// BasicDataSourceFactory is an Apache DBCP import
configDB.setDataSourceFactory(props, BasicDataSourceFactory.class.getName());
configDB.setConnectionLifecycleEnum(ConfigurationCommonDBRef.ConnectionLifecycleEnum.POOLED);

Configuration configuration = new Configuration();
configuration.getCommon().addDatabaseReference("mydb3", configDB);

Apache Commons DBCP is a separate download and not provided as part of the distribution. The Apache Commons DBCP jar file requires the Apache Commons Pool jar file.

Your application can provide its own factory implementation for DataSource instances: Set the class name to the name of the application class that provides a public static method named createDataSource which takes a single Properties object as parameter and returns a DataSource implementation. For example:

configDB.setDataSourceFactory(props, MyOwnDataSourceFactory.class.getName());
...
class MyOwnDataSourceFactory {
  public static DataSource createDataSource(Properties properties) {
    return new MyDataSourceImpl(properties);
  }
}

16.4.11.3. Connections Obtained via DriverManager

The next snippet of XML configures a database named mydb2 to obtain connections via java.sql.DriverManager. The drivermanager-connection element instructs the runtime to obtain new connections to the database mydb2 by means of Class.forName and DriverManager.getConnection using the class name, URL and optional username, password and connection arguments.

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <database-reference name="mydb2">
      <drivermanager-connection class-name="my.sql.Driver" 
            url="jdbc:mysql://localhost/test?user=root&amp;password=mypassword" 
            user="myuser" password="mypassword">
        <connection-arg name="user" value ="myuser"/>
        <connection-arg name="password" value ="mypassword"/>
        <connection-arg name="somearg" value ="someargvalue"/>
      </drivermanager-connection>
    </database-reference>
  </common>
</esper-configuration>

The username and password are shown in multiple places in the XML only as an example. Please check with your database software on the required information in URL and connection arguments.

16.4.11.4. Connection-Level Settings

Additional connection-level settings can optionally be provided to the runtime which the runtime will apply to new connections. When the runtime obtains a new connection, it applies only those settings to the connection that are explicitly configured. The runtime leaves all other connection settings at default values.

The XML below is a sample of all available configuration settings. Please refer to the Java API JavaDocs for java.sql.Connection for more information on each option or check the documentation of your JDBC driver and database software.

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <database-reference name="mydb2">
    <!-- ... configure data source or driver manager settings... -->
      <connection-settings auto-commit="true" catalog="mycatalog" 
          read-only="true" transaction-isolation="1" />
    </database-reference>
  </common>
</esper-configuration>

The read-only setting can be used to indicate to your database runtime that SQL statements are read-only. The transaction-isolation and auto-commit settings help your database software perform the right level of locking and lock release. Consider setting these values to reduce transactional overhead in your database queries.
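The connection-level settings correspond to setter methods on java.sql.Connection, and the transaction-isolation value of 1 in the sample matches the JDBC constant Connection.TRANSACTION_READ_UNCOMMITTED. The following sketch shows how only the explicitly configured settings might be applied to a new connection. The class ConnectionSettingsSketch and the use of null to mean "not configured" are assumptions for illustration, not the Esper implementation.

```java
import java.sql.Connection;
import java.sql.SQLException;

// Hypothetical helper: applies only explicitly configured settings to a JDBC connection.
final class ConnectionSettingsSketch {

    // A null argument means the setting was not configured and the driver default is kept.
    static void apply(Connection connection, Boolean autoCommit, String catalog,
                      Boolean readOnly, Integer transactionIsolation) throws SQLException {
        if (autoCommit != null) {
            connection.setAutoCommit(autoCommit);
        }
        if (catalog != null) {
            connection.setCatalog(catalog);
        }
        if (readOnly != null) {
            connection.setReadOnly(readOnly);
        }
        if (transactionIsolation != null) {
            connection.setTransactionIsolation(transactionIsolation);
        }
    }

    // Maps the integer used in the transaction-isolation attribute to the JDBC constant name.
    static String isolationName(int level) {
        switch (level) {
            case Connection.TRANSACTION_NONE: return "TRANSACTION_NONE";
            case Connection.TRANSACTION_READ_UNCOMMITTED: return "TRANSACTION_READ_UNCOMMITTED";
            case Connection.TRANSACTION_READ_COMMITTED: return "TRANSACTION_READ_COMMITTED";
            case Connection.TRANSACTION_REPEATABLE_READ: return "TRANSACTION_REPEATABLE_READ";
            case Connection.TRANSACTION_SERIALIZABLE: return "TRANSACTION_SERIALIZABLE";
            default: throw new IllegalArgumentException("Unknown isolation level: " + level);
        }
    }

    public static void main(String[] args) {
        System.out.println(isolationName(1));
    }
}
```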

16.4.11.5. Connection Lifecycle Settings

By default the runtime retains a separate database connection for each started statement. However, it is possible to override this behavior and require the runtime to obtain a new database connection for each lookup, and to close that database connection after the lookup is completed. This often makes sense when you have a large number of statements and require pooling of connections via a connection pool.

In the pooled setting, the runtime obtains a database connection from the data source or driver manager for every query, and closes the connection when done, returning the database connection to the pool if using a pooling data source.

In the retain setting, the runtime retains a separate dedicated database connection for each statement and does not close the connection between uses.

The XML for this option is below. The connection lifecycle allows the following values: pooled and retain.

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <database-reference name="mydb2">
    <!-- ... configure data source or driver manager settings... -->
        <connection-lifecycle value="pooled"/>
    </database-reference>
  </common>
</esper-configuration>
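The difference between the pooled and retain lifecycles can be made concrete with a small, self-contained model. The sketch below uses stand-in classes instead of JDBC so that it runs without a database; every name in it (ConnectionLifecycleSketch, FakeDataSource, FakeConnection, RetainingStatement) is hypothetical and none of it is Esper code.

```java
// Hypothetical model of the two connection lifecycles, using stand-ins for JDBC types.
final class ConnectionLifecycleSketch {

    // Stand-in for a data source or driver manager; counts opened and closed connections.
    static final class FakeDataSource {
        int opened;
        int closed;

        FakeConnection open() {
            opened++;
            return new FakeConnection(this);
        }
    }

    // Stand-in for a JDBC connection.
    static final class FakeConnection implements AutoCloseable {
        private final FakeDataSource owner;

        FakeConnection(FakeDataSource owner) {
            this.owner = owner;
        }

        String query(String key) {
            return "row-for-" + key;
        }

        @Override
        public void close() {
            owner.closed++; // with a pooling data source this would return the connection to the pool
        }
    }

    // "pooled": obtain a connection for every lookup and close it when the lookup is done.
    static String lookupPooled(FakeDataSource dataSource, String key) {
        try (FakeConnection connection = dataSource.open()) {
            return connection.query(key);
        }
    }

    // "retain": one dedicated connection per statement, kept open between lookups.
    static final class RetainingStatement {
        private final FakeConnection connection;

        RetainingStatement(FakeDataSource dataSource) {
            this.connection = dataSource.open();
        }

        String lookup(String key) {
            return connection.query(key);
        }
    }

    public static void main(String[] args) {
        FakeDataSource pooled = new FakeDataSource();
        lookupPooled(pooled, "a");
        lookupPooled(pooled, "b");
        System.out.println("pooled opened=" + pooled.opened + " closed=" + pooled.closed);
    }
}
```

With many statements the retain model holds one open connection per statement, which is why the pooled model combined with a pooling data source is often preferable in that situation.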

16.4.11.6. Cache Settings

Cache settings can dramatically reduce the number of database queries that the runtime executes for statements. If no cache setting is specified, the runtime does not cache query results and executes a separate database query for every event.

Caches store the results of database queries and make these results available to subsequent queries using the exact same query parameters as the query for which the result was stored. If your query returns one or more rows, the cache keeps the result rows of the query keyed to the parameters of the query. If your query returns no rows, the cache also keeps the empty result. Query results are held by a cache until the cache entry is evicted. The strategies available for evicting cached query results are listed next.

16.4.11.6.1. LRU Cache

The least-recently-used (LRU) cache is configured by a maximum size. The cache discards the least recently used query results first once the cache reaches the maximum size.

The XML configuration entry for an LRU cache is as below. This entry configures an LRU cache holding up to 1000 query results.

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <database-reference name="mydb">
    <!-- ... configure data source or driver manager settings... -->
      <lru-cache size="1000"/>
    </database-reference>  
  </common>
</esper-configuration>
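As a reminder of how least-recently-used eviction behaves, the following is a minimal LRU map built on java.util.LinkedHashMap in access order. It is a generic sketch and not the cache implementation used by the runtime; the class name LruQueryCacheSketch is hypothetical.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical LRU cache keyed by query parameters; a generic sketch, not Esper's implementation.
final class LruQueryCacheSketch<K, V> extends LinkedHashMap<K, V> {
    private static final long serialVersionUID = 1L;
    private final int maxSize;

    LruQueryCacheSketch(int maxSize) {
        super(16, 0.75f, true); // true = iterate in access order, least recently used first
        this.maxSize = maxSize;
    }

    // Called by LinkedHashMap after each insertion; evicts the least recently used entry.
    @Override
    protected boolean removeEldestEntry(Map.Entry<K, V> eldest) {
        return size() > maxSize;
    }

    public static void main(String[] args) {
        LruQueryCacheSketch<String, String> cache = new LruQueryCacheSketch<>(2);
        cache.put("a", "rows for a");
        cache.put("b", "rows for b");
        cache.get("a");               // "a" becomes the most recently used entry
        cache.put("c", "rows for c"); // exceeds the maximum size, so "b" is evicted
        System.out.println(cache.keySet());
    }
}
```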

16.4.11.6.2. Expiry-Time Cache

The expiry-time cache is configured by a maximum age in seconds, a purge interval and an optional reference type. The cache discards (on the get operation) any query results that are older than the maximum age so that stale data is not used. If the cache is not empty, then every purge interval number of seconds the runtime purges any expired entries from the cache.

The XML configuration entry for an expiry-time cache is as follows. The example configures an expiry-time cache in which prior query results are valid for 60 seconds and which the runtime inspects every 2 minutes to remove query results older than 60 seconds.

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <database-reference name="mydb">
    <!-- ... configure data source or driver manager settings... -->
        <expiry-time-cache max-age-seconds="60" purge-interval-seconds="120" />
    </database-reference>
  </common>
</esper-configuration>

By default, the expiry-time cache is backed by a java.util.WeakHashMap and thus relies on weak references. That means that cached SQL results can be freed during garbage collection.

Via XML or using the configuration API the type of reference can be configured to not allow entries to be garbage collected, by setting the ref-type property to hard:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <database-reference name="mydb">
    <!-- ... configure data source or driver manager settings... -->
        <expiry-time-cache max-age-seconds="60" purge-interval-seconds="120" ref-type="hard"/>
    </database-reference>
  </common>
</esper-configuration>

The last setting for the cache reference type is soft: This strategy allows the garbage collection of cache entries only when all other weak references have been collected.

16.4.11.7. Column Change Case

This setting instructs the compiler to convert to lower- or uppercase any output column names returned by your database system. When using Oracle relational database software, for example, column names can be changed to lowercase via this setting.

A sample XML configuration entry for this setting is:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <database-reference name="mydb">
    <!-- ... configure data source or driver manager settings... -->
      <column-change-case value="lowercase"/>
    </database-reference>
  </common>
</esper-configuration>

16.4.11.8. SQL Types Mapping

For NEsper .NET this section is not applicable.

By providing a mapping of SQL types (java.sql.Types) to Java built-in types your code can avoid using sometimes awkward default database types and can easily change the way the compiler returns Java types for columns returned by a SQL query.

The mapping maps a constant as defined by java.sql.Types to a Java built-in type of any of the following Java type names: String, BigDecimal, Boolean, Byte, Short, Int, Long, Float, Double, ByteArray, SqlDate, SqlTime, SqlTimestamp. The Java type names are not case-sensitive.

A sample XML configuration entry for this setting is shown next. The sample maps Types.NUMERIC which is a constant value of 2 per JDBC API to the Java int type.

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <database-reference name="mydb">
    <!-- ... configure data source or driver manager settings... -->
      <sql-types-mapping sql-type="2" java-type="int" />
    </database-reference>
  </common>
</esper-configuration>
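The sql-type attribute takes the integer value of a java.sql.Types constant. As a convenience, the following plain-Java sketch looks up that integer value by constant name. The class SqlTypeCodes and the method codeOf are hypothetical helpers and not part of the Esper API.

```java
import java.lang.reflect.Field;
import java.sql.Types;
import java.util.Locale;

// Hypothetical helper for finding the integer code of a java.sql.Types constant.
class SqlTypeCodes {
    // Resolves a constant name such as "NUMERIC" to its integer value.
    static int codeOf(String constantName) {
        try {
            Field field = Types.class.getField(constantName.toUpperCase(Locale.ROOT));
            return field.getInt(null);
        } catch (ReflectiveOperationException e) {
            throw new IllegalArgumentException("Unknown java.sql.Types constant: " + constantName, e);
        }
    }
}
```

For example, codeOf("NUMERIC") returns the same value as Types.NUMERIC, which is the value used in the sample configuration above.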

16.4.11.9. Metadata Origin

This setting controls how the compiler retrieves SQL statement metadata from JDBC prepared statements.

Table 16.3. Metadata Origin Options

Option	Description
default	By default, the compiler detects the driver name and queries prepared statement metadata if the driver is not an Oracle database driver. For Oracle drivers, the compiler uses lexical analysis of the SQL statement to construct a sample SQL statement and then fires that statement to retrieve statement metadata.
metadata	The compiler always queries prepared statement metadata regardless of the database driver used.
sample	The compiler always uses lexical analysis of the SQL statement to construct a sample SQL statement, and then fires that statement to retrieve statement metadata.

16.4.12. Common Settings Related to Logging

16.4.12.1. Query Plan Logging

By default, the compiler does not produce query plan output unless logging at debug-level. To enable query plan logging, set this option in the configuration. When enabled, the compiler reports, at INFO level, any query plans under the log name com.espertech.esper.queryplan.

Query plan logging is applicable to subqueries, joins (any type), named window and table on-actions (on-select, on-merge, on-insert, on-update) and fire-and-forget queries. It is not applicable to, and provides no additional information for, other types of constructs.

The API to use to enable query plan logging is shown here:

Configuration config = new Configuration();
config.getCommon().getLogging().setEnableQueryPlan(true);

The XML snippet is:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <logging>
      <query-plan enabled="true"/>
    </logging>
  </common>
</esper-configuration>

16.4.12.2. JDBC Logging

By default, the compiler and runtime do not measure JDBC query execution times or report the number of rows returned from a JDBC query through logging. To enable JDBC logging, set this option in the configuration. When enabled, the compiler and runtime report, at INFO level, JDBC query performance and the number of rows returned under the log name com.espertech.esper.jdbc.

The API to use to enable JDBC query logging is shown here:

Configuration config = new Configuration();
config.getCommon().getLogging().setEnableJDBC(true);

The XML snippet is:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
     <logging>
       <jdbc enabled="true"/>
    </logging>
  </common>
</esper-configuration>

16.4.13. Common Settings Related to Time Source

16.4.13.1. Time Unit

The default time resolution is milliseconds. Your application may set the time resolution to microseconds instead.

A sample XML configuration for millisecond time resolution is:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <time-source>
      <time-unit value="milliseconds"/>
    </time-source>
  </common>
</esper-configuration>

The equivalent code snippet using the configuration API is here:

Configuration config = new Configuration();
config.getCommon().getTimeSource().setTimeUnit(TimeUnit.MILLISECONDS);

16.4.14. Variables

Variables can be created dynamically in EPL via the create variable syntax but can also be configured.

A variable is declared by specifying a variable name, the variable type, an optional initialization value and an optional boolean-type flag indicating whether the variable is a constant (false by default). The initialization value can be of the same or a compatible type as the variable type, or can be a String value that, when parsed, is compatible with the type declared for the variable. For the best performance, declare a variable as a constant.

In an XML configuration file the variable configuration may look as below.

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <variable name="var_threshold" type="long" initialization-value="100"/>
    <variable name="var_key" type="string"/>
    <variable name="test" type="int" constant="true"/>  
  </common>
</esper-configuration>

Please find the list of valid values for the type attribute in Section 16.8, “Type Names”.

16.4.15. Variant Stream

A variant stream is a predefined stream into which events of multiple disparate event types can be inserted, and which can be selected from in patterns and the from clause.

The name of the variant stream and, optionally, the type of events that the stream may accept, are part of the stream definition. By default, the variant stream accepts only the predefined event types. The compiler validates your insert into clause which inserts into the variant stream against the predefined types.

A variant stream can be set to accept any type of event, in which case all properties of the variant stream are effectively dynamic properties. Set the type variance flag to ANY to indicate the variant stream accepts any type of event.

The following XML configuration defines a variant stream by name OrderStream that carries only PartsOrder and ServiceOrder events:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <common>
    <variant-stream name="OrderStream">
      <variant-event-type name="PartsOrder"/>
      <variant-event-type name="ServiceOrder"/>
    </variant-stream>
  </common>
</esper-configuration>

This code snippet sets up a variant stream by name OutgoingEvent:

Configuration config = new Configuration();
ConfigurationCommonVariantStream variant = new ConfigurationCommonVariantStream();
variant.setTypeVariance(ConfigurationCommonVariantStream.TypeVariance.ANY);
config.getCommon().addVariantStream("OutgoingEvent", variant);

If specifying variant event type names, make sure such names have been configured for JavaBean, Map or XML events.

16.5. Configuration Compiler

16.5.1. Compiler Settings Related to Byte Code Generation

16.5.1.1. Byte Code General Settings

The setting include-debugsymbols is false by default. It controls whether the compiler generates debug symbols as part of the binary class.

The setting include-comments is false by default. It controls whether the compiler generates code that contains additional information to help tracing back generated code to the code that generated it.

The setting attach-epl is true by default. It controls whether the compiler adds the statement text of the statement to statement properties.

The setting attach-module-epl is false by default. It controls whether the compiler adds the EPL module text of the module to module properties.

The setting allow-subscriber is false by default. It controls whether the compiler adds code for handling subscribers. If this flag is false the setSubscriber method on the EPStatement class throws an exception.

The setting threadpool-compiler-num-threads sets the number of threads for compiling a statement to byte code and is eight (8) by default. Setting this value to zero disables multi-threading for compilation. When the number of threads is greater than zero, the calling thread generates classes for statements and the thread pool compiles statement classes to byte code. This setting improves compilation performance only when a module has multiple statements, as the unit of parallelization is the statement.

The setting threadpool-compiler-capacity defines the number of permits (capacity of the queue) for compiling statements to byte code and is unbounded by default. Use null to represent unbounded. The minimum value for capacity is one.

The sample code below sets the same values as the default values:

Configuration configuration = new Configuration();
ConfigurationCompilerByteCode byteCode = configuration.getCompiler().getByteCode();
byteCode.setIncludeDebugSymbols(false);
byteCode.setIncludeComments(false);
byteCode.setAttachEPL(true);
byteCode.setAttachModuleEPL(false);
byteCode.setAllowSubscriber(false);
byteCode.setInstrumented(false);
byteCode.setThreadPoolCompilerNumThreads(8);
byteCode.setThreadPoolCompilerCapacity(null);

The sample XML configuration below also sets default values:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <compiler>
    <bytecode 
      include-comments="false" 
      include-debugsymbols="false"
      attach-epl="true" 
      attach-module-epl="false" 
      instrumented="false" 
      allow-subscriber="false"
      threadpool-compiler-num-threads="8"/>
  </compiler>
</esper-configuration>
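The interplay of threadpool-compiler-num-threads and threadpool-compiler-capacity can be pictured with the following plain-Java sketch. It is one possible reading of the description above and not the Esper compiler's implementation: the class ParallelCompileSketch, the method compileAll and the use of a Semaphore for permits are all assumptions made for illustration. The sketch requires at least one thread and does not model the zero-thread case.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.Semaphore;
import java.util.function.Function;

// Hypothetical sketch of statement-level parallel compilation with permits.
class ParallelCompileSketch {
    static <S, B> List<B> compileAll(List<S> statements, Function<S, B> compileStep,
                                     int numThreads, Integer capacity) {
        ExecutorService pool = Executors.newFixedThreadPool(numThreads);
        // A null capacity means that the amount of pending work is not bounded.
        Semaphore permits = capacity == null ? null : new Semaphore(capacity);
        try {
            List<Future<B>> futures = new ArrayList<>();
            for (S statement : statements) {
                if (permits != null) {
                    permits.acquire(); // the calling thread waits while no permit is free
                }
                Callable<B> task = () -> {
                    try {
                        return compileStep.apply(statement);
                    } finally {
                        if (permits != null) {
                            permits.release();
                        }
                    }
                };
                futures.add(pool.submit(task));
            }
            List<B> results = new ArrayList<>();
            for (Future<B> future : futures) {
                results.add(future.get()); // results keep the order of the statements
            }
            return results;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        } catch (ExecutionException e) {
            throw new IllegalStateException(e.getCause());
        } finally {
            pool.shutdown();
        }
    }
}
```

Since each statement is one task, a module with a single statement gains nothing from additional threads in this sketch, which matches the remark above that the unit of parallelization is the statement.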

16.5.1.2. Byte Code Modifier Settings

Access modifiers default to private and are listed here. You may also use the @private, @protected and @public annotations or the CompilerOptions object to set access modifiers.

Table 16.4. Byte Code Access Modifiers

Name	Description
`access-modifier-context`	Whether contexts that may be declared by the module are visible to other modules.
`access-modifier-event-type`	Whether event types that may be declared by the module are visible to other modules.
`access-modifier-expression`	Whether expressions that may be declared by the module are visible to other modules.
`access-modifier-named-window`	Whether named windows that may be declared by the module are visible to other modules.
`access-modifier-script`	Whether scripts that may be declared by the module are visible to other modules.
`access-modifier-table`	Whether tables that may be declared by the module are visible to other modules.
`access-modifier-variable`	Whether variables that may be declared by the module are visible to other modules.

The setting bus-modifier-event-type is set to hidden by default. This means that none of the sendEventType methods of EPEventService can be used to process events of event types declared by the module. Set this value to visible to indicate that the respective sendEventType method of EPEventService can process events of event types declared by the module (sendEventType throws an exception if it does not find a visible event type). You may also use the @buseventtype annotation or the CompilerOptions object to set bus event type visibility.

The sample code below sets the same values as the default values:

Configuration configuration = new Configuration();
ConfigurationCompilerByteCode byteCode = configuration.getCompiler().getByteCode();
byteCode.setAccessModifierContext(NameAccessModifier.PRIVATE);
byteCode.setAccessModifierEventType(NameAccessModifier.PRIVATE);
byteCode.setAccessModifierExpression(NameAccessModifier.PRIVATE);
byteCode.setAccessModifierNamedWindow(NameAccessModifier.PRIVATE);
byteCode.setAccessModifierScript(NameAccessModifier.PRIVATE);
byteCode.setAccessModifierTable(NameAccessModifier.PRIVATE);
byteCode.setAccessModifierVariable(NameAccessModifier.PRIVATE);
byteCode.setBusModifierEventType(EventBusVisibility.HIDDEN);

The sample XML configuration below also sets default values:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <compiler>
    <bytecode 
      access-modifier-context="private" 
      access-modifier-event-type="private" 
      access-modifier-expression="private" 
      access-modifier-named-window="private" 
      access-modifier-script="private" 
      access-modifier-table="private" 
      access-modifier-variable="private" 
      bus-modifier-event-type="hidden"/>
  </compiler>
</esper-configuration>

16.5.2. Compiler Settings Related to View Resources

16.5.2.1. Iterator Behavior For Unbound Streams

By default, when using the iterator API to iterate a statement with an unbound stream the runtime returns an empty iterator.

To have the runtime return the last event instead, please use the @IterableUnbound statement annotation or enable the compiler setting as described herein.

A code sample that turns iterable-unbound on is:

Configuration config = new Configuration();
config.getCompiler().getViewResources().setIterableUnbound(true);

16.5.2.2. Configuring Output Rate Limiting Options

This flag impacts output rate limiting as further outlined in Appendix B, Runtime Considerations for Output Rate Limiting. The flag serves to control the default behavior for output rate limiting for all statements that do not specify a hint.

If set to true (the default), all statements behave as if they hint @Hint('enable_outputlimit_opt').

If set to false, all statements behave as if they hint @Hint('disable_outputlimit_opt').

The following code sample sets the flag to true, which is the same as the default value:

Configuration config = new Configuration();
config.getCompiler().getViewResources().setOutputLimitOpt(true);

16.5.3. Compiler Settings Related to Logging

16.5.3.1. Byte Code Generation Logging

By enabling this setting the compiler logs byte code generation information at INFO level. This setting is disabled by default.

The API to use to enable logging for generated code is shown here:

Configuration config = new Configuration();
config.getCompiler().getLogging().setEnableCode(true);

The XML snippet is:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <compiler>
      <logging>
        <code enabled="true"/>
      </logging>
  </compiler>
</esper-configuration>

16.5.4. Compiler Settings Related to Stream Selection

16.5.4.1. Default Statement Stream Selection

Statements can produce both insert stream (new data) and remove stream (old data) results. Remember that insert stream refers to arriving events and new aggregation values, while remove stream refers to events leaving data windows and prior aggregation values. By default, the runtime delivers only the insert stream to listeners and observers of a statement.

There are keywords in the select clause that instruct the runtime to not generate insert stream and/or remove stream results if your application does not need either one of the streams. These keywords are the istream, rstream and the irstream keywords.

By default, the runtime only generates insert stream results, equivalent to using the optional istream keyword in the select clause. If your application requires insert and remove stream results for many statements, your application can add the irstream keyword to the select clause of each statement, or you can set a new default stream selector via this setting.

The XML configuration for this setting is shown below:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <compiler>
    <stream-selection>
      <stream-selector value="irstream" />
    </stream-selection>
  </compiler>
</esper-configuration>

The equivalent code snippet using the configuration API is here:

Configuration config = new Configuration();
config.getCompiler().getStreamSelection()
    .setDefaultStreamSelector(StreamSelector.RSTREAM_ISTREAM_BOTH);

16.5.5. Compiler Settings Related to Language and Locale

Locale-dependence in the compiler can be present in the sort order of string values by the order by clause and by the sort window.

By default, the runtime sorts string values using the compare method, which is not locale-dependent. To enable locale-dependent sorting you must set the configuration flag as described below.

The XML configuration sets the locale dependent sorting as shown below:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <compiler>
    <language sort-using-collator="true"/>
  </compiler>
</esper-configuration>
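The difference between the two sort orders can be seen in plain Java. The sketch below compares String natural ordering with ordering by a java.text.Collator. The class SortOrderSketch is a hypothetical name, and the choice of Locale.ENGLISH is an assumption made for the example; it is not a statement about which locale the runtime uses.

```java
import java.text.Collator;
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.Locale;

// Hypothetical illustration of locale-independent versus collator-based sorting.
class SortOrderSketch {
    // Sorts by String.compareTo, which compares UTF-16 code units.
    static List<String> sortByCompare(List<String> values) {
        List<String> copy = new ArrayList<>(values);
        copy.sort(Comparator.naturalOrder());
        return copy;
    }

    // Sorts by the collation rules of the given locale.
    static List<String> sortByCollator(List<String> values, Locale locale) {
        List<String> copy = new ArrayList<>(values);
        copy.sort(Collator.getInstance(locale));
        return copy;
    }
}
```

For the values banana, Zebra and apple, sorting by compareTo places Zebra first because uppercase letters have lower code units than lowercase letters, while the English collator orders the values alphabetically as apple, banana, Zebra.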

The API to change the setting:

Configuration config = new Configuration();
config.getCompiler().getLanguage().setSortUsingCollator(true);

16.5.6. Compiler Settings Related to Expression Evaluation

16.5.6.1. Integer Division and Division by Zero

By default the compiler returns double-typed values for divisions regardless of operand types. Division by zero returns positive or negative double infinity.

To have the compiler use Java-standard integer division instead, use this setting as described here. In Java integer division, when dividing integer types, the result is an integer type. This means that if you divide an integer unevenly by another integer, the result is the whole-number part: no rounding takes place and the fraction part is dropped. Thus, with Java-standard integer division enabled, the expression 1 / 4 results in an integer zero. Your EPL must then convert at least one of the numbers to a double value before the division, for example by specifying 1.0 / 4 or by using cast(myint, double).

When using Java integer division, division by zero for integer-typed operands always returns null. However division by zero for double-type operands still returns positive or negative double infinity. To also return null upon division by zero for double-type operands, set the flag to true as below (default is false).
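For reference, the following plain-Java sketch shows the Java division behavior that these settings refer to. The class DivisionSketch and its methods are hypothetical. The method intDivideOrNull only mimics the null result described above for integer division by zero; plain Java throws an ArithmeticException in that case.

```java
// Hypothetical illustration of Java division semantics.
class DivisionSketch {
    // Java integer division: the fraction part is dropped, no rounding.
    static int intDivide(int numerator, int denominator) {
        return numerator / denominator;
    }

    // Double division: dividing a non-zero value by zero yields positive or negative infinity.
    static double doubleDivide(double numerator, double denominator) {
        return numerator / denominator;
    }

    // Mimics returning null for integer division by zero instead of throwing.
    static Integer intDivideOrNull(int numerator, int denominator) {
        if (denominator == 0) {
            return null;
        }
        return numerator / denominator;
    }
}
```

Here intDivide(1, 4) yields 0 while doubleDivide(1, 4) yields 0.25, which is why at least one operand must be a double when a fractional result is needed under Java-standard integer division.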

The XML configuration is as follows:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <compiler>
    <expression integer-division="false" division-by-zero-is-null="false"/>
  </compiler>
</esper-configuration>

The API to change the settings:

Configuration config = new Configuration();
config.getCompiler().getExpression().setIntegerDivision(true);
config.getCompiler().getExpression().setDivisionByZeroReturnsNull(true);

16.5.6.2. User-Defined Function or Static Method Cache

By default the runtime caches the result of a user-defined function if the parameter set to that function is empty or all parameters are constant values. Results of custom plug-in single-row functions are not cached according to the default configuration, unless the single-row function is explicitly configured with value cache enabled.

To have the runtime evaluate the user-defined function regardless of constant parameters, set the flag to false as indicated herein.

The XML configuration as below sets the same as the default value:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <compiler>
    <expression udf-cache="true"/>
  </compiler>
</esper-configuration>

16.5.6.3. Extended Built-in Aggregation Functions

By default EPL provides a number of additional aggregation functions over the SQL standards. To have the compiler only allow the standard SQL aggregation functions and not the additional ones, disable the setting as described here.

The XML configuration as below sets the same as the default value:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <compiler>
    <expression extended-agg="true"/>
  </compiler>
</esper-configuration>

16.5.6.4. Duck Typing

By default the compiler validates method references when using the dot operator syntax at time of compilation. With duck typing, the compiler resolves method references at runtime.

The XML configuration as below sets the same as the default value:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <compiler>
    <expression ducktyping="false"/>
  </compiler>
</esper-configuration>
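Resolving a method reference at runtime rather than at compile time can be pictured with Java reflection. The following sketch is an illustration of the general idea only; the class DuckTypingSketch is hypothetical and the code does not show how the runtime implements duck typing.

```java
import java.lang.reflect.Method;

// Hypothetical illustration of resolving a method by name at call time.
class DuckTypingSketch {
    // Looks up a public no-argument method on the target's class and invokes it.
    static Object invoke(Object target, String methodName) {
        try {
            Method method = target.getClass().getMethod(methodName);
            return method.invoke(target);
        } catch (ReflectiveOperationException e) {
            throw new IllegalArgumentException(
                "Cannot invoke '" + methodName + "' on " + target.getClass().getName(), e);
        }
    }
}
```

In this sketch a misspelled or missing method name is only detected when the call executes, which is the trade-off compared to validation at time of compilation.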

16.5.6.5. Math Context

By default, when computing the average of BigDecimal values, the compiler does not pass a java.math.MathContext. Use the setting herein to specify a default math context.

The below XML configuration sets precision to 2 and rounding mode ceiling:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <compiler>
    <expression math-context="precision=2 roundingMode=CEILING"/>
  </compiler>
</esper-configuration>

An example API configuration is shown next:

Configuration config = new Configuration();
config.getCompiler().getExpression().setMathContext(MathContext.UNLIMITED);
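The effect of a math context on an average of BigDecimal values can be seen in plain Java. The sketch below shows java.math.BigDecimal behavior only; the class BigDecimalAverageSketch is hypothetical and the code makes no claim about how the compiler computes averages internally.

```java
import java.math.BigDecimal;
import java.math.MathContext;
import java.util.List;

// Hypothetical illustration of averaging BigDecimal values with and without a MathContext.
class BigDecimalAverageSketch {
    static BigDecimal average(List<BigDecimal> values, MathContext mathContext) {
        BigDecimal sum = BigDecimal.ZERO;
        for (BigDecimal value : values) {
            sum = sum.add(value);
        }
        BigDecimal count = BigDecimal.valueOf(values.size());
        if (mathContext == null) {
            // BigDecimal.divide without a MathContext throws ArithmeticException
            // when the exact quotient has a non-terminating decimal expansion.
            return sum.divide(count);
        }
        return sum.divide(count, mathContext);
    }
}
```

For the values 1, 1 and 2 the exact average is 4/3. With new MathContext(2, RoundingMode.CEILING), corresponding to the XML sample above, the result is 1.4, whereas the division without a math context fails for these values.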

16.5.7. Compiler Settings Related to Scripts

You may configure a default script dialect as described herein. The default script dialect is js which stands for JavaScript, since most JVMs ship with an integrated JavaScript execution runtime.

The default value for the enabled setting is true thus the compiler allows scripts. By setting enabled to false the compiler disallows script use entirely.

A sample XML configuration for this setting is shown below:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <compiler>
    <scripts default-dialect="js" enabled="true"/>
  </compiler>
</esper-configuration>

A sample code snippet that sets a new script dialect is:

Configuration config = new Configuration();
config.getCompiler().getScripts().setDefaultDialect("js");
config.getCompiler().getScripts().setEnabled(true);

16.5.8. Compiler Settings Related to Execution of Statements

16.5.8.1. Filter Service Max Filter Width

This setting is for performance tuning of filter expression analysis and breakdown.

In the default configuration the setting is 16, which means that the filter expression analyzer can at most create 16 path expressions from a given filter expression. If the number of path expressions is over 16, the expression is instead evaluated as non-path and is not entered into filter indexes.

On the level of a statement, this setting can be controlled by providing a hint. For example:

// The compiler optimizes the filter expression to become:
//   "a=1, c=1" or "b=1, c=1" or "a=1, d=1" or "b=1, d=1".
//   This enables filter index sharing between filter expressions.
select * from Event((a=1 or b=1) and (c=1 or d=1))

// The compiler does not optimize filter expressions
@Hint('MAX_FILTER_WIDTH=0') select * from Event((a=1 or b=1) and (c=1 or d=1))
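The growth in the number of path expressions can be sketched in plain Java. The sketch treats a filter expression as a list of OR-groups combined by AND and expands it into paths, giving up once the number of paths exceeds the maximum width. The class FilterPathSketch and the representation of terms as strings are assumptions made for illustration and do not reflect the filter expression analyzer's actual data structures.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration of expanding "(a or b) and (c or d)" into path expressions.
class FilterPathSketch {
    // Each inner list is one OR-group; the groups are combined with AND.
    // Returns the expanded paths, or null when more than maxWidth paths would result.
    static List<List<String>> expand(List<List<String>> orGroups, int maxWidth) {
        List<List<String>> paths = new ArrayList<>();
        paths.add(new ArrayList<>());
        for (List<String> group : orGroups) {
            List<List<String>> next = new ArrayList<>();
            for (List<String> path : paths) {
                for (String term : group) {
                    List<String> extended = new ArrayList<>(path);
                    extended.add(term);
                    next.add(extended);
                }
            }
            if (next.size() > maxWidth) {
                return null; // too wide: the expression would be evaluated as non-path
            }
            paths = next;
        }
        return paths;
    }
}
```

The number of paths is the product of the group sizes, so two groups of two terms produce four paths, while five groups of two terms produce 32 paths and exceed the default width of 16.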

The XML configuration that sets a new value is:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <compiler>
    <execution filter-service-max-filter-width="100"/>
  </compiler>
</esper-configuration>

The API to change the setting:

Configuration config = new Configuration();
config.getCompiler().getExecution().
    setFilterServiceMaxFilterWidth(16);

16.5.8.2. Declared Expression Value Cache

The enable-declared-expr-value-cache setting is true by default, and the compiler generates code such that it uses a declared-expression cache.

The XML configuration that sets the same value as the default is:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <compiler>
    <execution enable-declared-expr-value-cache="true"/>
  </compiler>
</esper-configuration>

The API to change the setting:

Configuration config = new Configuration();
config.getCompiler().getExecution().
    setEnabledDeclaredExprValueCache(true);

16.6. Configuration Runtime

16.6.1. Runtime Settings Related to Concurrency and Threading

16.6.1.1. Preserving the Order of Events Delivered to Listeners

In multithreaded environments, this setting controls whether dispatches of statement result events to listeners preserve the ordering in which a statement processes events. By default the runtime guarantees that it delivers a statement's result events to statement listeners in the order in which the result is generated. This behavior can be turned off via configuration as below. This behavior applies to stateful statements and not to stateless statements as stateless statements execute lock-free.

The next code snippet shows how to control this feature:

Configuration config = new Configuration();
config.getRuntime().getThreading().setListenerDispatchPreserveOrder(false);

And the XML configuration file can also control this feature by adding the following elements:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <runtime>
    <threading>
      <listener-dispatch preserve-order="true" timeout-msec="1000" locking="spin"/>
    </threading>
  </runtime>
</esper-configuration>

As discussed, by default the runtime can temporarily block another processing thread when delivering result events to listeners in order to preserve the order in which results are delivered to a given statement. The maximum time the runtime blocks a thread can also be configured, and by default is set to 1 second.

There are two techniques for implementing blocking: having the operating system suspend the thread until it is awakened later, or using spin locks. As such delivery locks are typically held for a very short amount of time, the default blocking technique employs a spin lock. While spin locks are CPU-intensive and appear inefficient, a spin lock can be more efficient than suspending the thread and subsequently waking it up, especially if the lock in question is held for a very short time. That is because there is significant overhead to suspending and rescheduling a thread.

The locking technique can be changed to use a blocking strategy that suspends the thread, by means of setting the locking property to 'suspend'.
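The two blocking techniques, each with a timeout, can be sketched in plain Java as follows. The class BlockingTechniqueSketch and its methods are hypothetical and only illustrate the general difference between spinning and suspending; they are not the runtime's dispatch implementation.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicBoolean;

// Hypothetical illustration of spin-waiting versus suspending a thread, each with a timeout.
class BlockingTechniqueSketch {
    // Spin: the waiting thread stays on the CPU and polls until released or timed out.
    static boolean spinAwait(AtomicBoolean released, long timeoutMillis) {
        long deadline = System.nanoTime() + TimeUnit.MILLISECONDS.toNanos(timeoutMillis);
        while (!released.get()) {
            if (System.nanoTime() - deadline >= 0) {
                return false; // timed out
            }
            Thread.onSpinWait();
        }
        return true;
    }

    // Suspend: the thread is parked until released or timed out.
    static boolean suspendAwait(CountDownLatch released, long timeoutMillis) {
        try {
            return released.await(timeoutMillis, TimeUnit.MILLISECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
    }
}
```

Both methods return true when the wait ended because of a release and false when the timeout elapsed. The spinning variant consumes CPU while it waits, which is acceptable only when waits are expected to be very short.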

16.6.1.2. Preserving the Order of Events for Insert-Into Streams

In multithreaded environments, this setting controls whether statements producing events for other statements via insert-into preserve the order of delivery within the producing and consuming statements. This allows statements that consume other statements' events to behave deterministically in multithreaded applications, if the consuming statement requires such determinism. By default, the runtime makes this guarantee (the setting is on). This behavior applies to stateful statements and not to stateless statements as stateless statements execute lock-free.

Take, for example, an application where a single statement (S1) inserts events into a stream that another statement (S2) further evaluates. A multithreaded application may have multiple threads processing events into statement S1. As statement S1 produces events for consumption by statement S2, such results may need to be delivered in the exact order produced as the consuming statement may rely on the order received. For example, if the first statement counts the number of events, the second statement may employ a pattern that inspects counts and thus expect the counts posted by statement S1 to continuously increase by 1 even though multiple threads process events.

The runtime may need to block a thread such that order of delivery is maintained, and statements that require order (such as pattern detection, previous and prior functions) receive a deterministic order of events. The settings available control the blocking technique and parameters. As described in the section immediately prior, the default blocking technique employs spin locks per statement inserting events for consumption, as the locks in question are typically held for a very short time. The 'suspend' blocking technique can be configured and a timeout value can also be defined.

The XML configuration file may change settings via the following elements:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <runtime>
    <threading>
      <insert-into-dispatch preserve-order="true" timeout-msec="100" locking="spin"/>
    </threading>
  </runtime>
</esper-configuration>

16.6.1.3. Preserving the Order of Named Window Dispatches to Named Window Consumer Statements

In multithreaded environments, this setting controls whether named windows producing insert and remove streams for other statements that consume the named window by means of the from-clause preserve the order of delivery within the producing named window and the consuming statements. This allows statements that consume the named window's insert and remove stream events to behave deterministically in multithreaded applications, if the consuming statement requires such determinism. By default, the runtime makes this guarantee (the setting is on) with spin locking and Long.MAX_VALUE as millisecond timeout.

Take, for example, an application where a named window (W1) produces insert and remove stream events that a statement (S1) consumes. A multithreaded application may have multiple threads producing insert and remove stream events for consumption by statement S1. Such results may need to be delivered in the exact order produced by the named window as the consuming statement may rely on the order received.

The runtime may need to block a thread such that order of delivery is maintained, and statements that require order receive a deterministic order of events. The settings available control the blocking technique and parameters. As described in the sections prior, the default blocking technique employs spin locks per named window producing insert and remove stream events for consumption, as the locks in question are typically held for a very short time. The 'suspend' blocking technique can be configured and a timeout value can also be defined.

The XML configuration file may change settings via the following elements:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <runtime>
    <threading>
      <named-window-consumer-dispatch preserve-order="true" locking="spin"/>
    </threading>
  </runtime>
</esper-configuration>

16.6.1.4. Internal Timer Settings

This option can be used to disable the internal timer thread, and thus have the application supply external time events, as well as to set a timer resolution.

The next code snippet shows how to disable the internal timer thread via the configuration API:

Configuration config = new Configuration();
config.getRuntime().getThreading().setInternalTimerEnabled(false);

This snippet of XML configuration leaves the internal timer enabled (the default) and sets a resolution of 200 milliseconds (the default is 100 milliseconds):

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <runtime>
    <threading>
      <internal-timer enabled="true" msec-resolution="200"/>
    </threading>
  </runtime>
</esper-configuration>

We recommend that when disabling the internal timer, applications send an external timer event setting the start time before creating statements, such that statement start time is well-defined.

16.6.1.5. Advanced Threading Options

The settings described herein are for enabling advanced threading options for inbound, outbound, timer and route executions.

Take the next snippet of XML configuration as an example. It configures all threading options to 2 threads, which may not be suitable for your application but demonstrates the configuration:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <runtime>
    <threading>
      <threadpool-inbound enabled="true" num-threads="2"/>
      <threadpool-outbound enabled="true" num-threads="2" capacity="1000"/>
      <threadpool-timerexec enabled="true" num-threads="2"/>
      <threadpool-routeexec enabled="true" num-threads="2"/>
    </threading>
  </runtime>
</esper-configuration>

By default, queues are unbounded and backed by java.util.concurrent.LinkedBlockingQueue. The optional capacity attribute can be set to instruct the threading option to configure a capacity-bound queue with a sender-wait (blocking put) policy, backed by ArrayBlockingQueue.

This example uses the API for configuring inbound threading:

Configuration config = new Configuration();
config.getRuntime().getThreading().setThreadPoolInbound(true);
config.getRuntime().getThreading().setThreadPoolInboundNumThreads(2);

With a bounded work queue, the queue size and pool size should be tuned together. A large queue coupled with a small pool can help reduce memory usage, CPU usage, and context switching, at the cost of potentially constraining throughput.
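
To illustrate the two queue types named above, the following JDK-only sketch contrasts an unbounded LinkedBlockingQueue with a capacity-bound ArrayBlockingQueue. The class and method names (QueuePolicySketch, fillWithoutBlocking) are hypothetical and this is not the runtime's own code; it only shows why a sender has to wait once a bounded queue is full.

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;

// Hypothetical illustration; not part of the Esper API.
final class QueuePolicySketch {

    // Offers 'attempts' items without blocking and returns how many were accepted.
    static int fillWithoutBlocking(BlockingQueue<Integer> queue, int attempts) {
        int accepted = 0;
        for (int i = 0; i < attempts; i++) {
            // offer() returns false when a bounded queue is full;
            // put() would instead block the sender until space frees up.
            if (queue.offer(i)) {
                accepted++;
            }
        }
        return accepted;
    }

    public static void main(String[] args) {
        BlockingQueue<Integer> unbounded = new LinkedBlockingQueue<>();
        BlockingQueue<Integer> bounded = new ArrayBlockingQueue<>(1000);
        System.out.println(fillWithoutBlocking(unbounded, 5000)); // prints 5000
        System.out.println(fillWithoutBlocking(bounded, 5000));   // prints 1000
    }
}
```

With a blocking put in place of offer, the 1001st sender in the bounded case would wait until a worker thread removes an item, which is the sender-wait policy that the capacity attribute selects.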

Note

If outbound-threading is enabled, listeners and subscribers that send events back into the runtime should use the sendEventType method and not the routeEvent method.

16.6.1.6. Runtime Fair Locking

By default the runtime configures the runtime-level lock without fair locking. The runtime-level lock coordinates event processing threads (threads that send events) with threads that perform administrative functions (threads that deploy and undeploy statements, for example). A fair lock generally performs worse than an unfair lock, thus the default configuration is an unfair lock.

If your application is multi-threaded and multiple threads send events without gaps and if the per-event processing time is significant, then configuring a fair lock can help prioritize administrative functions. Administrative functions exclude event-processing threads until the administrative function completes. You may need to set this flag to prevent lock starvation when performing an administrative function in the face of concurrent event processing. Please consult the Java API documentation under ReentrantReadWriteLock and Fair Mode for more information.
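
As a reminder of what fair mode means at the JDK level, the sketch below creates a ReentrantReadWriteLock in either mode and runs an action under the write lock. The class FairLockSketch and its methods are hypothetical and only serve as an analogy: readers stand in for event-processing threads and the writer stands in for an administrative function. The runtime manages its own lock internally.

```java
import java.util.concurrent.locks.ReentrantReadWriteLock;

// Hypothetical illustration; the runtime manages its own lock internally.
final class FairLockSketch {

    // Creates a read-write lock in fair or non-fair (JDK default) mode.
    static ReentrantReadWriteLock create(boolean fair) {
        return new ReentrantReadWriteLock(fair);
    }

    // Runs an "administrative" action under the write lock, which excludes
    // all readers (the event-processing threads in this analogy).
    static void runExclusive(ReentrantReadWriteLock lock, Runnable adminAction) {
        lock.writeLock().lock();
        try {
            adminAction.run();
        } finally {
            lock.writeLock().unlock();
        }
    }

    public static void main(String[] args) {
        ReentrantReadWriteLock lock = create(true);
        runExclusive(lock, () -> System.out.println("fair=" + lock.isFair())); // prints fair=true
    }
}
```

In fair mode a waiting writer is granted the lock roughly in arrival order, so a steady stream of readers cannot postpone it indefinitely; the price is lower throughput under contention.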

The XML configuration to enable fair locking, which is disabled by default, is as follows:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <runtime>
    <threading runtime-fairlock="true"/>
  </runtime>
</esper-configuration>

The API to change the setting:

Configuration config = new Configuration();
config.getRuntime().getThreading().setRuntimeFairlock(true);

16.6.2. Runtime Settings Related to Logging

16.6.2.1. Execution Path Debug Logging

By default, the runtime does not produce debug output for the event processing execution paths even when Log4j or Logger configurations have been set to output debug level logs. To enable debug level logging, set this option in the configuration as well as in your Log4j configuration file.

Statement-level processing information can be output via the @Audit annotation; please see Section 14.12.1, “@Audit Annotation”.

When debug-level logging is enabled by setting the flag as below and by setting DEBUG in the Log4j configuration file, then the timer processing may produce extensive debug output that you may not want to have in the log file. The timer-debug setting in the XML or via API as below disables timer debug output which is enabled by default.

The API to use to enable debug logging and disable timer event output is shown here:

Configuration config = new Configuration();
config.getRuntime().getLogging().setEnableExecutionDebug(true);
config.getRuntime().getLogging().setEnableTimerDebug(false);

Note: this is a configuration option that applies to all runtime instances of a given Java module or VM.

The XML snippet is:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <runtime>
    <logging>
      <execution-path enabled="true"/>
      <timer-debug enabled="false"/>
    </logging>
  </runtime>
</esper-configuration>

16.6.2.2. Audit Logging

The settings herein control the output format of @Audit logs.

This setting applies to all runtime instances in the same JVM. Please also see the API documentation for information on pattern conversion characters.

Table 16.5. Audit Log Conversion Characters

Character	Description
`m`	Audit message
`s`	Statement name
`u`	Runtime URI
`d`	Deployment Id
`i`	Context partition id
`c`	Category

The API to use to set an audit log format is shown here:

Configuration config = new Configuration();
config.getRuntime().getLogging().setAuditPattern("[%u] [%s] %m");

The XML snippet is:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <runtime>
    <logging>
      <audit pattern="[%u] [%s]%m"/>
    </logging>
  </runtime>
</esper-configuration>
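
To make the role of the conversion characters concrete, the following sketch expands a pattern such as "[%u] [%s] %m" against sample values. It is a hypothetical stand-in written for illustration (the class AuditPatternSketch and the sample values are assumptions) and does not reproduce the runtime's own formatter.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical illustration of pattern expansion; not the runtime's formatter.
final class AuditPatternSketch {

    // Replaces each %x conversion with its value; unknown conversions are kept as written.
    static String format(String pattern, Map<Character, String> values) {
        StringBuilder out = new StringBuilder();
        for (int i = 0; i < pattern.length(); i++) {
            char ch = pattern.charAt(i);
            if (ch == '%' && i + 1 < pattern.length()) {
                String value = values.get(pattern.charAt(i + 1));
                if (value != null) {
                    out.append(value);
                    i++; // skip the conversion character
                    continue;
                }
            }
            out.append(ch);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        Map<Character, String> values = new HashMap<>();
        values.put('u', "default");              // sample runtime URI
        values.put('s', "MyStatement");          // sample statement name
        values.put('m', "sample audit message"); // sample audit message
        // prints: [default] [MyStatement] sample audit message
        System.out.println(format("[%u] [%s] %m", values));
    }
}
```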

16.6.3. Runtime Settings Related to Variables

16.6.3.1. Variable Version Release Interval

This setting controls the length of time that the runtime retains variable versions for use by statements that use variables and that execute, within the same statement for the same event, longer than the time interval. By default, the runtime retains 15 seconds of variable versions.

For statements that use variables and that execute (in response to a single timer or other event) longer than the time period, the runtime returns the current variable version at the time the statement executes, thereby softening the guarantee of consistency of variable values within the long-running statement. Please see Section 5.17.3, “Using Variables” for more information.

The XML configuration for this setting is shown below:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <runtime>
    <variables>
      <msec-version-release value="15000"/>
    </variables>
  </runtime>
</esper-configuration>

16.6.4. Runtime Settings Related to Patterns

16.6.4.1. Followed-By Operator Maximum Subexpression Count

You may use this setting to limit the total runtime-wide number of pattern sub-expressions that all followed-by operators may manage. When the limit is reached, a condition is raised by the runtime through the condition callback API.

By default, when the limit is reached, the runtime also prevents the start of new pattern sub-expressions, until pattern sub-expressions end and the limit is no longer reached. By setting the prevent-start flag to false you can instruct the runtime to only raise a condition and continue to allow the start of new pattern sub-expressions.

The implications of the settings described herein are also detailed in Section 7.5.8.2, “Limiting Runtime-Wide Sub-Expression Count”.

A sample XML configuration for this setting is shown below:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <runtime>
    <patterns>
      <max-subexpression value="100" prevent-start="false"/>
    </patterns>
  </runtime>
</esper-configuration>
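
The interaction between the limit value and the prevent-start flag can be modeled with a simple counter. The sketch below is a hypothetical, single-threaded model (the class SubexpressionLimitSketch, its methods and the message text are assumptions), and it assumes that the condition is raised whenever a start is attempted while the limit is reached. It is not the runtime's implementation.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical model of a runtime-wide limit; not the runtime's implementation.
final class SubexpressionLimitSketch {
    private final int max;
    private final boolean preventStart;
    private int active;
    // Stands in for notifications delivered through the condition callback API.
    final List<String> conditions = new ArrayList<>();

    SubexpressionLimitSketch(int max, boolean preventStart) {
        this.max = max;
        this.preventStart = preventStart;
    }

    // Returns true if the new sub-expression may start.
    boolean tryStart() {
        if (active >= max) {
            conditions.add("limit of " + max + " reached");
            if (preventStart) {
                return false; // default behavior: refuse until capacity frees up
            }
        }
        active++;
        return true;
    }

    // Called when a sub-expression ends.
    void end() {
        if (active > 0) {
            active--;
        }
    }

    int active() {
        return active;
    }
}
```

With prevent-start set to false the counter keeps growing past the limit and the application is only notified, which matches the "raise a condition and continue" behavior described above.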

16.6.5. Runtime Settings Related to Match-Recognize

16.6.5.1. Maximum State Count

You may use this setting to limit the total runtime-wide number of states that all match-recognize constructs may manage. When the limit is reached, a condition is raised by the runtime through the condition callback API.

By default, when the limit is reached, the runtime also prevents the allocation of new states, until states get removed and the limit is no longer reached. By setting the prevent-start flag to false you can instruct the runtime to only raise a condition and continue to allow the allocation of new states.

The implications of the settings described herein are also detailed in Section 8.11, “Limiting Runtime-Wide State Count”.

A sample XML configuration for this setting is shown below:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <runtime>
    <match-recognize>
      <max-state value="100" prevent-start="false"/>
    </match-recognize>
  </runtime>
</esper-configuration>

16.6.6. Runtime Settings Related to Time Source

16.6.6.1. Default Time Source

This setting only applies if internal timer events control runtime time (default). If external timer events provide runtime clocking, the setting does not apply.

By default, the internal timer uses the call System.currentTimeMillis() to determine runtime time in milliseconds. Via this setting the internal timer can be instructed to use System.nanoTime() instead. Please see Section 15.9.2, “Time Resolution and Time Unit” for more information.

Note: This is a Java VM global setting. If running multiple runtime instances in a Java VM, the timer setting is global and applies to all runtime instances in the same Java VM, for performance reasons.

A sample XML configuration for this setting is shown below; the sample sets the time source to the nanosecond time provider:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <runtime>
    <time-source>
      <time-source-type value="nano" />
    </time-source>
  </runtime>
</esper-configuration>

The equivalent code snippet using the configuration API is here:

Configuration config = new Configuration();
config.getRuntime().getTimeSource().setTimeSourceType(TimeSourceType.NANO);
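
For background on the two JDK calls: System.nanoTime() has an arbitrary origin and is only meaningful for measuring elapsed time, whereas System.currentTimeMillis() returns wall-clock time. One way to derive a millisecond timestamp from the nanosecond counter is to anchor it to the wall clock once, as in this sketch. The class NanoAnchoredClock is hypothetical, and the anchoring approach is an assumption made for illustration rather than a description of the runtime's time source.

```java
// Hypothetical illustration; not the runtime's time-source implementation.
final class NanoAnchoredClock {
    private final long wallClockStartMillis = System.currentTimeMillis();
    private final long nanoStart = System.nanoTime();

    // Milliseconds since the epoch, advanced by the monotonic nanosecond counter.
    long currentTimeMillis() {
        long elapsedNanos = System.nanoTime() - nanoStart;
        return wallClockStartMillis + elapsedNanos / 1_000_000L;
    }

    public static void main(String[] args) throws InterruptedException {
        NanoAnchoredClock clock = new NanoAnchoredClock();
        long before = clock.currentTimeMillis();
        Thread.sleep(20);
        long after = clock.currentTimeMillis();
        System.out.println("elapsed ms: " + (after - before));
    }
}
```

A clock built this way never moves backwards when the system clock is adjusted, but it can also drift away from wall-clock time over long runs.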

16.6.7. Runtime Settings Related to JMX Metrics

Please set the flag as described herein to have the runtime report key counters and other processing information through the JMX mbean platform server. By default JMX is not enabled. For NEsper .NET this section does not apply and there is currently no equivalent.

A sample XML configuration is shown below:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <runtime>
    <metrics-reporting jmx-runtime-metrics="true"/>
  </runtime>
</esper-configuration>

A sample code snippet to set this configuration via the API follows:

Configuration configuration = new Configuration();
configuration.getRuntime().getMetricsReporting().setJmxRuntimeMetrics(true);

16.6.8. Runtime Settings Related to Metrics Reporting

This section explains how to enable and configure metrics reporting, which is by default disabled. Please see Section 15.12, “Runtime and Statement Metrics Reporting” for more information on the metrics data reported to your application.

The flag that enables metrics reporting is global to a Java virtual machine. If metrics reporting is enabled, the overhead incurred for reporting metrics is carried by all runtime instances per Java VM.

Metrics reporting occurs by a runtime-controlled separate daemon thread that each runtime instance starts at runtime initialization time, if metrics reporting and threading is enabled (threading enabled is the default).

Runtime and statement metric intervals are in milliseconds. A negative or zero millisecond interval value may be provided to disable reporting.

To control statement metric reporting for individual statements or groups of statements, the runtime provides a facility that groups statements by statement name. Each such statement group may have different reporting intervals configured, and intervals can be changed at runtime through runtime configuration. A statement group is assigned a group name at configuration time to identify the group.

Metrics reporting configuration is part of the runtime settings. All configuration options are also available via the Configuration API.

A sample XML configuration is shown below:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <runtime>
    <metrics-reporting enabled="true" runtime-interval="1000" statement-interval="1000" 
        threading="true"/>
  </runtime>
</esper-configuration>

The runtime-interval setting (defaults to 10 seconds) determines the frequency in milliseconds at which the runtime reports runtime metrics, in this example every 1 second. The statement-interval is for statement metrics. The threading flag is true by default since reporting takes place by a dedicated runtime thread and can be set to false to use the external or internal timer thread instead.

The next example XML declares a statement group: The statements that have statement names that fall within the group follow a different reporting frequency:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <runtime>
    <metrics-reporting enabled="true" statement-interval="0">
      <stmtgroup name="MyStmtGroup" interval="2000" default-include="true" num-stmts="100" 
           report-inactive="true">
        <exclude-regex>.*test.*</exclude-regex>
      </stmtgroup>
    </metrics-reporting>
  </runtime>
</esper-configuration>

The above example configuration sets the statement-interval to zero to disable reporting for all statements. It defines a statement group by name MyStmtGroup and specifies a 2-second interval. The example sets the default-include flag to true (by default false) to include all statements in the statement group. The example also sets report-inactive to true (by default false) to report inactive statements.

The exclude-regex element may be used to specify a regular expression that serves to exclude statements from the group. Any statement whose statement name matches the exclude regular expression is not included in the group. In the above example, all statements with the characters 'test' inside their statement name are excluded from the group.

Any statement not belonging to any of the statement groups follows the configured statement interval.

There are additional elements available to include and exclude statements: include-regex, include-like and exclude-like. The latter two apply SQL-like matching. All patterns are case-sensitive.

Here is a further example of a possible statement group definition, which includes statements whose statement name contains the characters @REPORT or @STREAM, and excludes statements whose statement name contains the characters @IGNORE or @METRICS.

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <runtime>
    <metrics-reporting enabled="true">
      <stmtgroup name="MyStmtGroup" interval="1000">
        <include-like>%@REPORT%</include-like>
        <include-regex>.*@STREAM.*</include-regex>
        <exclude-like>%@IGNORE%</exclude-like>
        <exclude-regex>.*@METRICS.*</exclude-regex>
      </stmtgroup>
    </metrics-reporting>
  </runtime>
</esper-configuration>
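
The include and exclude rules can be read as a membership test on the statement name. The sketch below is a hypothetical matcher (StmtGroupSketch and its methods are assumptions) that translates SQL-like patterns using the standard LIKE wildcards, where % matches any run of characters and _ matches a single character. It also assumes that an exclude match always wins over an include match.

```java
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical matcher; the precedence of exclude over include is an assumption.
final class StmtGroupSketch {

    // Converts a SQL-like pattern ('%' = any run of characters, '_' = one character) to a regex.
    static Pattern likeToRegex(String like) {
        StringBuilder regex = new StringBuilder();
        for (char ch : like.toCharArray()) {
            if (ch == '%') {
                regex.append(".*");
            } else if (ch == '_') {
                regex.append('.');
            } else {
                regex.append(Pattern.quote(String.valueOf(ch)));
            }
        }
        return Pattern.compile(regex.toString(), Pattern.DOTALL);
    }

    // True if the whole name matches at least one of the patterns (case-sensitive).
    static boolean matchesAny(String name, List<Pattern> patterns) {
        for (Pattern p : patterns) {
            if (p.matcher(name).matches()) {
                return true;
            }
        }
        return false;
    }

    // A statement is in the group if it is included and not excluded.
    static boolean inGroup(String name, boolean defaultInclude,
                           List<Pattern> includes, List<Pattern> excludes) {
        boolean included = defaultInclude || matchesAny(name, includes);
        return included && !matchesAny(name, excludes);
    }
}
```

Under these assumptions, a statement named orders@REPORT falls into the group configured above, while orders@REPORT@IGNORE and orders@report do not.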

16.6.9. Runtime Settings Related to Expression Evaluation

16.6.9.1. Subselect Evaluation Order

By default the runtime updates sub-selects with new events before evaluating the enclosing statement. This is relevant for statements that look for the same event in both the from clause and subselects.

To have the runtime evaluate the enclosing clauses before updating the subselect in a subselect expression, set the flag as indicated herein.

The XML configuration below sets the flag to its default value:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <runtime>
    <expression self-subselect-preeval="true"/>
  </runtime>
</esper-configuration>

Here is a sample statement that utilizes a sub-select against the same events:

select * from MyEvent where prop not in (select prop from MyEvent#unique(otherProp))

By default the subselect data window updates first before the where clause is evaluated, and therefore the above statement never returns results.

Changing the setting described here causes the where clause to evaluate before the subselect data window updates, thereby the statement does post results.
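
The effect of the evaluation order on the sample statement can be reproduced with a small simulation. The sketch below is a simplified model, not the runtime's implementation: the class SubselectOrderSketch is hypothetical, each event is a pair of prop and otherProp, and a map keyed by otherProp stands in for the #unique(otherProp) data window.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Simplified simulation of evaluation order; not the runtime's implementation.
final class SubselectOrderSketch {

    // Each event is {prop, otherProp}. Returns the prop values the statement would output.
    static List<String> run(String[][] events, boolean preEval) {
        Map<String, String> uniqueWindow = new LinkedHashMap<>(); // otherProp -> prop
        List<String> output = new ArrayList<>();
        for (String[] event : events) {
            String prop = event[0];
            String otherProp = event[1];
            if (preEval) {
                uniqueWindow.put(otherProp, prop); // subselect window updates first
            }
            if (!uniqueWindow.containsValue(prop)) { // where prop not in (select prop ...)
                output.add(prop);
            }
            if (!preEval) {
                uniqueWindow.put(otherProp, prop); // window updates after the where clause
            }
        }
        return output;
    }

    public static void main(String[] args) {
        String[][] events = { {"a", "k1"}, {"b", "k2"}, {"a", "k3"} };
        System.out.println(run(events, true));  // prints []
        System.out.println(run(events, false)); // prints [a, b]
    }
}
```

With pre-evaluation each event is already in the window when the where clause runs, so the not-in test always fails. Without it, only the third event is filtered out, because its prop value was seen before.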

16.6.9.2. Time Zone

By default, when performing calendar operations, the runtime uses the default time zone obtained by java.util.TimeZone.getDefault(). Use the setting herein to specify a time zone other than the default time zone.

The below XML configuration sets a time zone 'GMT-4:00':

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <runtime>
    <expression time-zone="GMT-4:00"/>
  </runtime>
</esper-configuration>

An example API configuration is shown next:

Configuration config = new Configuration();
config.getRuntime().getExpression().setTimeZone(TimeZone.getTimeZone("GMT-4:00"));
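
When choosing a time zone ID, keep in mind that java.util.TimeZone.getTimeZone silently falls back to GMT for an ID it cannot understand. The following JDK-only sketch, with a hypothetical helper class TimeZoneIdSketch, shows how an application might check an ID before placing it in the configuration.

```java
import java.util.TimeZone;

// Illustration using only java.util.TimeZone; the helper class is hypothetical.
final class TimeZoneIdSketch {

    // Offset from UTC in whole hours for a custom ID such as "GMT-4:00".
    static int rawOffsetHours(String id) {
        return TimeZone.getTimeZone(id).getRawOffset() / (60 * 60 * 1000);
    }

    // True if the JDK resolved the ID to something other than the GMT fallback.
    static boolean isRecognized(String id) {
        return id.equals("GMT") || !TimeZone.getTimeZone(id).getID().equals("GMT");
    }

    public static void main(String[] args) {
        System.out.println(rawOffsetHours("GMT-4:00")); // prints -4
        System.out.println(isRecognized("GMT-4:00"));   // prints true
        System.out.println(isRecognized("Not/AZone"));  // prints false
    }
}
```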

16.6.10. Runtime Settings Related to Execution of Statements

16.6.10.1. Prioritized Execution

By default the runtime ignores @Priority and @Drop annotations and executes unprioritized, that is the runtime does not attempt to interpret assigned priorities and reorder executions based on priority. Use this setting if your application requires prioritized execution.

By setting this configuration, the runtime executes statements, when an event or schedule matches multiple statements, according to the assigned priority, starting from the highest priority value. See built-in EPL annotations in Section 5.2.7.7, “@Priority”.

The XML configuration to enable the flag, which is disabled by default, is as follows:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <runtime>
    <execution prioritized="true"/>
  </runtime>
</esper-configuration>

The API to change the setting:

Configuration config = new Configuration();
config.getRuntime().getExecution().setPrioritized(true);

16.6.10.2. Context Partition Fair Locking

By default the runtime configures context partition locks without fair locking. If your application is multi-threaded and performs very frequent reads via iterator or fire-and-forget queries, you may need to set this flag to prevent lock starvation in the face of concurrent reads and writes. Please consult the Java API documentation under ReentrantReadWriteLock and Fair Mode for more information.

The XML configuration to enable fair locking, which is disabled by default, is as follows:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <runtime>
    <execution fairlock="true"/>
  </runtime>
</esper-configuration>

The API to change the setting:

Configuration config = new Configuration();
config.getRuntime().getExecution().setFairlock(true);

16.6.10.3. Disable Locking

By default the runtime configures context partition locks as required after analyzing your statements. You may disable context partition locks using the setting described here. Use the @NoLock annotation instead to disable locking for a given statement or named window only.

Warning

The runtime provides this setting for the purpose of identifying locking overhead, or when your application is single-threaded, or when using an external mechanism for concurrency control. Setting disable-locking to true may have unpredictable results unless your application takes concurrency into consideration.

The XML configuration to disable context level locking is as follows:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <runtime>
    <execution disable-locking="true"/>
  </runtime>
</esper-configuration>

The API to change the setting:

Configuration config = new Configuration();
config.getRuntime().getExecution().setDisableLocking(true);

16.6.10.4. Filter Service Profile

This setting is for performance tuning of the filter service, which handles matching incoming events to context partitions and statements.

In the default configuration, termed readmostly, filter service locking is coarse-grained, assuming a large number of reads and comparatively few writes. "Reads" are evaluations of events, while "writes" are filter service changes such as new statements, a new pattern subexpression becoming active or a pattern subexpression being deactivated.

Set the configuration to readwrite if you have multiple threads and your statements very frequently add and remove filters using pattern subexpressions, for example. This setting instructs the runtime to maintain fine-grained locks instead generally allowing for higher concurrency but possibly incurring additional overhead.

The XML configuration to set a new filter service profile is as follows:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <runtime>
    <execution filter-service-profile="readwrite"/>
  </runtime>
</esper-configuration>

The API to change the setting:

Configuration config = new Configuration();
config.getRuntime().getExecution().
    setFilterServiceProfile(FilterServiceProfile.READWRITE);

16.6.10.5. Declared Expression Value Cache Size

In the default configuration the setting is 1, which means that for each declared expression the runtime retains a cache of only the last computed value, for use for the duration of an evaluation of an event or time against a context partition. You may set the value to zero to disable caching. You may set the value to N to instruct the runtime to retain a cache of the last N computed values. This setting is not applicable to stateful declared expressions such as declared expressions with aggregation functions, for example.

The XML configuration below sets the same value as the default:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <runtime>
    <execution declared-expr-value-cache-size="1"/>
  </runtime>
</esper-configuration>
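
The meaning of the cache size values 0, 1 and N can be pictured with a small bounded map. The sketch below is a hypothetical model (the class LastValuesCache and the idea of keying entries by parameter values are assumptions), not the cache that the runtime uses.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical cache of the last N computed values; not the runtime's implementation.
final class LastValuesCache<K, V> extends LinkedHashMap<K, V> {
    private final int capacity;

    LastValuesCache(int capacity) {
        this.capacity = capacity;
    }

    // Evicts the oldest entry once more than 'capacity' values are held;
    // a capacity of zero therefore retains nothing, which disables caching.
    @Override
    protected boolean removeEldestEntry(Map.Entry<K, V> eldest) {
        return size() > capacity;
    }
}
```

A cache created with capacity 1 keeps only the most recently stored value, and a cache created with capacity 0 stays empty no matter how many values are stored.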

The API to change the setting:

Configuration config = new Configuration();
config.getRuntime().getExecution().
    setDeclaredExprValueCacheSize(1);

16.6.11. Runtime Settings Related to Exception Handling

Use the settings as described here to register an exception handler factory class that provides an exception handler. The runtime invokes exception handlers in the order they are listed to handle a continuous-query unchecked exception, as further described in Section 15.10, “Exception Handling”. Please provide the fully-qualified class name of each class that implements the com.espertech.esper.common.client.hook.exception.ExceptionHandlerFactory interface in the runtime configuration as below.

By default, during a module undeploy when the runtime encounters a runtime exception for any of the statements it logs such exceptions as warnings. You can set the undeploy-rethrow-policy flag to rethrow_first to instead have the runtime rethrow the first runtime exception.

The XML configuration is as follows:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <runtime>
    <exceptionHandling undeploy-rethrow-policy="warn">
      <handlerFactory class="my.company.cep.MyCEPRuntimeExceptionHandlerFactory"/>
    </exceptionHandling>
  </runtime>
</esper-configuration>

The API calls to register an exception handler factory are as follows:

Configuration config = new Configuration();
config.getRuntime().getExceptionHandling().addClass(MyCEPRuntimeExceptionHandlerFactory.class);
config.getRuntime().getExceptionHandling().setUndeployRethrowPolicy(UndeployRethrowPolicy.RETHROW_FIRST);

16.6.12. Runtime Settings Related to Condition Handling

Use the settings as described here to register a condition handler factory class that provides a condition handler. The runtime invokes condition handlers in the order they are listed to indicate conditions, which is the term used for notification when certain predefined limits are reached, as further described in Section 15.11, “Condition Handling”.

Please provide the fully-qualified class name of each class that implements the com.espertech.esper.common.client.hook.condition.ConditionHandlerFactory interface in the runtime configuration as below.

The XML configuration is as follows:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <runtime>
    <conditionHandling>
      <handlerFactory class="my.company.cep.MyCEPRuntimeConditionHandlerFactory"/>
    </conditionHandling>
  </runtime>
</esper-configuration>

The API calls to register a condition handler factory are as follows:

Configuration config = new Configuration();
config.getRuntime().getConditionHandling().addClass(MyCEPRuntimeConditionHandlerFactory.class);

16.7. Passing Services or Transient Objects

The Configuration object allows passing application objects such as services or other transient objects. This information can be used by extensions, listeners or subscribers, for example, to obtain application objects from the runtime. Your application may provide a custom class loader or class-for-name service.

Use setTransientConfiguration and provide a Map<String, Object> that contains the application objects. The runtime retains the same Map instance and makes it available via API. Its contents, including services, can be changed by an application at runtime. The API methods to retrieve transient configuration are:

The getConfigurationTransient method of EPRuntime
The getConfigurationDeepCopy method of EPRuntime

16.7.1. Service Example

Assuming your application has a service myLocalService instance, the example code is:

Configuration configuration = new Configuration();
HashMap<String, Object> transients = new HashMap<String, Object>();
transients.put(SERVICE_NAME, myLocalService); // SERVICE_NAME is a well-known string value defined elsewhere
configuration.getCommon().setTransientConfiguration(transients);

EPRuntime runtime = EPRuntimeProvider.getDefaultRuntime(configuration);

A sample listener that receives a service from transient configuration is:

public class MyListener implements UpdateListener {
  public void update(EventBean[] newEvents, EventBean[] oldEvents, EPStatement statement, EPRuntime runtime) {
    MyLocalService service = (MyLocalService) runtime.getConfigurationTransient().get(SERVICE_NAME);
    // use the service here
  }
}

An alternative means to obtain application services is to define a constant variable.

16.7.2. Class-for-Name

By default, when resolving a fully-qualified class name to a Class, the com.espertech.esper.common.client.util.ClassForNameProviderDefault uses:

ClassLoader cl = Thread.currentThread().getContextClassLoader();
return Class.forName(className, true, cl);

Your application can implement the com.espertech.esper.common.client.util.ClassForNameProvider interface to provide an alternate implementation.

For example, this provider prevents the System class from being available in EPL:

runtime.getConfigurationTransient().put(ClassForNameProvider.NAME,
  new ClassForNameProvider() {
    public Class classForName(String className) throws ClassNotFoundException {
      if (className.equals(System.class.getName())) { // prevent the System class from loading
        return null;
      }
      return Class.forName(className, true, Thread.currentThread().getContextClassLoader());
    }
});

16.7.3. Class Loader

By default, to obtain a class loader, the com.espertech.esper.common.client.util.ClassLoaderProviderDefault uses Thread.currentThread().getContextClassLoader().

Your application can implement the com.espertech.esper.common.client.util.ClassLoaderProvider interface to provide an alternate implementation.

For example, this provider returns a pre-determined classloader:

ClassLoader classLoader = new CustomClassLoader();
runtime.getConfigurationTransient().put(ClassLoaderProvider.NAME,
  new ClassLoaderProvider() {
    public ClassLoader classloader() {
      return classLoader;
    }
});

16.8. Type Names

Certain configuration values accept type names. Type names can occur in the configuration of variable types, Map-event property types as well as XPath cast types, for example. Type names are not case-sensitive.

The table below outlines all possible type names:

Table 16.6. Variable Type Names

Type Name	Type
`string`, `varchar`, `varchar2` or `java.lang.String`	A string value
`int`, `integer` or `java.lang.Integer`	An integer value
`long` or `java.lang.Long`	A long value
`bool`, `boolean` or `java.lang.Boolean`	A boolean value
`double` or `java.lang.Double`	A double value
`float` or `java.lang.Float`	A float value
`short` or `java.lang.Short`	A short value
`char`, `character` or `java.lang.Character`	A character value
`byte` or `java.lang.Byte`	A byte value
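
The table can be read as a case-insensitive lookup from a type name to a Java type. The sketch below is a hypothetical lookup written for illustration (the class TypeNameSketch and the choice to map every name to the boxed Java class are assumptions), not the runtime's resolver.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical lookup mirroring the table; not the runtime's resolver.
final class TypeNameSketch {
    private static final Map<String, Class<?>> TYPES = new HashMap<>();

    // Registers the short names plus the fully-qualified class name, all in lower case.
    private static void register(Class<?> type, String... names) {
        for (String name : names) {
            TYPES.put(name.toLowerCase(Locale.ROOT), type);
        }
        TYPES.put(type.getName().toLowerCase(Locale.ROOT), type);
    }

    static {
        register(String.class, "string", "varchar", "varchar2");
        register(Integer.class, "int", "integer");
        register(Long.class, "long");
        register(Boolean.class, "bool", "boolean");
        register(Double.class, "double");
        register(Float.class, "float");
        register(Short.class, "short");
        register(Character.class, "char", "character");
        register(Byte.class, "byte");
    }

    // Returns the mapped class, or null if the name is not in the table.
    static Class<?> resolve(String typeName) {
        return TYPES.get(typeName.trim().toLowerCase(Locale.ROOT));
    }
}
```

For example, resolve("VARCHAR") and resolve("java.lang.String") both yield String.class under this model, while a name that is not in the table yields null.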

16.9. Logging Configuration

For NEsper .NET also see Section I.18, “.NET Configurations - Logging Configuration”.

The compiler and runtime log all messages to SLF4J under an appropriate log level. To output log messages you can add Log4j and SLF4J-Log4j (1.2) to classpath and configure Log4j as below.

The only direct dependency for logging is the SLF4J interfaces (slf4j-api-x.y.z.jar). Please see the SLF4J documentation on redirecting logs to other logging frameworks.

Statement-level processing information can be output, please see Section 14.12.1, “@Audit Annotation”.

For performance reasons, the runtime does not log any debug-level or informational-level messages for event execution unless explicitly configured via Section 16.6.2.1, “Execution Path Debug Logging”.

A callback API for receiving certain critical runtime reports is available as described in Section 15.10, “Exception Handling”.

More information on configuring runtime-level settings for logging is at Section 16.6.2, “Runtime Settings Related to Logging”.

The next table explains the log levels:

Table 16.7. Log Levels

Log Level	Use
Debug	Displays detailed runtime-internal information that may not be easy for application developers to understand but is useful for runtime support.
Info	Used for a few critical runtime-level log messages.
Warn	Certain important warning or informational messages are displayed under the warning level.
Error	Exceptions reported within the runtime or by plug-in components are reported under the error level.

16.9.1. Log4j Logging Configuration

Log4j is the default logging component. Please find additional information for Log4j configuration and extension in http://logging.apache.org/log4j.

The easiest way to configure Log4j is by providing a Log4j configuration file, similar to the log4j.xml file shipped in the etc folder of the distribution.

Add the log4j.configuration system property to the java command line and provide the file name of the Log4j configuration file, making sure your classpath also includes the directory of the file:

java -Dlog4j.configuration=log4j.xml ...

17.1. Overview

EPL allows the use of scripting languages. You may use scripts for imperative programming to execute certain code as part of EPL processing by the runtime.

The syntax and examples outlined below discuss how to declare a script that is visible to the same statement that listed the script.

For declaring scripts that are visible across multiple statements i.e. globally visible scripts please consult Section 5.18.3, “Global Scripts” that explains the create expression clause.

Any scripting language that supports JSR 223 and also the MVEL scripting language can be specified in EPL. This section provides MVEL and JavaScript examples.

For more information on the MVEL scripting language and its syntax, please refer to the MVEL documentation. MVEL is an expression language that has a natural syntax for Java-based applications and compiles to provide fast execution times. To use MVEL with the runtime, please make sure to add the MVEL jar file to the application classpath.

For more information on JSR 223 scripting languages, please refer to external resources. As JSR 223 defines a standard API for script execution, your application may use any script execution that implements the API. Current JVM versions ship with a JavaScript script execution. Other script executors such as Groovy, Ruby and Python scripts can be used as implementations of JSR 223.
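
Whether a given JSR 223 dialect is usable depends on the script engines present on the classpath of the JVM in use. The following sketch uses only the standard javax.script API to list the registered engine names and to test for one of them. The class ScriptDialectCheck is hypothetical. The check covers JSR 223 engines only, so it says nothing about MVEL, which the text above lists separately from JSR 223 languages.

```java
import java.util.ArrayList;
import java.util.List;
import javax.script.ScriptEngineFactory;
import javax.script.ScriptEngineManager;

// Illustration using the standard javax.script API; the helper class is hypothetical.
final class ScriptDialectCheck {

    // Collects every name under which a script engine is registered in this JVM.
    static List<String> availableDialects() {
        List<String> names = new ArrayList<>();
        for (ScriptEngineFactory factory : new ScriptEngineManager().getEngineFactories()) {
            names.addAll(factory.getNames());
        }
        return names;
    }

    // getEngineByName returns null when no engine is registered under the name.
    static boolean isAvailable(String dialect) {
        return new ScriptEngineManager().getEngineByName(dialect) != null;
    }

    public static void main(String[] args) {
        System.out.println("Registered names: " + availableDialects());
        System.out.println("js available: " + isAvailable("js"));
    }
}
```

If the lookup for js reports false on your JVM, a JavaScript engine that implements JSR 223 needs to be added to the classpath before js scripts can be used.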

As an alternative to a script, consider providing a custom single-row function as described in Section 20.2, “Single-Row Function”.

17.2. Syntax

The syntax for scripts is:

expression [return_type] [@type(eventtype_name)] [dialect_identifier:] script_name [ (parameters) ] [ script_body ]

Use the expression keyword to declare a script.

The return_type is optional. If the script declaration provides a return type the compiler can perform strong type checking: Any expressions that invoke the script and use the return value are aware of the return type. If no return type is provided the compiler assumes the script returns java.lang.Object.

If the return type of the script is EventBean[] you must provide the @type(name) annotation after the return type to name the event type of events returned by the script. The @type is allowed only when the return type is EventBean instances.

The dialect_identifier is optional and identifies the scripting language. Use mvel for MVEL , js for JavaScript and python for Python and similar for other JSR 223 scripting languages. If no dialect identifier is specified, the default dialect that is configured applies, which is js unless your application changes the default configuration.

The script name follows. You may use the same script name multiple times and thereby overload the script, providing multiple signatures under the same script name. The combination of script name and number of parameters must be unique, however.

If you have script parameters, specify the parameter names for the script as a comma-separated list of identifiers in parentheses. It is not necessary to list parameter types.

The script body is the actual MVEL or JavaScript or other scripting language script and is placed in square brackets: [ ... script body ...].

17.3. Examples

The next example shows a statement that calls a JavaScript script which computes the Fibonacci total for a given number:

expression double js:fib(num) [
fib(num);
function fib(n) {
  if(n <= 1)
    return n;
  return fib(n-1) + fib(n-2);
}
]
select fib(intPrimitive) from SupportBean;

The expression keyword is followed by the return type (double), the dialect (js) and the script name (fib) that declares a single parameter (num). The JavaScript code that computes the Fibonacci total is between square brackets [].

The following example shows a statement that calls an MVEL script which outputs all the different colors that are listed in the colors property of each ColorEvent:

expression mvel:printColors(colors) [
String c = null;
for (c : colors) {
   System.out.println(c);
}
]
select printColors(colors) from ColorEvent;

This example instead uses JavaScript to print colors and passes the event itself as a script parameter (this example is for Java 8 and Nashorn):

expression js:printColors(colorEvent) [
  print(java.util.Arrays.toString(colorEvent.getColors()));
]
select printColors(colorEvent) from ColorEvent as colorEvent

The next example creates a globally-visible script that returns ItemEvent events, assuming that the ItemEvent event type is an event type defined by create schema ItemEvent(id string):

create expression EventBean[] @type(ItemEvent) js:myScriptReturnsEvents() [
myScriptReturnsEvents();
function myScriptReturnsEvents() {
  var EventBeanArray = Java.type("com.espertech.esper.common.client.EventBean[]");
  var events = new EventBeanArray(1);
  events[0] = epl.getEventBeanService().adapterForMap(java.util.Collections.singletonMap("id", "id1"), "ItemEvent");
  return events;
}
]

// sample EPL:
// select myScriptReturnsEvents().where(v => v.id in ('id1', 'id3')) from MyEvent

17.4. Built-In EPL Script Attributes

The compiler provides a built-in script object under the variable name epl to all scripts. Your scripts may use this script object to share and retain state by setting and reading script attributes. The runtime maintains a separate script object per context partition or per statement if not declaring a context. Therefore script attributes are not shared between statements, however multiple scripts executed by the same context partition receive the same script object.

The epl script object implements the interface com.espertech.esper.common.client.hook.expr.EPLScriptContext. Please see the JavaDoc for services provided by EPLScriptContext.

For script state management, the EPLScriptContext interface has two methods: The void setScriptAttribute(String attribute, Object value) method to set an attribute value and the Object getScriptAttribute(String attribute) method to read an attribute value.

The next example demonstrates the use of the epl script object. It outputs a flag value true when an RFID event matched because the location is A, and outputs a flag value false when an RFID event matched because the location is B. The example works the same for either MVEL or JavaScript dialects: You may simply replace the js dialect with mvel.

expression boolean js:setFlag(name, value, returnValue) [
  if (returnValue) epl.setScriptAttribute(name, value);
  returnValue;
]
expression js:getFlag(name) [
  epl.getScriptAttribute(name);
]
select getFlag('locA') as flag from RFIDEvent(zone = 'Z1' and
  (setFlag('locA', true, location = 'A') or setFlag('locA', false, location = 'B')) )

The example above utilizes two scripts: The setFlag script receives an attribute name, attribute value and a return value. The script sets the script attribute only when the return value is true. The getFlag script simply returns the script attribute value.
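The flag logic of the two scripts can be traced with a small plain-Java sketch. This is an illustration only and does not use the Esper API: a HashMap stands in for the script attributes behind the epl script object, the class and method names are hypothetical, and the sketch assumes the filter expression evaluates left to right with short-circuit semantics as Java does.

```java
// Sketch only: a HashMap is a hypothetical stand-in for the script attributes
// that the runtime keeps per context partition or per statement.
class ScriptFlagSketch {
    static final java.util.Map<String, Object> attributes = new java.util.HashMap<>();

    // Mirrors the setFlag script: store the value only when returnValue is true,
    // then return returnValue unchanged.
    static boolean setFlag(String name, Object value, boolean returnValue) {
        if (returnValue) {
            attributes.put(name, value);
        }
        return returnValue;
    }

    // Mirrors the getFlag script: read the attribute value.
    static Object getFlag(String name) {
        return attributes.get(name);
    }

    // Mirrors the filter: zone = 'Z1' and (setFlag(..., location = 'A') or setFlag(..., location = 'B')).
    static boolean matches(String zone, String location) {
        return "Z1".equals(zone)
            && (setFlag("locA", true, "A".equals(location))
                || setFlag("locA", false, "B".equals(location)));
    }

    public static void main(String[] args) {
        System.out.println(matches("Z1", "A") + " " + getFlag("locA")); // prints "true true"
        System.out.println(matches("Z1", "B") + " " + getFlag("locA")); // prints "true false"
        System.out.println(matches("Z1", "C") + " " + getFlag("locA")); // prints "false false"
    }
}
```

In the last call the event does not match, so neither script call stores a value and the attribute keeps the value set by the previous event.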

17.5. Performance Notes

Upon statement compilation, the compiler resolves script parameter types and performs script compilation. At runtime the runtime evaluates the script in its compiled form.

As the compiler cannot inspect scripts, it is not possible for the compiler to perform query planning or many optimizations based on the information in scripts. It is thus recommended to structure EPL such that basic filter and join expressions are EPL expressions and not script expressions.

17.6. Additional Notes

Your EPL may declare a return type for the script. If no return type is declared and when using the MVEL dialect, the compiler will infer the return type from the MVEL expression analysis result. If the return type is not provided and cannot be inferred or the dialect is not MVEL, the return type is Object.

If the EPL declares a numeric return type then the compiler performs coercion of the numeric result to the return type that is specified.

In the case that the EPL declares a return type that does not match the type of the actual script return value, the compiler does not check return value type.

Chapter 18. EPL Reference: Spatial Methods and Indexes

18.1. Overview

18.2. Spatial Methods

18.2.1. Point-Inside-Rectangle
18.2.2. Rectangle-Intersects-Rectangle

18.3. Spatial Index - Quadtree

18.3.1. Overview
18.3.2. Declaring a Point-Region Quadtree Index
18.3.3. Using a Point-Region Quadtree as a Filter Index
18.3.4. Using a Point-Region Quadtree as an Event Index
18.3.5. Declaring a MX-CIF Quadtree Index
18.3.6. Using a MX-CIF Quadtree as a Filter Index
18.3.7. Using a MX-CIF Quadtree as an Event Index

18.4. Spatial Types, Functions and Methods from External Libraries

18.1. Overview

EPL provides spatial methods and spatial indexes.

The compiler analyzes filter criteria and the where-clause and considers spatial methods, utilizing spatial filter indexes or spatial event indexes for efficient matching and lookup.

For general information on the dot-operator please consult Section 9.6, “Dot Operator”.

18.2. Spatial Methods

The table below summarizes the built-in spatial methods:

Table 18.1. Spatial Methods

Method	Result
point(x,y).inside(rectangle(x,y,width,height))	Returns true if the point is inside the rectangle. See Section 18.2.1, “Point-Inside-Rectangle”.
rectangle(x,y,width,height).intersects(rectangle(x,y,width,height))	Returns true if the rectangle intersects with the rectangle. See Section 18.2.2, “Rectangle-Intersects-Rectangle”.

18.2.1. Point-Inside-Rectangle

The method compares a point to a rectangle and returns true if the point falls inside the rectangle.

The method takes a point as input and a rectangle as a parameter:

point(point_x, point_y [, filterindex:configexpression]).inside(rectangle(rect_x, rect_y, width, height))

For the point, please provide the point_x and point_y expressions that return the (x, y)-coordinates of the point. The filterindex named parameter is for use with filter indexes as described below. The left-hand side point can be subject to point-region quadtree indexing (MX-CIF quadtrees do not apply).

For the rectangle, the rect_x expression and rect_y expressions return the (x, y)-coordinates of the rectangle and the width expression and height expressions return the width and height of the rectangle.

All expressions must return a number-type and the implementation compares the double-values returned by the expressions.

A point is considered inside the rectangle if (point_x >= rect_x) and (point_x < rect_x + width) and (point_y >= rect_y) and (point_y < rect_y + height).

Table 18.2. Point-Inside-Rectangle Examples

Expression	Result
point(10, 20).inside(rectangle(0, 0, 50, 50))	true
point(10, 20).inside(rectangle(20, 20, 50, 50))	false
point(10, 20).inside(rectangle(9, 19, 1, 1))	false
point(10, 20).inside(rectangle(9, 19, 1.0001, 1.0001))	true
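The comparison rule above can be checked against the rows of Table 18.2 with a short plain-Java sketch. The class and method names are hypothetical and the code does not use the Esper API; it only restates the documented comparisons on double values.

```java
// Sketch only: restates the documented point-inside-rectangle rule in plain Java.
class PointInsideRectangleSketch {
    // The lower bounds are inclusive, the upper bounds (rect + size) are exclusive.
    static boolean inside(double pointX, double pointY,
                          double rectX, double rectY, double width, double height) {
        return pointX >= rectX && pointX < rectX + width
            && pointY >= rectY && pointY < rectY + height;
    }

    public static void main(String[] args) {
        System.out.println(inside(10, 20, 0, 0, 50, 50));          // true
        System.out.println(inside(10, 20, 20, 20, 50, 50));        // false
        System.out.println(inside(10, 20, 9, 19, 1, 1));           // false, upper bound is exclusive
        System.out.println(inside(10, 20, 9, 19, 1.0001, 1.0001)); // true
    }
}
```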

18.2.2. Rectangle-Intersects-Rectangle

The method compares a rectangle to a rectangle and returns true if the rectangles intersect.

The method takes a rectangle as input and a rectangle as a parameter:

rectangle(rect_x, rect_y, rect_width, rect_height [, filterindex:configexpression]).intersects(rectangle(other_x, other_y, other_width, other_height))

The left-hand side is the rectangle's rect_x, rect_y, rect_width and rect_height expressions that return the (x, y)-coordinates and the size of the rectangle. The filterindex named parameter is for use with filter indexes as described below. The left-hand side rectangle can be subject to MX-CIF quadtree indexing (point-region quadtrees do not apply).

For the compared-to rectangle on the right-hand side, the other_x, other_y, other_width and other_height expressions return the (x, y)-coordinates and size of the compared-to rectangle.

All expressions must return a number-type and the implementation compares the double-values returned by the expressions.

A rectangle a (the left-hand side) is considered to intersect another rectangle b (the compared-to rectangle on the right-hand side) if:

rect_x + rect_width >= other_x (a is not left of b) and
rect_x <= other_x + other_width (a is not right of b) and
rect_y + rect_height >= other_y (a is not above b) and
rect_y <= other_y + other_height (a is not below b).

Table 18.3. Rectangle-Intersects-Rectangle Examples

Expression	Result
rectangle(10, 20, 5, 5).intersects(rectangle(0, 0, 50, 50))	true
rectangle(10, 20, 5, 5).intersects(rectangle(20, 20, 50, 50))	false
rectangle(10, 20, 5, 5).intersects(rectangle(9, 19, 1, 1))	true
rectangle(10, 20, 5, 5).intersects(rectangle(9, 19, 0.999, 0.999))	false
rectangle(10, 20, 5, 5).intersects(rectangle(15, 25, 1, 1))	true
rectangle(10, 20, 5, 5).intersects(rectangle(15.001, 25.001, 1, 1))	false
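The four conditions above can be checked against the rows of Table 18.3 with a short plain-Java sketch. The class and method names are hypothetical and the code does not use the Esper API; it only restates the documented comparisons on double values.

```java
// Sketch only: restates the documented rectangle-intersects-rectangle rule in plain Java.
class RectangleIntersectsSketch {
    // All four comparisons are inclusive, so rectangles that merely touch do intersect.
    static boolean intersects(double rectX, double rectY, double rectWidth, double rectHeight,
                              double otherX, double otherY, double otherWidth, double otherHeight) {
        return rectX + rectWidth >= otherX
            && rectX <= otherX + otherWidth
            && rectY + rectHeight >= otherY
            && rectY <= otherY + otherHeight;
    }

    public static void main(String[] args) {
        System.out.println(intersects(10, 20, 5, 5, 0, 0, 50, 50));        // true
        System.out.println(intersects(10, 20, 5, 5, 20, 20, 50, 50));      // false
        System.out.println(intersects(10, 20, 5, 5, 9, 19, 1, 1));         // true, corners touch
        System.out.println(intersects(10, 20, 5, 5, 9, 19, 0.999, 0.999)); // false
        System.out.println(intersects(10, 20, 5, 5, 15, 25, 1, 1));        // true, corners touch
        System.out.println(intersects(10, 20, 5, 5, 15.001, 25.001, 1, 1)); // false
    }
}
```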

18.3. Spatial Index - Quadtree

18.3.1. Overview

A quadtree is a tree data structure in which each branch node has exactly four children. Quadtrees are often used to partition a two-dimensional space by recursively subdividing it into four quadrants or regions (source: Wikipedia).

Quadtree indexes can be used for:

Filter indexes, which organize active filters so that they can be searched efficiently. When the runtime receives an event, it consults the filter indexes to determine which statements, if any, must process the event.
Event indexes, which organize properties of events so that they can be searched efficiently. When the runtime performs statement processing it may use event indexes to find correlated events efficiently.

The point-region quadtree is a quadtree for the efficient finding of points that fall inside a given rectangle. Use this index with the point-inside-rectangle method described above.

The MX-CIF quadtree is a quadtree for the efficient finding of rectangles that intersect with a given rectangle. Use this index with the rectangle-intersects-rectangle method described above.

While point-region quadtree and MX-CIF quadtree are similar, they are not compatible and are not the same. In point-region quadtree, only leaf nodes have data. In MX-CIF quadtrees both branch and leaf nodes have data as branches hold rectangles that don't fit any given quadrant. The runtime expands and shrinks both types of trees dynamically based on data by promoting or subdividing a leaf node to branch nodes when adding data and by demoting or merging branches to a leaf node when removing data.

18.3.2. Declaring a Point-Region Quadtree Index

Declaring a point-region quadtree index is the same for both filter indexes and event indexes. Point-region quadtrees are suitable for efficiently finding points inside a rectangle, when there are many points.

The synopsis to declare a point-region quadtree index, as part of a statement, is:

pointregionquadtree(min_x_expression, min_y_expression, 
  width, height [, leaf_capacity_expression [, max_tree_height_expression]])

The min_x_expression, min_y_expression, width and height are index parameter expressions that return the range of the index. The width and height must be greater than zero. The index range rectangle is represented by double-type values internally. A point is inside the index range if x >= minX and y >= minY and x < minX+width and y < minY+height.

Note

An attempt to insert points into the index that are outside of the declared index range causes an exception.

The leaf_capacity_expression is optional and must return a positive integer. It defines the number of coordinates a node may contain before it gets split into regions. The default value is 4.

The max_tree_height_expression is optional and must return an integer value of 2 or more. It defines the maximum depth of the tree. Upon the tree reaching the maximum depth a leaf node does not get split into regions. The default value is 20.
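To illustrate how the index range, leaf capacity and maximum tree height interact, the following is a minimal point-region quadtree sketch in plain Java. It is not the runtime's implementation: the class name, method names and the way the tree height is counted (the root is level 1) are assumptions made for illustration, and the sketch only supports adding points and counting the points inside a rectangle.

```java
// Sketch only: a minimal point-region quadtree, not the implementation used by the runtime.
class PointRegionQuadTreeSketch {
    private final double minX, minY, maxX, maxY; // node region: min inclusive, max exclusive
    private final int leafCapacity, maxTreeHeight, level;
    private java.util.List<double[]> points = new java.util.ArrayList<>(); // leaf data; null for a branch
    private PointRegionQuadTreeSketch[] children;                          // branch data; null for a leaf

    // Root node using the documented defaults: leaf capacity 4 and maximum tree height 20.
    PointRegionQuadTreeSketch(double minX, double minY, double width, double height) {
        this(minX, minY, minX + width, minY + height, 4, 20, 1);
    }

    private PointRegionQuadTreeSketch(double minX, double minY, double maxX, double maxY,
                                      int leafCapacity, int maxTreeHeight, int level) {
        this.minX = minX; this.minY = minY; this.maxX = maxX; this.maxY = maxY;
        this.leafCapacity = leafCapacity; this.maxTreeHeight = maxTreeHeight; this.level = level;
    }

    // Adds a point; points outside the declared index range are rejected with an exception.
    void add(double x, double y) {
        if (!(x >= minX && y >= minY && x < maxX && y < maxY)) {
            throw new IllegalArgumentException("Point (" + x + ", " + y + ") is outside the index range");
        }
        insert(x, y);
    }

    private void insert(double x, double y) {
        if (children != null) {
            childFor(x, y).insert(x, y);
            return;
        }
        points.add(new double[] {x, y});
        // Split a leaf that exceeds its capacity unless the maximum tree height is reached.
        if (points.size() > leafCapacity && level < maxTreeHeight) {
            split();
        }
    }

    private void split() {
        double midX = (minX + maxX) / 2, midY = (minY + maxY) / 2;
        children = new PointRegionQuadTreeSketch[] {
            child(minX, minY, midX, midY), child(midX, minY, maxX, midY),
            child(minX, midY, midX, maxY), child(midX, midY, maxX, maxY)};
        java.util.List<double[]> moved = points;
        points = null; // only leaf nodes hold data
        for (double[] p : moved) {
            childFor(p[0], p[1]).insert(p[0], p[1]);
        }
    }

    private PointRegionQuadTreeSketch child(double x1, double y1, double x2, double y2) {
        return new PointRegionQuadTreeSketch(x1, y1, x2, y2, leafCapacity, maxTreeHeight, level + 1);
    }

    private PointRegionQuadTreeSketch childFor(double x, double y) {
        double midX = (minX + maxX) / 2, midY = (minY + maxY) / 2;
        return children[(x >= midX ? 1 : 0) + (y >= midY ? 2 : 0)];
    }

    // Counts stored points p with rx <= p.x < rx + w and ry <= p.y < ry + h.
    int countInside(double rx, double ry, double w, double h) {
        if (rx >= maxX || rx + w <= minX || ry >= maxY || ry + h <= minY) {
            return 0; // the query rectangle cannot contain any point of this node's region
        }
        int count = 0;
        if (children != null) {
            for (PointRegionQuadTreeSketch c : children) {
                count += c.countInside(rx, ry, w, h);
            }
        } else {
            for (double[] p : points) {
                if (p[0] >= rx && p[0] < rx + w && p[1] >= ry && p[1] < ry + h) {
                    count++;
                }
            }
        }
        return count;
    }

    boolean isLeaf() {
        return children == null;
    }
}
```

With an index range of (0, 0, 100, 100), the root stays a leaf for the first four points and is split into four regions when the fifth point arrives; a query then only visits the regions that overlap the query rectangle.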

18.3.3. Using a Point-Region Quadtree as a Filter Index

The section that summarizes filter indexes is Section 2.18.2, “Filter Indexes”. As there could be many point(...).inside(rectangle) filters active, having a filter index allows the runtime to efficiently match incoming events to statements.

For use of a point-region quadtree index within filter criteria you must:

Define an expression that returns the point-region quadtree configuration, making sure it specifies pointregionquadtree.
Add the filterindex named parameter providing the expression name.

For defining a local or global expression, please consult Section 5.2.9, “Expression Declaration”.

This sample statement defines the point-region quadtree filter index to have a bounding box of (0,0,100,100):

expression myPointRegionQuadtreeSettings { pointregionquadtree(0, 0, 100, 100) } 
select * from RectangleEvent(point(0, 0, filterindex:myPointRegionQuadtreeSettings).inside(rectangle(x, y, width, height)))

The filterindex named parameter instructs the runtime that the settings for the point-region quadtree filter index are provided by the expression myPointRegionQuadtreeSettings, a local expression in this example. For sharing point-region quadtree settings across statements you may use a global expression instead. Please see Section 5.18, “Declaring Global Expressions, Aliases and Scripts: Create Expression”.

If your EPL does not specify filterindex the runtime does not build a point-region quadtree filter index.

If your EPL specifies filterindex the runtime always builds and uses a point-region quadtree filter index. If the compiler analyzes the filter criteria and determines that it cannot use the point-region quadtree filter index, the compiler fails statement validation.

The runtime shares point-region quadtree filter indexes across the runtime within the same event type given that:

Filters have the same rectangle expressions.
Filters use the same filterindex parameter i.e. the text myPointRegionQuadtreeSettings in above example.
Filters use the same point-region quadtree index configuration i.e. pointregionquadtree(0,0,100,100) in above example.

For use with the filterindex named parameter, the following requirements apply towards point expressions:

Point expressions must be a constant, a context-provided built-in property or an event property provided by a previous pattern match within the same pattern.

For use with the filterindex named parameter, the following requirements apply towards rectangle expressions:

Rectangle expressions must be event properties.

18.3.4. Using a Point-Region Quadtree as an Event Index

The section that summarizes event indexes is Section 2.18.3, “Event Indexes”. The create index clause is described in Section 6.9, “Explicitly Indexing Named Windows and Tables”.

Declare a point-region quadtree event index as follows:

create index ... on ... (
  (x_expression, y_expression) pointregionquadtree(pointregion_quadtree_configuration)
)

The x_expression and y_expression expressions form the index columns. The expressions return the (x, y)-coordinates and must return numeric values. Coordinates are represented as double-type values internally. See above for the pointregion_quadtree_configuration point-region quadtree configuration.

For example, assume a table that contains points:

create table PointTable(pointId string primary key, px double, py double)

This example EPL declares an index on the points, with px and py becoming index columns that determine (x, y)-coordinates:

create index PointIndex on PointTable((px, py) pointregionquadtree(0, 0, 100, 100))

The above sample quadtree index expects (x, y)-coordinates that are in the range 0 <= px < 100 and 0 <= py < 100, following the index range rule given in Section 18.3.2.

The example schema for events providing rectangles is:

create schema RectangleEvent(rx double, ry double, w double, h double)

This EPL outputs, upon arrival of a RectangleEvent, all points that fall inside the rectangle:

on RectangleEvent
select pointId from PointTable
where point(px, py).inside(rectangle(rx, ry, w, h))

Internally the runtime does not instantiate point or rectangle objects at all but instead optimizes the expression to comparison between double-type values.

18.3.4.1. Point-Region Quadtree Event Index Usage Notes

Point-region quadtree indexes allow computed values for both index columns and index parameters. For example, the following EPL declares an index wherein the (x, y)-coordinates are the rounded (px/100, py/100)-values. The sample EPL assumes that context.frame is a built-in property as provided by context FramedCtx:

context FramedCtx create index PointIndex on PointTable((Math.round(px/100), Math.round(py/100)) pointregionquadtree(context.frame.startx, context.frame.starty, context.frame.w, context.frame.h))

The compiler compares the index column expressions to the point-inside-rectangle left-hand-side expressions to determine which index to use. For example, if the expression is point(px+1, py+1).inside(rectangle(rx, ry, w, h)), the query planner does not use the index because (px+1, py+1) does not match (Math.round(px/100), Math.round(py/100)). If the expression is point(Math.round(px/100), Math.round(py/100)).inside(rectangle(rx, ry, w, h)), the query planner does use the index because the index column expressions match.

The query planner prefers point-region quadtree over other index types. Index hints are not yet available for query planning with quadtree indexes.

18.3.5. Declaring a MX-CIF Quadtree Index

Declaring a MX-CIF quadtree index is the same for both filter indexes and event indexes. MX-CIF quadtrees are suitable for efficiently finding rectangles that intersect with a rectangle, when there are many rectangles.

The synopsis to declare a MX-CIF quadtree index, as part of a statement, is:

mxcifquadtree(min_x_expression, min_y_expression, 
  width, height [, leaf_capacity_expression [, max_tree_height_expression]])

The min_x_expression, min_y_expression, width and height are index parameter expressions that return the range of the index. The width and height must be greater than zero. The index range rectangle is represented by double-type values internally. A given rectangle must intersect with the index range.

Note

An attempt to insert rectangles into the index that do not intersect with the declared index range causes an exception.

The leaf_capacity_expression is optional and must return a positive integer. It defines the number of coordinates a node may contain before it gets split into regions. The default value is 4.
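The following plain-Java sketch illustrates two ideas from the MX-CIF description: the check that a rectangle intersects the index range, and the decision whether a rectangle fits a single quadrant of a node or spans several quadrants and therefore stays with the branch node. The class name, method names and the handling of rectangles that touch the quadrant boundary are assumptions for illustration and may differ from the runtime's implementation.

```java
// Sketch only: illustrates MX-CIF placement rules, not the implementation used by the runtime.
class MxCifPlacementSketch {
    // Same inclusive comparisons as the rectangle-intersects-rectangle method.
    static boolean intersectsIndexRange(double x, double y, double w, double h,
                                        double minX, double minY, double width, double height) {
        return x + w >= minX && x <= minX + width
            && y + h >= minY && y <= minY + height;
    }

    // Returns 0..3 for the quadrant of the node region that fully contains the rectangle,
    // or -1 when the rectangle spans more than one quadrant and would stay with the branch node.
    // Assumption: a rectangle whose far edge touches the middle line counts as spanning.
    static int quadrantFor(double x, double y, double w, double h,
                           double nodeX, double nodeY, double nodeW, double nodeH) {
        double midX = nodeX + nodeW / 2;
        double midY = nodeY + nodeH / 2;
        boolean lowX = x + w < midX;  // entirely in the lower-x half
        boolean highX = x >= midX;    // entirely in the higher-x half
        boolean lowY = y + h < midY;  // entirely in the lower-y half
        boolean highY = y >= midY;    // entirely in the higher-y half
        if (!(lowX || highX) || !(lowY || highY)) {
            return -1;
        }
        return (highX ? 1 : 0) + (highY ? 2 : 0);
    }

    public static void main(String[] args) {
        System.out.println(quadrantFor(10, 20, 5, 5, 0, 0, 100, 100));   // 0
        System.out.println(quadrantFor(60, 60, 5, 5, 0, 0, 100, 100));   // 3
        System.out.println(quadrantFor(45, 45, 10, 10, 0, 0, 100, 100)); // -1, spans all four quadrants
        System.out.println(intersectsIndexRange(150, 150, 5, 5, 0, 0, 100, 100)); // false
    }
}
```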

18.3.6. Using a MX-CIF Quadtree as a Filter Index

The section that summarizes filter indexes is Section 2.18.2, “Filter Indexes”. As there could be many rectangle(...).intersects(rectangle) filters active, having a filter index allows the runtime to efficiently match incoming events to statements.

For use of a MX-CIF quadtree index within filter criteria you must:

Define an expression that returns the MX-CIF quadtree configuration, making sure it specifies mxcifquadtree.
Add the filterindex named parameter providing the expression name.

For defining a local or global expression, please consult Section 5.2.9, “Expression Declaration”.

This sample statement defines the MX-CIF quadtree filter index to have a bounding box of (0,0,100,100):

expression myMXCIFQuadtreeSettings { mxcifquadtree(0, 0, 100, 100) } 
select * from RectangleEvent(rectangle(10, 20, 5, 5, filterindex:myMXCIFQuadtreeSettings).intersects(rectangle(x, y, width, height)))

The filterindex named parameter instructs the compiler that the settings for the MX-CIF quadtree filter index are provided by the expression myMXCIFQuadtreeSettings, a local expression in this example. For sharing MX-CIF quadtree settings across statements you may use a global expression instead. Please see Section 5.18, “Declaring Global Expressions, Aliases and Scripts: Create Expression”.

If your EPL does not specify filterindex the runtime does not build a MX-CIF quadtree filter index.

If your EPL specifies filterindex the runtime always builds and uses a MX-CIF quadtree filter index. If the compiler analyzes the filter criteria and determines that it cannot use the MX-CIF quadtree filter index, the compiler fails statement validation.

The runtime shares MX-CIF quadtree filter indexes across the runtime within the same event type given that:

Filters have the same rectangle expressions.
Filters use the same filterindex parameter i.e. the text myMXCIFQuadtreeSettings in above example.
Filters use the same MX-CIF quadtree index configuration i.e. mxcifquadtree(0,0,100,100) in above example.

For use with the filterindex named parameter, the following requirements apply towards left-hand side rectangle expressions:

Left-hand side rectangle expressions must be a constant, a context-provided built-in property or an event property provided by a previous pattern match within the same pattern.

For use with the filterindex named parameter, the following requirements apply towards right-hand side rectangle expressions:

Right-hand side rectangle expressions must be event properties.

18.3.7. Using a MX-CIF Quadtree as an Event Index

The section that summarizes event indexes is Section 2.18.3, “Event Indexes”. The create index clause is described in Section 6.9, “Explicitly Indexing Named Windows and Tables”.

Declare a MX-CIF quadtree event index as follows:

create index ... on ... (
  (x_expression, y_expression, width_expression, height_expression) mxcifquadtree(mxcif_quadtree_configuration)
)

The x_expression, y_expression, width_expression and height_expression expressions form the index columns. The expressions return the (x, y)-coordinates and rectangle size and must return numeric values. Coordinates and sizes are represented as double-type values internally. See above for the mxcif_quadtree_configuration MX-CIF quadtree configuration.

For example, assume a table that contains rectangles:

create table RectangleTable(rectangleId string primary key, rx double, ry double, rwidth double, rheight double)

This example EPL declares an index on the rectangles, with rx, ry, rwidth and rheight becoming index columns that determine the (x, y)-coordinates and the sizes:

create index RectangleIndex on RectangleTable((rx, ry, rwidth, rheight) mxcifquadtree(0, 0, 100, 100))

The above sample quadtree index expects rectangles to intersect the rectangle (0, 0, 100, 100).

The example schema for arriving events is:

create schema OtherRectangleEvent(otherX double, otherY double, otherWidth double, otherHeight double)

This EPL outputs, upon arrival of an OtherRectangleEvent, all rectangles stored in the table that intersect the arriving event's rectangle:

on OtherRectangleEvent
select rectangleId from RectangleTable
where rectangle(rx, ry, rwidth, rheight).intersects(rectangle(otherX, otherY, otherWidth, otherHeight))

Internally the runtime does not instantiate rectangle objects at all but instead optimizes the expression to comparison between double-type values.

18.3.7.1. MX-CIF Quadtree Event Index Usage Notes

MX-CIF quadtree indexes allow computed values for both index columns and index parameters. For example, the following EPL declares an index wherein the (x, y)-coordinates and sizes are the rounded (rx/100, ry/100, rwidth/100, rheight/100)-values. The sample EPL assumes that context.frame is a built-in property as provided by context FramedCtx:

context FramedCtx create index RectangleIndex on RectangleTable((Math.round(rx/100), Math.round(ry/100), Math.round(rwidth/100), Math.round(rheight/100)) mxcifquadtree(context.frame.startx, context.frame.starty, context.frame.w, context.frame.h))

The compiler compares the index column expressions to the rectangle-intersects-rectangle left-hand-side expressions to determine which index to use.

The query planner prefers MX-CIF quadtree over other index types. Index hints are not yet available for query planning with quadtree indexes.

18.4. Spatial Types, Functions and Methods from External Libraries

The scope of the compiler and runtime does not include addressing all geographical, topological or spatial processing. We encourage using external libraries for such processing. EPL is easy to extend with functions, methods, data types and data structures provided by external libraries.

For example, assume you would like to use a geometric data type and a geographical distance function. Please consider using the Java Topology Suite (JTS) (https://www.locationtech.org), which provides a fairly complete set of geo-computing functionality.

To pick an example data type, the compiler and runtime allow any class, such as the JTS Geometry class (org.locationtech.jts.geom.Geometry), to become an event type, an event property type or a column type in a named window or table. The compiler and runtime also allow the use of such a class anywhere within EPL expressions.

The EPL snippet below declares an event type that has a Geometry property:

create schema ShapeArrivalEvent(shapeId string, geometry org.locationtech.jts.geom.Geometry) // use imports to remove the need to have a package name

EPL can call methods and your application can declare its own functions. Registering your own EPL function is described in Section 20.2, “Single-Row Function”.

This sample EPL outputs events that have a distance of more than 100, comparing the geometry of the current event to the geometry of each event that arrived within the last 1 minute:

select * from ShapeArrivalEvent as e1 unidirectional, ShapeArrivalEvent.time(1 minute) as e2
where e1.geometry.distance(e2.geometry) > 100

Chapter 19. EPL Reference: Data Flow

19.1. Introduction

Data flows in EPL have the following purposes:

Support for data flow programming and flow-based programming.
Declarative and runtime manageable integration of input and output adapters that may be provided by EsperIO or by an application.
Remove the need to use an event bus achieving dataflow-only visibility of events and event types for performance gains.

Data flow operators communicate via streams of either underlying event objects or wrapped events. Underlying event objects are POJO, Map, Object-array or DOM/XML. Wrapped events are represented by EventBean instances that associate type information to underlying event objects.

For more information on data flow programming or flow-based programming please consult the Wikipedia FBP Article.

EPL offers a number of useful built-in operators that can be combined in a graph to program a data flow. In addition EsperIO offers prebuilt operators that act as sources or sinks of events. An application can easily create and use its own data flow operators.

Using data flows, an application can provide events to the data flow operators directly without using a runtime's event bus. Not using an event bus (as represented by the sendEventType methods of EPEventService) can achieve performance gains as the runtime does not need to match events to statements and the runtime does not need to wrap underlying event objects in EventBean instances.

Data flows also allow for finer-grained control over threading, synchronous and asynchronous operation.

19.2. Usage

19.2.1. Overview

Your application declares a data flow using create dataflow dataflow-name. Declaring the data flow causes the EPL compiler to validate the syntax and some aspects of the data flow graph of operators. Declaring the data flow does not actually instantiate or execute a data flow. Resolving event types and instantiating operators (as required) takes place at time of data flow instantiation.

After your application has declared a data flow, it can instantiate the data flow and execute it. A data flow can be instantiated as many times as needed and each data flow instance can only be executed once.

The example EPL below creates a data flow that, upon execution, outputs the text Hello World to console and then ends.

create dataflow HelloWorldDataFlow
  BeaconSource -> helloworld.stream { text: 'hello world' , iterations: 1}
  LogSink(helloworld.stream) {}

The sample data flow above declares a BeaconSource operator parameterized by the "hello world" text and 1 iteration. The -> keyword reads as produces streams. The BeaconSource operator produces a single stream named helloworld.stream. The LogSink operator receives this stream and prints it unformatted.

The next program code snippet declares the data flow to the runtime:

String epl = "create dataflow HelloWorldDataFlow\n" +
  "BeaconSource -> helloworldStream { text: 'hello world' , iterations: 1}\n" +
  "LogSink(helloworldStream) {}";

Configuration configuration = new Configuration();
CompilerArguments compilerArguments = new CompilerArguments(configuration);
EPCompiled compiled = EPCompilerProvider.getCompiler().compile(epl, compilerArguments);
EPDeployment deployment = runtime.getDeploymentService().deploy(compiled);

After declaring a data flow to a runtime, your application can then instantiate and execute the data flow.

The following program code snippet instantiates the data flow:

EPDataFlowInstance instance =
  runtime.getDataFlowService().instantiate(deployment.getDeploymentId(), "HelloWorldDataFlow");

A data flow instance is represented by an EPDataFlowInstance object.

The next code snippet executes the data flow instance:

instance.run();

By using the run method of EPDataFlowInstance the runtime executes the data flow using the same thread (blocking execute) and returns when the data flow completes. A data flow completes when all operators receive final markers.

The hello world data flow simply prints an unformatted Hello World string to console. Please check the built-in operator reference for BeaconSource and LogSink for more options.

19.2.2. Syntax

The synopsis for declaring a data flow is:

create dataflow name
	[schema_declarations]
	[operator_declarations]

After create dataflow follows the data flow name and a mixed list of event type (schema) declarations and operator declarations.

Schema declarations define an event type. Specify any number of create schema clauses as part of the data flow declaration followed by a comma character to end each schema declaration. The syntax for create schema is described in Section 5.15, “Declaring an Event Type: Create Schema”.

All event types that are defined as part of a data flow are private to the data flow and not available to other statements. To define event types that are available across data flows and other statements, use a create schema statement, runtime or static configuration.

Annotations as well as expression declarations and scripts can also be prepended to the data flow declaration.

19.2.2.1. Operator Declaration

For each operator, declare the operator name, input streams, output streams and operator parameters.

The syntax for declaring a data flow operator is:

operator_name [(input_streams)]  [-> output_streams] {
  [parameter_name : parameter_value_expr] [, ...]
}

The operator name is an identifier that identifies an operator.

If the operator accepts input streams then those may be listed in parentheses after the operator name, see Section 19.2.2.2, “Declaring Input Streams”.

If the operator can produce output streams then specify -> followed by a list of output stream names and types. See Section 19.2.2.3, “Declaring Output Streams”.

Following the input and output stream declaration provide curly brackets ({}) containing operator parameters. See Section 19.2.2.4, “Declaring Operator Parameters”.

An operator that receives no input streams, produces no output streams and has no parameters assigned to it is shown in this EPL example data flow:

create dataflow MyDataFlow
  MyOperatorSimple {}

The next EPL shows a data flow that consists of an operator MyOperator that receives a single input stream myInStream and produces a single output stream myOutStream holding MyEvent events. The EPL configures the operator parameter myParameter with a value of 10:

create dataflow MyDataFlow
  create schema MyEvent as (id string, price double),
  MyOperator(myInStream) -> myOutStream<MyEvent> {
    myParameter : 10
  }

The next sections outline input stream, output stream and parameter assignment in greater detail.

19.2.2.2. Declaring Input Streams

If the operator receives input streams, list the input stream names within parentheses following the operator name. As part of the input stream declaration you may use the as keyword to assign a short alias name to one or multiple input streams.

The EPL shown next declares myInStream and assigns the alias mis:

create dataflow MyDataFlow
  MyOperator(myInStream as mis) {}

Multiple input streams can be listed separated by comma. We use the term input port to mean the ordinal number of the input stream in the order the input streams are listed.

The EPL as below declares two input streams and assigns an alias to each. The runtime assigns streamOne to input port 0 (zero) and streamTwo to port 1.

create dataflow MyDataFlow
  MyOperator(streamOne as one, streamTwo as two) {}

You may assign multiple input streams to the same port and alias by placing the stream names in parentheses. All input streams for the same port must have the same event type.

The next statement declares an operator that receives input streams streamA and streamB both assigned to port 0 (zero) and alias streamsAB:

create dataflow MyDataFlow
  MyOperator( (streamA, streamB) as streamsAB) {}
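The port numbering rule can be illustrated with a small stand-alone Java sketch. The class InputPortSketch and its method are hypothetical and not part of the Esper API; the sketch only assumes that each entry of the input declaration, whether a single stream or a parenthesized group, occupies one port in declaration order.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; Esper performs this assignment internally.
final class InputPortSketch {

    // Each inner list holds the stream names declared for one input port.
    // The index of the inner list is the port number.
    static Map<String, Integer> assignPorts(List<List<String>> declaredInputs) {
        Map<String, Integer> ports = new LinkedHashMap<>();
        for (int port = 0; port < declaredInputs.size(); port++) {
            for (String streamName : declaredInputs.get(port)) {
                ports.put(streamName, port);
            }
        }
        return ports;
    }

    public static void main(String[] args) {
        // MyOperator(streamOne as one, streamTwo as two): two ports
        System.out.println(assignPorts(Arrays.asList(
                Arrays.asList("streamOne"), Arrays.asList("streamTwo"))));
        // MyOperator( (streamA, streamB) as streamsAB): both streams share port 0
        System.out.println(assignPorts(Arrays.asList(
                Arrays.asList("streamA", "streamB"))));
    }
}
```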

Input and output stream names may contain the dot character.

The following is also valid EPL:

create dataflow MyDataFlow
  MyOperator(my.in.stream) -> my.out.stream {}

Note

Reserved keywords may not appear in the stream name.

19.2.2.3. Declaring Output Streams

In case the operator produces output streams, list the output streams after the -> keyword. Multiple output streams can be listed separated by comma. We use the term output port to mean the ordinal number of the output stream in the order the output streams are listed.

The sample EPL below declares an operator that produces two output streams my.out.one and my.out.two.

create dataflow MyDataFlow
  MyOperator -> my.out.one, my.out.two {}

Each output stream can be assigned optional type information within less-than and greater-than characters (<>). Type information is required if the operator cannot deduce the output type from the input type and the operator does not declare explicit output type(s). The event type name can either be an event type defined within the same data flow or an event type defined in the runtime.

This EPL example declares an RFIDSchema event type based on an object-array event representation and associates the output stream rfid.stream with the RFIDSchema type. The stream rfid.stream therefore carries object-array (Object[]) typed objects according to schema RFIDSchema:

create dataflow MyDataFlow
  create objectarray schema RFIDSchema (tagId string, locX double, locY double),
  MyOperator -> rfid.stream<RFIDSchema> {}

The keyword eventbean is reserved: Use eventbean<type-name> to indicate that a stream carries EventBean instances of the given type instead of the underlying event object.

This EPL example declares an RFIDSchema event type based on an object-array event representation and associates the output stream rfid.stream with the event type, such that the stream rfid.stream carries EventBean objects:

create dataflow MyDataFlow
  create objectarray schema RFIDSchema (tagId string, locX double, locY double),
  MyOperator -> rfid.stream<eventbean<RFIDSchema>> {}

Use a question mark (?) to indicate that the type of events is not known in advance.

In the next EPL the stream my.stream carries EventBean instances of any type:

create dataflow MyDataFlow
  MyOperator -> my.stream<eventbean<?>> {}

19.2.2.4. Declaring Operator Parameters

Operators can receive constants, objects, EPL expressions and complete statements as parameters. All parameters are listed within curly brackets ({}) after input and output stream declarations. Curly brackets are required as a separator even if the operator has no parameters.

The syntax for parameters is:

name : value_expr [,...]

The parameter name is an identifier that is followed by the colon (:) or equals (=) character and a value expression. A value expression can be any expression, system property, JSON notation object or statement. Parameters are separated by comma character.

The next EPL demonstrates operator parameters that are scalar values:

create dataflow MyDataFlow
  MyOperator {
    stringParam : 'sample',
    secondString : "double-quotes are fine",
    intParam : 10
  }

Operator parameters can be any EPL expression including expressions that use variables. Subqueries, aggregations and the prev and prior functions cannot be applied here.

The EPL shown below lists operator parameters that are expressions:

create dataflow MyDataFlow
  MyOperator {
    intParam : 24*60*60,
    threshold : var_threshold	// a variable defined in the runtime
  }

The special systemProperties property name is reserved for obtaining the value of a system property.

The following EPL sets operator parameters to a value obtained from a system property:

create dataflow MyDataFlow
  MyOperator {
    someSystemProperty : systemProperties('mySystemProperty') 
  }

Any JSON value can also be used as a value. Use square brackets [] for JSON arrays. Use curly brackets {} to hold nested Map or other object values. Provide the special class property to instantiate a given instance by class name. The runtime populates the respective array, Map or Object as specified in the JSON parameter value.

The below EPL demonstrates operator parameters that are JSON values:

create dataflow MyDataFlow
  MyOperator {
    myStringArray: ['a', "b"],
    myMapOrObject: {
      a : 10,
      b : 'xyz',
    },
    myInstance: {
      class: 'com.myorg.myapp.MyImplementation',
      myValue : 'sample'
    }
  }

The special parameter name select is reserved for use with EPL select statements. Please see the Select built-in operator for an example.

19.3. Built-In Operators

The below table summarizes the built-in data flow operators available:

Table 19.1. Built-in Operators

Operator	Description
BeaconSource	Utility source that generates events. See Section 19.3.1, “BeaconSource”.
Emitter	Special operator for injecting events into a stream. See Section 19.4.5, “Start Captive”.
EPStatementSource	One or more statements act as event sources. See Section 19.3.2, “EPStatementSource”.
EventBusSink	The event bus is the sink: Sends events from the data flow into the event bus. See Section 19.3.3, “EventBusSink”.
EventBusSource	The event bus is the source: Receives events from the event bus into the data flow. See Section 19.3.4, “EventBusSource”.
Filter	Filters an input stream and produces an output stream containing the events passing the filter criteria. See Section 19.3.5, “Filter”.
LogSink	Utility sink that outputs events to console or log. See Section 19.3.6, “LogSink”.
Select	An EPL select statement that executes on the input stream events. See Section 19.3.7, “Select”.

The below table summarizes the built-in EsperIO data flow operators. Please see the EsperIO documentation and source for more information.

Table 19.2. EsperIO Built-in Operators

Operator	Description
AMQPSource	Attaches to AMQP broker to receive messages to process.
AMQPSink	Attaches to AMQP broker to send messages.
FileSource	Reads one or more files and produces events from file data.
FileSink	Writes one or more files from events received.

19.3.1. BeaconSource

The BeaconSource operator generates events and populates event properties.

The BeaconSource operator does not accept any input streams and has no input ports.

The BeaconSource operator must have a single output stream. When the BeaconSource operator completes generating events according to the number of iterations provided, or when it is cancelled, it outputs a final marker to the output stream.

Parameters for the BeaconSource operator are all optional parameters:

Table 19.3. BeaconSource Parameters

Name	Description
initialDelay	Specifies the number of seconds delay before producing events.
interval	Time interval between events. Takes an integer or double-typed value for the number of seconds. The interval is zero when not provided.
iterations	Number of events produced. Takes an integer value. When not provided the operator produces events until the data flow instance gets cancelled.

Event properties to be populated can simply be added to the parameters.

If your declaration provides an event type for the output stream then BeaconSource will populate event properties of the underlying events. If no event type is specified, BeaconSource creates an anonymous object-array event type to carry the event properties that are generated and associates this type with its output stream.

Examples are:

create dataflow MyDataFlow
  create schema SampleSchema(tagId string, locX double),	// sample type			
			
  // BeaconSource that produces empty object-array events without delay 
  // or interval until cancelled.
  BeaconSource -> stream.one {}
  
  // BeaconSource that produces one SampleSchema event populating event properties
  // from a user-defined function "generateTagId" and the provided values.
  BeaconSource -> stream.two<SampleSchema> {
    iterations : 1,
    tagId : generateTagId(),
    locX : 10
  }
  
  // BeaconSource that produces 10 object-array events populating
  // the price property with a random value.
  BeaconSource -> stream.three {
    iterations : 10,
    interval : 10, // every 10 seconds
    initialDelay : 5, // start after 5 seconds
    price : Math.random() * 100
  }

19.3.2. EPStatementSource

The EPStatementSource operator maintains a subscription to the results of one or more statements. The operator produces the statement output events.

The EPStatementSource operator does not accept any input streams and has no input ports.

The EPStatementSource operator must have a single output stream. It does not generate a final or other marker.

Either the statement name or the statement filter parameter is required:

Table 19.4. EPStatementSource Parameters

Name	Description
collector	Optional parameter, used to transform statement output events to submitted events.
statementName	Name of the statement that produces events. The statement does not need to exist at the time of data flow instantiation.
statementFilter	Implementation of the `EPDataFlowEPStatementFilter` that returns true for each statement that produces events. Statements do not need to exist at the time of data flow instantiation.

If a statement name is provided, the operator subscribes to output events of the statement if the statement exists or when it gets created at a later point in time.

If a statement filter is provided instead, the operator subscribes to output events of all statements for which the filter's pass method returns true. This includes statements that currently exist as well as statements that get created at a later point in time.

The collector can be specified to transform output events. If no collector is specified the operator submits the underlying events of the insert stream received from the statement. The collector object must implement the interface EPDataFlowIRStreamCollector.

Examples are:

create dataflow MyDataFlow
  create schema SampleSchema(tagId string, locX double),	// sample type			
			
  // Consider only the statement named MySelectStatement when it exists.
  // No transformation.
  EPStatementSource -> stream.one<eventbean<?>> {
    statementName : 'MySelectStatement'
  }
  
  // Consider all statements that match the filter object provided.
  // No transformation.
  EPStatementSource -> stream.two<eventbean<?>> {
    statementFilter : {
      class : 'com.mycompany.filters.MyStatementFilter'
    }
  }
  
  // Consider all statements that match the filter object provided.
  // With collector that performs transformation.
  EPStatementSource -> stream.three<SampleSchema> {
    collector : {
      class : 'com.mycompany.filters.MyCollector'
    },
    statementFilter : {
      class : 'com.mycompany.filters.MyStatementFilter'
    }
  }

19.3.3. EventBusSink

The EventBusSink operator sends events received from a data flow into the event bus. Any statement that looks for any of the events gets triggered, equivalent to the sendEventType methods on EPEventService or the insert into clause.

The EventBusSink operator accepts any number of input streams. The operator forwards all events arriving on any input ports to the event bus, equivalent to the sendEventType methods on EPEventService.

The EventBusSink operator cannot declare any output streams.

Parameters for the EventBusSink operator are all optional parameters:

Table 19.5. EventBusSink Parameters

Name	Description
collector	Optional parameter, used to transform data flow events to event bus events.

The collector can be specified to transform data flow events to event bus events. If no collector is specified the operator submits the events directly to the event bus. The collector object must implement the interface EPDataFlowEventCollector.

Examples are:

create dataflow MyDataFlow
  BeaconSource -> instream<SampleSchema> {}  // produces a sample stream
  
  // Send SampleSchema events produced by beacon to the event bus.
  EventBusSink(instream) {}
  
  // Send SampleSchema events produced by beacon to the event bus.
  // With collector that performs transformation.
  EventBusSink(instream) {
    collector : {
      class : 'com.mycompany.filters.MyCollector'
    }
  }

19.3.4. EventBusSource

The EventBusSource operator receives events from the event bus and produces an output stream of the events received. By event bus we mean any event visible to the runtime, either because the application sent the event via any of the sendEventType methods on EPEventService or because statements populated streams as a result of insert into.

The EventBusSource operator does not accept any input streams and has no input ports.

The EventBusSource operator must have a single output stream. It does not generate a final or other marker. The event type declared for the output stream is the event type of events received from the event bus.

All parameters to EventBusSource are optional:

Table 19.6. EventBusSource Parameters

Name	Description
collector	Optional parameter, used to transform event bus events to submitted events.
filter	Filter expression for event bus matching.

The collector can be specified to transform output events. If no collector is specified the operator submits the underlying events of the stream received from the event bus. The collector object must implement the interface EPDataFlowEventBeanCollector.

The filter is an expression that the event bus compiles and efficiently matches even in the presence of a large number of event bus sources. The filter expression must return a boolean-typed value, returning true for those events that the event bus passes to the operator.

Examples are:

create dataflow MyDataFlow

  // Receive all SampleSchema events from the event bus.
  // No transformation.
  EventBusSource -> stream.one<SampleSchema> {}
  
  // Receive all SampleSchema events with tag id '001' from the event bus.
  // No transformation.
  EventBusSource -> stream.two<SampleSchema> {
    filter : tagId = '001'
  }

  // Receive all SampleSchema events from the event bus.
  // With collector that performs transformation.
  EventBusSource -> stream.three<SampleSchema> {
    collector : {
      class : 'com.mycompany.filters.MyCollector'
    },
  }

19.3.5. Filter

The Filter operator filters an input stream and produces an output stream containing the events passing the filter criteria. If a second output stream is provided, the operator sends events not passing filter criteria to that output stream.

The Filter operator accepts a single input stream.

The Filter operator requires one or two output streams. The event type of the input and output stream(s) must be the same. The first output stream receives the matching events according to the filter expression. If declaring two output streams, the second stream receives non-matching events.

The Filter operator has a single required parameter:

Table 19.7. Filter Parameters

Name	Description
filter	The filter criteria expression.

Examples are:

create dataflow MyDataFlow
  create schema SampleSchema(tagId string, locX double),	// sample type
  BeaconSource -> samplestream<SampleSchema> {}  // sample source
  
  // Filter all events that have a tag id of '001'
  Filter(samplestream) -> tags_001 {
    filter : tagId = '001' 
  }
  
  // Filter all events that have a tag id of '001', 
  // putting all other events into the second stream
  Filter(samplestream) -> tags_001, tags_other {
    filter : tagId = '001' 
  }
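The routing rule for one versus two output streams can be sketched in plain Java. The class FilterOperatorSketch below is hypothetical and does not use or reproduce the Esper implementation; it only assumes that a predicate stands in for the filter expression and that two lists stand in for the two output streams.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.Predicate;

// Hypothetical illustration of the Filter operator's routing, not Esper code.
final class FilterOperatorSketch {

    // Returns two lists: index 0 holds the events passing the filter (first
    // output stream), index 1 holds the remaining events (second output stream).
    static <T> List<List<T>> route(List<T> input, Predicate<T> filter) {
        List<T> matching = new ArrayList<>();
        List<T> nonMatching = new ArrayList<>();
        for (T event : input) {
            if (filter.test(event)) {
                matching.add(event);
            } else {
                nonMatching.add(event);
            }
        }
        List<List<T>> outputs = new ArrayList<>();
        outputs.add(matching);
        outputs.add(nonMatching);
        return outputs;
    }

    public static void main(String[] args) {
        // Tag ids stand in for SampleSchema events; the predicate mirrors tagId = '001'.
        List<List<String>> outputs = route(Arrays.asList("001", "002", "001"), "001"::equals);
        System.out.println("tags_001   = " + outputs.get(0));
        System.out.println("tags_other = " + outputs.get(1));
    }
}
```

When only one output stream is declared, the second list corresponds to events that are simply dropped.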

19.3.6. LogSink

The LogSink operator outputs events to console or log file in JSON, XML or a built-in format (the default).

The LogSink operator accepts any number of input streams. All events arriving on any input ports are logged.

The LogSink operator cannot declare any output streams.

Parameters for the LogSink operator are all optional parameters:

Table 19.8. LogSink Parameters

Name	Description
format	Specify format as a string value: `json` for JSON-formatted output, `xml` for XML-formatted output and `summary` (default) for a built-in format.
layout	Pattern string according to which output is formatted. Place `%df` for data flow name, `%p` for port number, `%i` for data flow instance id, `%t` for title, `%e` for event data.
log	Boolean true (default) for log output, false for console output.
linefeed	Boolean true (default) for line feed, false for no line feed.
title	String title text prepended to output.

Examples are:

create dataflow MyDataFlow
  BeaconSource -> instream {}  // produces sample stream to use below
  
  // Output textual event to log using defaults.
  LogSink(instream) {}
  
  // Output JSON-formatted to console.
  LogSink(instream) {
    format : 'json',
    layout : '%t [%e]',
    log : false,
    linefeed : true,
    title : 'My Custom Title:'
  }
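To make the layout placeholders concrete, the following stand-alone Java sketch expands a layout string in a single pass. LogSinkLayoutSketch is a hypothetical helper and not how Esper implements LogSink; the only facts taken from the table above are the placeholder names and their meanings.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration of layout expansion, not the Esper implementation.
final class LogSinkLayoutSketch {

    // Matches %df, %p, %i, %t and %e; "df" is listed first so it is matched as a unit.
    private static final Pattern PLACEHOLDER = Pattern.compile("%(df|p|i|t|e)");

    static String render(String layout, String dataFlowName, int port,
                         String instanceId, String title, String eventText) {
        Map<String, String> values = new HashMap<>();
        values.put("df", dataFlowName);
        values.put("p", Integer.toString(port));
        values.put("i", instanceId);
        values.put("t", title);
        values.put("e", eventText);

        Matcher matcher = PLACEHOLDER.matcher(layout);
        StringBuffer out = new StringBuffer();
        while (matcher.find()) {
            String value = values.get(matcher.group(1));
            // quoteReplacement keeps '$' and '\' in event text from being interpreted.
            matcher.appendReplacement(out, Matcher.quoteReplacement(value == null ? "" : value));
        }
        matcher.appendTail(out);
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(render("%t [%e]", "MyDataFlow", 0, "instance-1",
                "My Custom Title:", "{\"price\":42.0}"));
    }
}
```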

19.3.7. Select

The Select operator is configured with an EPL select statement. It applies events from input streams to the select statement and outputs results either continuously or when the final marker arrives.

The Select operator accepts one or more input streams.

The Select operator requires a single output stream.

The Select operator requires the select parameter; all other parameters are optional:

Table 19.9. Select Operator Parameters

Name	Description
iterate	Boolean flag: false (default) to output results continuously, true to output results only upon arrival of the final marker.
select	EPL `select` statement in parentheses.

Set the optional iterate flag to false (the default) to have the operator output results continuously. Set the iterate flag to true to indicate that the operator outputs results only when the final marker arrives. If iterate is true then output rate limiting clauses are not supported.

The select parameter is required and provides an EPL select statement within parentheses. For each input port the statement should list the input stream name or the alias name in the from clause. Only filter-based streams are allowed in the from clause; patterns and named windows are not supported. Also not allowed are the insert into clause, the irstream keyword and subselects.

The Select operator determines the event type of output events based on the select clause. It is not necessary to declare an event type for the output stream.

Examples are:

create dataflow MyDataFlow
  create schema SampleSchema(tagId string, locX double),	// sample type			
  BeaconSource -> instream<SampleSchema> {}  // sample stream
  BeaconSource -> secondstream<SampleSchema> {}  // sample stream
  
  // Simple continuous count of events
  Select(instream) -> outstream {
    select: (select count(*) from instream)
  }
  
  // Demonstrate use of alias
  Select(instream as myalias) -> outstream {
    select: (select count(*) from myalias)
  }
  
  // Output only when the final marker arrives
  Select(instream as myalias) -> outstream {
    select: (select count(*) from myalias),
    iterate: true
  }

  // Same input port for the two sample streams
  Select( (instream, secondstream) as myalias) -> outstream {
    select: (select count(*) from myalias)
  }

  // A join with multiple input streams,
  // joining the last event per stream forming pairs
  Select(instream, secondstream) -> outstream {
    select: (select a.tagId, b.tagId 
        from instream#lastevent as a, secondstream#lastevent as b)
  }
  
  // A join with multiple input streams and using aliases.
  Select(instream as S1, secondstream as S2) -> outstream {
    select: (select a.tagId, b.tagId 
        from S1#lastevent as a, S2#lastevent as b)
  }

19.4. API

This section outlines the steps to declare, instantiate, execute and cancel or complete data flows.

19.4.1. Declaring a Data Flow

Compile a data flow the same as any other statement and deploy the compiled module. The EPStatementObjectModel statement object model can also be used to compile a data flow.

Annotations that are listed at the top of the EPL text are applied to all statements and operators in the data flow. Annotations listed for a specific operator apply to that operator only.

The next program code snippet declares a data flow to the runtime:

String epl = "@Name('MyStatementName') create dataflow HelloWorldDataFlow\n" +
  "BeaconSource -> helloworldStream { text: 'hello world' , iterations: 1}\n" +
  "LogSink(helloworldStream) {}";

Configuration configuration = new Configuration();
CompilerArguments compilerArguments = new CompilerArguments(configuration);
EPCompiled compiled = EPCompilerProvider.getCompiler().compile(epl, compilerArguments);
EPDeployment deployment = runtime.getDeploymentService().deploy(compiled);

The statement name that can be assigned to the statement is used only for statement management. Your application may undeploy the statement declaring the data flow thereby making the data flow unavailable for instantiation. Existing instances of the data flow are not affected by an undeploy of the statement that declares the data flow.

Listeners or the subscriber to the statement declaring a data flow receive no events or other output. The statement declaring a data flow returns no rows when iterated.

19.4.2. Instantiating a Data Flow

The com.espertech.esper.common.client.dataflow.core.EPDataFlowService available via getDataFlowService on EPRuntime manages declared data flows.

Use the instantiate method on EPDataFlowService to instantiate a data flow after it has been declared. Pass the deployment id, the data flow name and optional instantiation options to the method. A data flow can be instantiated any number of times.

A data flow instance is represented by an instance of EPDataFlowInstance. Each instance has a state as well as methods to start, run, join and cancel as well as methods to obtain execution statistics.

Various optional arguments including operator parameters can be passed to instantiate via the EPDataFlowInstantiationOptions object as explained in more detail below.

The following code snippet instantiates the data flow:

EPDataFlowInstance instance =
  runtime.getDataFlowService().instantiate(deployment.getDeploymentId(), "HelloWorldDataFlow");

The runtime does not track or otherwise retain data flow instances in memory. It is up to your application to retain data flow instances as needed.

Each data flow instance has an associated state. The start state is EPDataFlowState.INSTANTIATED. The end state is either COMPLETED or CANCELLED.

The following table outlines all states:

Table 19.10. Data Flow Instance States

State	Description
INSTANTIATED	Start state, applies when a data flow instance has been instantiated and has not executed.
RUNNING	A data flow instance transitions from instantiated to running when any of the `start`, `run` or `startCaptive` methods are invoked.
COMPLETED	A data flow instance transitions from running to completed when all final markers have been processed by all operators.
CANCELLED	A data flow instance transitions from running to cancelled when your application invokes the `cancel` method on the data flow instance.
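The transitions in the table can be summarized as a small state machine. The enum FlowStateSketch below is a hypothetical mirror of the table written for illustration; it is not the EPDataFlowState class, and it encodes only the transitions that the table describes.

```java
import java.util.EnumSet;
import java.util.Set;

// Hypothetical mirror of the state table above, not the Esper EPDataFlowState enum.
enum FlowStateSketch {
    INSTANTIATED, RUNNING, COMPLETED, CANCELLED;

    // States that can directly follow this state according to the table.
    Set<FlowStateSketch> next() {
        switch (this) {
            case INSTANTIATED:
                return EnumSet.of(RUNNING);               // start, run or startCaptive
            case RUNNING:
                return EnumSet.of(COMPLETED, CANCELLED);  // final markers or cancel
            default:
                return EnumSet.noneOf(FlowStateSketch.class); // end states
        }
    }

    boolean canTransitionTo(FlowStateSketch target) {
        return next().contains(target);
    }

    public static void main(String[] args) {
        for (FlowStateSketch state : values()) {
            System.out.println(state + " -> " + state.next());
        }
    }
}
```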

19.4.3. Executing a Data Flow

After your application has instantiated a data flow instance it can execute the data flow instance using either the start, run or startCaptive methods.

Use the start method to have the runtime allocate a thread for each source operator. Execution is non-blocking. Use the join method to have one or more threads join a data flow instance execution.

Use the run method to have the runtime use the current thread to execute the single source operator. Multiple source operators are not allowed when using run.

Use the startCaptive method to have the runtime return all Runnable instances and emitters, for the purpose of having complete control over execution. The runtime allocates no threads and does not perform any logic for the data flow unless your application employs the Runnable instances and emitters returned by the method.

The next code snippet executes the data flow instance as a blocking call:

instance.run();

By using the run method of EPDataFlowInstance the runtime executes the data flow instance using the same thread (blocking execute) and returns when the data flow instance completes. A data flow instance completes when all operators receive final markers.

The hello world data flow simply prints an unformatted Hello World string to console. The BeaconSource operator generates a final marker when it finishes its single iteration. The data flow instance thus transitions to COMPLETED after the LogSink operator receives the final marker, and the thread invoking the run method returns.

The next code snippet executes the data flow instance as a non-blocking call:

instance.start();

Use the cancel method to cancel execution of a running data flow instance:

instance.cancel();

Use the join method to join execution of a running data flow instance, causing the joining thread to block until the data flow instance either completes or is cancelled:

instance.join();
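The difference between the blocking and non-blocking modes can be modelled with plain Java threads. The class ExecutionModesSketch below is a hypothetical model that uses no Esper classes: each Runnable stands in for one source operator, and the method names merely echo start, run and join on EPDataFlowInstance.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical model of the execution modes, not the Esper implementation.
final class ExecutionModesSketch {
    private final List<Runnable> sourceOperators;
    private final List<Thread> threads = new ArrayList<>();

    ExecutionModesSketch(List<Runnable> sourceOperators) {
        this.sourceOperators = new ArrayList<>(sourceOperators);
    }

    // Non-blocking: allocates one thread per source operator and returns immediately.
    void start() {
        for (Runnable source : sourceOperators) {
            Thread thread = new Thread(source);
            threads.add(thread);
            thread.start();
        }
    }

    // Blocking: the calling thread executes the single source operator.
    void run() {
        if (sourceOperators.size() != 1) {
            throw new IllegalStateException("run requires exactly one source operator");
        }
        sourceOperators.get(0).run();
    }

    // Blocks the caller until every thread created by start has finished.
    void join() {
        for (Thread thread : threads) {
            try {
                thread.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                throw new IllegalStateException("interrupted while joining", e);
            }
        }
    }

    public static void main(String[] args) {
        List<Runnable> sources = new ArrayList<>();
        sources.add(() -> System.out.println("source A done"));
        sources.add(() -> System.out.println("source B done"));
        ExecutionModesSketch flow = new ExecutionModesSketch(sources);
        flow.start();   // returns at once
        flow.join();    // waits for both sources
        System.out.println("all sources finished");
    }
}
```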

19.4.4. Instantiation Options

The EPDataFlowInstantiationOptions object that can be passed to the instantiate method may be used to customize the operator graph, operator parameters and execution of the data flow instance.

Passing runtime parameters to data flow operators is easiest using the addParameterURI method. The first parameter is the data flow operator name and the operator parameter name separated by the slash character. The second parameter is the value object.

For example, in order to pass the file name to the FileSource operator at runtime, use the following code:

EPDataFlowInstantiationOptions options = new EPDataFlowInstantiationOptions();
options.addParameterURI("FileSource/file", filename);
EPDataFlowInstance instance = runtime.getDataFlowService()
  .instantiate(deployment.getDeploymentId(), "MyFileReaderDataFlow", options);
instance.run();

The optional operatorProvider member takes an implementation of the EPDataFlowOperatorProvider interface. The runtime invokes this provider to obtain operator instances.

The optional parameterProvider member takes an implementation of the EPDataFlowOperatorParameterProvider interface. The runtime invokes this provider to obtain operator parameter values. The values override the values provided via parameter URI above.
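The precedence between parameter URIs and a parameter provider can be sketched as follows. ParameterResolutionSketch is a hypothetical stand-alone class, not the EPDataFlowInstantiationOptions implementation. It assumes that a provider returning null means "no value supplied"; the rule that provider values override URI values is taken from the text above.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.BiFunction;

// Hypothetical illustration of parameter resolution order, not Esper code.
final class ParameterResolutionSketch {
    private final Map<String, Object> uriValues = new HashMap<>();

    // The URI is "<operator name>/<parameter name>", for example "FileSource/file".
    void addParameterURI(String uri, Object value) {
        int slash = uri.indexOf('/');
        if (slash <= 0 || slash == uri.length() - 1) {
            throw new IllegalArgumentException("expected operatorName/parameterName: " + uri);
        }
        uriValues.put(uri, value);
    }

    // The provider is consulted first; a null result falls back to the URI value.
    Object resolve(String operatorName, String parameterName,
                   BiFunction<String, String, Object> provider) {
        Object fromProvider = provider == null ? null : provider.apply(operatorName, parameterName);
        if (fromProvider != null) {
            return fromProvider;
        }
        return uriValues.get(operatorName + "/" + parameterName);
    }

    public static void main(String[] args) {
        ParameterResolutionSketch options = new ParameterResolutionSketch();
        options.addParameterURI("FileSource/file", "input.csv");
        System.out.println(options.resolve("FileSource", "file", null));
        System.out.println(options.resolve("FileSource", "file", (op, param) -> "override.csv"));
    }
}
```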

The optional exceptionHandler member takes an implementation of the EPDataFlowExceptionHandler interface. The runtime invokes this handler when exceptions occur.

The optional dataFlowInstanceId can be assigned any string value for the purpose of identifying the data flow instance.

The optional dataFlowInstanceUserObject can be assigned any object value for the purpose of associating a user object to the data flow instance.

Set the operatorStatistics flag to true to obtain statistics for operator execution.

Set the cpuStatistics flag to true to obtain CPU statistics for operator execution.

19.4.5. Start Captive

Use the startCaptive method on an EPDataFlowInstance data flow instance when your application requires full control over threading. This method returns an EPDataFlowInstanceCaptive instance that contains a list of java.lang.Runnable instances that represent each source operator.

The special Emitter operator can occur in a data flow. This emitter can be used to inject events into the data flow without writing a new operator. Emitter takes a single name parameter that provides the name of the emitter and that is returned in a map of emitters by EPDataFlowInstanceCaptive.

The example EPL below creates a data flow that uses emitter.

create dataflow HelloWorldDataFlow
  create objectarray schema SampleSchema(text string),	// sample type		
	
  Emitter -> helloworld.stream<SampleSchema> { name: 'myemitter' }
  LogSink(helloworld.stream) {}

Your application may obtain the Emitter instance and send events directly into the output stream. This feature is only supported in conjunction with startCaptive since the runtime does not allocate any threads or run source operators.

The example code snippet below obtains the emitter instance and sends events directly into the data flow instance:

EPDataFlowInstance instance =
      runtime.getDataFlowService().instantiate(deployment.getDeploymentId(), "HelloWorldDataFlow", options);
EPDataFlowInstanceCaptive captiveStart = instance.startCaptive();
Emitter emitter = captiveStart.getEmitters().get("myemitter");
emitter.submit(new Object[] {"this is some text"});

When emitting DOM XML events please emit the root element obtained from document.getDocumentElement().

19.4.6. Data Flow Punctuation With Markers

When your application executes a data flow instance by means of the start (non-blocking) or run (blocking) methods, the data flow instance stays running until either completed or cancelled. While cancellation is always via the cancel method, completion occurs when all source operators provide final markers.

The final marker is an object that implements the EPDataFlowSignalFinalMarker interface. Some operators may also provide or process data window markers which implement the EPDataFlowSignalWindowMarker interface. All such signals implement the EPDataFlowSignal interface.

Some source operators such as EventBusSource and EPStatementSource do not generate final markers as they act continuously.

19.4.7. Exception Handling

All exceptions during the execution of a data flow are logged and reported to the EPDataFlowExceptionHandler instance if one was provided.

If no exception handler is provided or the provided exception handler re-throws or generates a new runtime exception, the source operator handles the exception and completes (ends). When all source operators complete then the data flow instance transitions to complete.

19.5. Examples

The following example is a rolling top words count implemented as a data flow, over a 30 second time window and providing the top 3 words every 2 seconds:

create dataflow RollingTopWords
  create objectarray schema WordEvent (word string),
  
  Emitter -> wordstream<WordEvent> {name:'a'} {} // Produces word stream
  
  Select(wordstream) -> wordcount { // Sliding time window count per word
    select: (select word, count(*) as wordcount 
          from wordstream#time(30) group by word)
  }

  Select(wordcount) -> wordranks { // Rank of words
    select: (select window(*) as rankedWords 
          from wordcount#sort(3, wordcount desc) 
          output snapshot every 2 seconds)
  }
  
  LogSink(wordranks) {}

The next example implements a bargain index computation that separates a mixed trade and quote event stream into a trade and a quote stream, computes a vwap and joins the two streams to compute an index:

create dataflow VWAPSample
  create objectarray schema TradeQuoteType as (type string, ticker string, price double, volume long, askprice double, asksize long),
  
  MyObjectArrayGraphSource -> TradeQuoteStream<TradeQuoteType> {}
  
  Filter(TradeQuoteStream) -> TradeStream {
    filter: type = "trade"
  }
  
  Filter(TradeQuoteStream) -> QuoteStream {
    filter: type = "quote"
  }
  
  Select(TradeStream) -> VwapTrades {
    select: (select ticker, sum(price * volume) / sum(volume) as vwap, 
          min(price) as minprice
          from TradeStream#groupwin(ticker)#length(4) group by ticker)
  }
  
  Select(VwapTrades as T, QuoteStream as Q) -> BargainIndex {
    select: 
      (select case when vwap > askprice then asksize * (Math.exp(vwap - askprice)) else 0.0d end as index
      from T#unique(ticker) as t, Q#lastevent as q
      where t.ticker = q.ticker)
  }
  
  LogSink(BargainIndex) {}
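
The arithmetic of the two select-clauses above can be checked with a small stand-alone sketch. The class and method names below are hypothetical and not part of the Esper API, and the sketch ignores the length-4 window per ticker; it only mirrors the vwap formula and the case-expression of the index:

```java
// Hypothetical stand-alone sketch; BargainIndexSketch is not an Esper class.
final class BargainIndexSketch {

    // Volume-weighted average price: sum(price * volume) / sum(volume).
    static double vwap(double[] prices, long[] volumes) {
        double weighted = 0.0;
        long totalVolume = 0L;
        for (int i = 0; i < prices.length; i++) {
            weighted += prices[i] * volumes[i];
            totalVolume += volumes[i];
        }
        return weighted / totalVolume;
    }

    // Mirrors: case when vwap > askprice then asksize * exp(vwap - askprice) else 0.0d end
    static double bargainIndex(double vwap, double askPrice, long askSize) {
        return vwap > askPrice ? askSize * Math.exp(vwap - askPrice) : 0.0d;
    }

    public static void main(String[] args) {
        // Two sample trades: 100 units at 10.0 and 300 units at 11.0.
        double vwap = vwap(new double[] {10.0, 11.0}, new long[] {100L, 300L});
        System.out.println("vwap = " + vwap); // prints "vwap = 10.75"
        System.out.println("index, ask below vwap = " + bargainIndex(vwap, 9.75, 200L));
        System.out.println("index, ask above vwap = " + bargainIndex(vwap, 11.00, 200L));
    }
}
```

With these sample values the index is positive only when the ask price lies below the vwap, and zero otherwise.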

The final example is a word count data flow, in which three custom operators tokenize, word count and aggregate. The custom operators in this example are discussed next.

create dataflow WordCount
  MyLineFeedSource -> LineOfTextStream {}
  MyTokenizerCounter(LineOfTextStream) -> SingleLineCountStream {}
  MyWordCountAggregator(SingleLineCountStream) -> WordCountStream {}
  LogSink(WordCountStream) {}

19.6. Operator Implementation

Note

Implementing an operator requires the use of extension and internal APIs that are not considered stable and may change between versions.

This section discusses how to implement classes that serve as operators in a data flow. The section employs the example data flow as shown earlier.

This example data flow has operators MyLineFeedSource, MyTokenizerCounter and MyWordCountAggregator that are application provided operators:

create dataflow WordCount
  MyLineFeedSource -> LineOfTextStream {}
  MyTokenizerCounter(LineOfTextStream) -> SingleLineCountStream {}
  MyWordCountAggregator(SingleLineCountStream) -> WordCountStream {}
  LogSink(WordCountStream) {}

Each operator requires implementing the following interfaces:

Implement the DataFlowOperatorForge interface for the compiler to use.
Implement the DataFlowOperatorFactory interface for the runtime to instantiate operator instances.
Implement either the DataFlowOperator interface, the DataFlowOperatorLifecycle or the DataFlowSourceOperator interface.

The compiler must be able to find the class implementing DataFlowOperatorForge. Add the forge package or forge class to imports:

// Sample code imports the forge class by name; adding 'package.*' instead imports the whole package.
Configuration configuration = new Configuration();
configuration.getCommon().addImport(MyLineFeedSourceForge.class.getName());

19.6.1. Sample Operator Acting as Source

Every operator has a forge class that implements the DataFlowOperatorForge interface and is only used at compile-time. The compiler provides the operator parameter expressions to the forge instance and invokes the initializeForge method. When it is time to compile, the compiler generates code by invoking the make method.

// The OutputTypes annotation can be used to specify the type of events
// that are output by the operator.
// If provided, it is not necessary to declare output types in the data flow.
// The event representation is object-array.
@OutputTypes(value = {
        @OutputType(name = "line", typeName = "String")
})

// Provide the DataFlowOpProvideSignal annotation to indicate that
// the source operator provides a final marker.
@DataFlowOpProvideSignal
public class MyLineFeedSourceForge implements DataFlowOperatorForge {

    public DataFlowOpForgeInitializeResult initializeForge(DataFlowOpForgeInitializeContext context) throws ExprValidationException {
        return null;
    }

    public CodegenExpression make(CodegenMethodScope parent, SAIFFInitializeSymbol symbols, CodegenClassScope classScope) {
        return newInstance(MyLineFeedSourceFactory.class);
    }
}

The operator factory class must implement the DataFlowOperatorFactory interface. At deployment time the operator factory initializes using the code generated in the forge make method. Upon instantiating a data flow the factory must return an operator instance.

The implementation for the sample MyLineFeedSourceFactory is:

public class MyLineFeedSourceFactory implements DataFlowOperatorFactory {

    public void initializeFactory(DataFlowOpFactoryInitializeContext context) {
    }

    public DataFlowOperator operator(DataFlowOpInitializeContext context) {
        return new MyLineFeedSource(Collections.emptyIterator());
    }
}

The operator implementation for the sample MyLineFeedSource is:

public class MyLineFeedSource implements DataFlowSourceOperator {

    @DataFlowContext
    private EPDataFlowEmitter dataFlowEmitter;

    private final Iterator<String> lines;

    public MyLineFeedSource(Iterator<String> lines) {
        this.lines = lines;
    }

    public void open(DataFlowOpOpenContext openContext) {
    }

    public void next() {
        if (lines.hasNext()) {
            dataFlowEmitter.submit(new Object[]{lines.next()});
        } else {
            dataFlowEmitter.submitSignal(new EPDataFlowSignalFinalMarker() {
            });
        }
    }

    public void close(DataFlowOpCloseContext closeContext) {
    }
}

19.6.2. Sample Tokenizer Operator

The implementation for the sample MyTokenizerCounter is a forge, factory and operator in one class:

@OutputTypes({
        @OutputType(name = "line", type = int.class),
        @OutputType(name = "wordCount", type = int.class),
        @OutputType(name = "charCount", type = int.class)
})
public class MyTokenizerCounter implements DataFlowOperatorForge, DataFlowOperatorFactory, DataFlowOperator {
    private static final Logger log = LoggerFactory.getLogger(MyTokenizerCounter.class);

    @DataFlowContext
    private EPDataFlowEmitter graphContext;

    public DataFlowOpForgeInitializeResult initializeForge(DataFlowOpForgeInitializeContext context) throws ExprValidationException {
        return null;
    }

    public CodegenExpression make(CodegenMethodScope parent, SAIFFInitializeSymbol symbols, CodegenClassScope classScope) {
        return newInstance(MyTokenizerCounter.class);
    }

    public void initializeFactory(DataFlowOpFactoryInitializeContext context) {
    }

    public DataFlowOperator operator(DataFlowOpInitializeContext context) {
        return new MyTokenizerCounter();
    }

    public void onInput(String line) {
        StringTokenizer tokenizer = new StringTokenizer(line, " \t");
        int wordCount = tokenizer.countTokens();
        int charCount = 0;
        while (tokenizer.hasMoreTokens()) {
            String token = tokenizer.nextToken();
            charCount += token.length();
        }
        log.debug("Submitting stat words[" + wordCount + "] chars[" + charCount + "] for line '" + line + "'");
        graphContext.submit(new Object[]{1, wordCount, charCount});
    }
}
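
To make the submitted row concrete, the counting logic of onInput can be run on its own. The sketch below is a stand-alone copy of that logic with hypothetical names (TokenizerSketch, countLine); it returns the same three values that the operator submits instead of calling the emitter:

```java
import java.util.Arrays;
import java.util.StringTokenizer;

// Hypothetical stand-alone sketch; it copies the counting logic of onInput only.
final class TokenizerSketch {

    // Returns {lineCount, wordCount, charCount} for a single line of text.
    static int[] countLine(String line) {
        StringTokenizer tokenizer = new StringTokenizer(line, " \t");
        int wordCount = tokenizer.countTokens();
        int charCount = 0;
        while (tokenizer.hasMoreTokens()) {
            charCount += tokenizer.nextToken().length();
        }
        return new int[] {1, wordCount, charCount};
    }

    public static void main(String[] args) {
        // Spaces and tabs are delimiters and do not count as characters.
        System.out.println(Arrays.toString(countLine("hello \tbig  world"))); // prints [1, 3, 13]
    }
}
```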

19.6.3. Sample Aggregator Operator

The implementation for the sample MyWordCountAggregator with comments is:

@OutputTypes(value = {
        @OutputType(name = "stats", type = MyWordCountStats.class)
})
public class MyWordCountAggregator implements DataFlowOperatorForge, DataFlowOperatorFactory, DataFlowOperator {
    private static final Logger log = LoggerFactory.getLogger(MyWordCountAggregator.class);

    @DataFlowContext
    private EPDataFlowEmitter graphContext;

    private final MyWordCountStats aggregate = new MyWordCountStats();

    public DataFlowOpForgeInitializeResult initializeForge(DataFlowOpForgeInitializeContext context) throws ExprValidationException {
        return null;
    }

    public CodegenExpression make(CodegenMethodScope parent, SAIFFInitializeSymbol symbols, CodegenClassScope classScope) {
        return newInstance(MyWordCountAggregator.class);
    }

    public void initializeFactory(DataFlowOpFactoryInitializeContext context) {
    }

    public DataFlowOperator operator(DataFlowOpInitializeContext context) {
        return new MyWordCountAggregator();
    }

    public void onInput(int lines, int words, int chars) {
        aggregate.add(lines, words, chars);
        log.debug("Aggregated: " + aggregate);
    }

    public void onSignal(EPDataFlowSignal signal) {
        log.debug("Received punctuation, submitting totals: " + aggregate);
        graphContext.submit(aggregate);
    }
}
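
The MyWordCountStats class is not listed in this section. A minimal sketch of what such a holder could look like is shown below; the field names, getters and toString format are assumptions made for illustration, and only the add(lines, words, chars) call is taken from the operator code above:

```java
// Hypothetical sketch of the statistics holder used by MyWordCountAggregator.
final class MyWordCountStats {
    private int lines;
    private int words;
    private int chars;

    // Accumulates the counts of one tokenized line into the running totals.
    void add(int lines, int words, int chars) {
        this.lines += lines;
        this.words += words;
        this.chars += chars;
    }

    int getLines() { return lines; }
    int getWords() { return words; }
    int getChars() { return chars; }

    @Override
    public String toString() {
        return "lines=" + lines + " words=" + words + " chars=" + chars;
    }

    public static void main(String[] args) {
        MyWordCountStats stats = new MyWordCountStats();
        stats.add(1, 3, 13);
        stats.add(1, 2, 7);
        System.out.println(stats); // prints "lines=2 words=5 chars=20"
    }
}
```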

19.6.4. Passing Operator Parameters

The forge instance receives parameter expressions. A forge can declare parameters like so:

// Expose a parameter named "file" that takes any expression as parameter
@DataFlowOpParameter
private ExprNode file;

// Expose a parameter named "adapterInputSource" that will be an instance of some interface
// Interface implementations as parameters are declared as a Map<String, Object>
@DataFlowOpParameter
private Map<String, Object> adapterInputSource;

// Expose a parameter named "propertyNames" that is an array of string constants
@DataFlowOpParameter
private String[] propertyNames;

The forge class can obtain the output event type if needed. It should also validate the expression parameters and throw ExprValidationException if the parameter expression does not return the expected type. The utility class DataFlowParameterValidation has validate utility methods that return a validated expression. For example:

public DataFlowOpForgeInitializeResult initializeForge(DataFlowOpForgeInitializeContext context) throws ExprValidationException {
  // Obtain the declared output event type
  outputEventType = context.getOutputPorts().get(0).getOptionalDeclaredType() != null ? context.getOutputPorts().get(0).getOptionalDeclaredType().getEventType() : null;
  if (outputEventType == null) {
    throw new ExprValidationException("No event type provided for output, please provide an event type name");
  }

  // validate the "file" parameter expression expected to return a String-typed value
  file = DataFlowParameterValidation.validate("file", file, String.class, context);
  return null;
}

The forge class passes parameters to the factory. We use SAIFFInitializeBuilder that is a builder utility for building the factory. For example:

public CodegenExpression make(CodegenMethodScope parent, SAIFFInitializeSymbol symbols, CodegenClassScope classScope) {
  return new SAIFFInitializeBuilder(FileSourceFactory.class, this.getClass(), "factory", parent, symbols, classScope)
    .exprnode("file", file)
    .constant("propertyNames", propertyNames)
    .map("adapterInputSource", adapterInputSource)
    .build();
}

The factory class must have setter-methods of the same name that receive the parameters:

private ExprEvaluator file;
private String[] propertyNames;
private Map<String, Object> adapterInputSource;

public void setFile(ExprEvaluator file) {
    this.file = file;
}

public void setPropertyNames(String[] propertyNames) {
    this.propertyNames = propertyNames;
}

public void setAdapterInputSource(Map<String, Object> adapterInputSource) {
    this.adapterInputSource = adapterInputSource;
}

The factory class can resolve parameter values by evaluating expressions and by determining whether parameters were passed as options. The DataFlowParameterResolution class provides convenience methods. For example:

public DataFlowOperator operator(DataFlowOpInitializeContext context) {
  String fileName = DataFlowParameterResolution.resolveWithDefault("file", file, null, String.class, context);
  AdapterInputSource adapterInputSourceInstance = DataFlowParameterResolution.resolveOptionalInstance("adapterInputSource", adapterInputSource, AdapterInputSource.class, context);
  return new MyOperator(fileName, adapterInputSourceInstance);
}

Chapter 20. Integration and Extension

20.1. Overview

20.2. Single-Row Function

20.2.1. Implementing a Single-Row Function
20.2.2. Configuring the Single-Row Function Name
20.2.3. Value Cache
20.2.4. Single-Row Functions in Filter Predicate Expressions
20.2.5. Single-Row Functions Taking Events as Parameters
20.2.6. Single-Row Functions Returning Events
20.2.7. Receiving a Context Object
20.2.8. Exception Handling

20.3. Virtual Data Window

20.3.1. How to Use
20.3.2. Implementing the Forge
20.3.3. Implementing the Factory-Factory
20.3.4. Implementing the Factory
20.3.5. Implementing the Virtual Data Window

20.4. Data Window View and Derived-Value View

20.4.1. Implementing a View Forge
20.4.2. Implementing a View Factory
20.4.3. Implementing a View
20.4.4. View Contract
20.4.5. Configuring View Namespace and Name
20.4.6. Requirement for Data Window Views
20.4.7. Requirement for Derived-Value Views

20.5. Aggregation Function

20.5.1. Aggregation Single-Function Development
20.5.2. Aggregation Multi-Function Development

20.6. Pattern Guard

20.6.1. Implementing a Guard Forge
20.6.2. Implementing a Guard Factory
20.6.3. Implementing a Guard Class
20.6.4. Configuring Guard Namespace and Name

20.7. Pattern Observer

20.7.1. Implementing an Observer Forge
20.7.2. Implementing an Observer Factory
20.7.3. Implementing an Observer Class
20.7.4. Configuring Observer Namespace and Name

20.1. Overview

This chapter summarizes integration and describes in detail each of the extension APIs that allow integrating external data and/or extending runtime functionality.

For information on calling external services via instance method invocation, for instance to integrate with dependency injection frameworks such as Spring or Guice, please see Section 5.17.5, “Class and Event-Type Variables”.

For information on input and output adapters that connect to an event transport and perform event transformation for incoming and outgoing on-the-wire event data, for use with streaming data, please see the EsperIO reference documentation. The data flow instances as described in Chapter 19, EPL Reference: Data Flow are an easy way to plug in operators that perform input and output. Data flows allow providing parameters and managing individual flows independent of runtime lifecycle. Also consider using the Plug-in Loader API for creating a new adapter that starts or stops as part of the CEP runtime initialization and destroy lifecycle, see Section 15.15, “Plug-In Loader”.

To join data that resides in a relational database and that is accessible via JDBC driver and SQL statement the runtime offers syntax for using SQL within EPL, see Section 5.13, “Accessing Relational Data via SQL”. A relational database input and output adapter for streaming input from and output to a relational database also exists (EsperIO).

To join data that resides in a non-relational store the runtime offers two means. The first is the virtual data window, as described below, for transparently integrating the external store as a named window. The second is a special join syntax based on static method invocation; see Section 5.14, “Accessing Non-Relational Data via Method, Script or UDF Invocation”.

Tip

The best way to test that your extension code works correctly is to write unit tests against a statement that utilizes the extension code. Samples can be obtained from the Esper regression test code base.

Note

As with listeners and subscribers, extension code that sends events into the runtime should use the routeEvent method (and not sendEvent) to avoid the possibility of stack overflow due to event-callback looping and to ensure correct processing of the current and routed event. Note that if outbound-threading is enabled, listeners and subscribers should use sendEvent and not routeEvent.

Note

For all extension code it is not safe to deploy and undeploy within the extension code. For example, it is not safe to implement a data window that deploys compiled modules and that undeploys deployments.

20.2. Single-Row Function

Single-row functions return a single value. They are not expected to aggregate rows but instead should be stateless functions. These functions can appear in any expressions and can be passed any number of parameters.

The following steps are required to develop and use a custom single-row function.

Implement a class providing one or more public static methods accepting the number and type of parameters as required.
Register the single-row function class and method name with the compiler by supplying a function name.

You may not override a built-in function with a single-row function provided by you. The single-row function you register must have a different name than any of the built-in functions.

An example single-row function can also be found in the examples under the runtime configuration example.

20.2.1. Implementing a Single-Row Function

Single-row function classes have no further requirement than to provide a public static method.

The following sample single-row function simply computes a percentage value based on two number values.

This sample class provides a public static method by name computePercent to return a percentage value:

public class MyUtilityClass {
  public static double computePercent(double amount, double total) {
    return amount / total * 100;
  }
}
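
Because both parameters are of type double, a zero total does not raise an exception in Java; the method returns Infinity or NaN instead, and that value becomes the result of the function. The stand-alone sketch below uses a hypothetical class name (PercentSketch) and repeats the computation only to show these return values:

```java
// Hypothetical stand-alone sketch; PercentSketch is used only for this illustration.
final class PercentSketch {

    // Same computation as MyUtilityClass.computePercent in the sample above.
    static double computePercent(double amount, double total) {
        return amount / total * 100;
    }

    public static void main(String[] args) {
        System.out.println(computePercent(25, 200)); // prints 12.5
        System.out.println(computePercent(5, 0));    // prints Infinity
        System.out.println(computePercent(0, 0));    // prints NaN
    }
}
```

An application that prefers a null or zero result for a zero total would need to handle that case in the method itself.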

20.2.2. Configuring the Single-Row Function Name

The class name, the method name and the function name of the new single-row function must be added to the compiler configuration. The configuration shown below is XML; however, the same options are available through the configuration API:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <compiler>
    <plugin-singlerow-function name="percent" function-class="mycompany.MyUtilityClass" function-method="computePercent" />
  </compiler>
</esper-configuration>

Note that the function name and method name need not be the same.

The new single-row function is now ready to use in a statement:

select percent(fulfilled,total) from MyEvent

When selecting from a single stream, you may also pass wildcard to the single-row function and the function receives the underlying event:

select percent(*) from MyEvent

If the single-row function returns an object that provides further functions, you may chain function calls.

The following demonstrates a chained single-row function. The example assumes that a single-row function by name calculator returns an object that provides the add function which accepts two parameters:

select calculator().add(5, amount) from MyEvent

20.2.3. Value Cache

When a single-row function receives parameters that are all constant values or expressions that themselves receive only constant values, the runtime can pre-evaluate the result of the single-row function at time of statement. By default, the runtime does not pre-evaluate the single-row function unless you configure the value cache as enabled.

The following configuration XML enables the value cache for the single-row function:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <compiler>
    <plugin-singlerow-function name="getDate" 
      function-class="mycompany.DateUtil" function-method="parseDate"
      value-cache="enabled" />
  </compiler>
</esper-configuration>

When the single-row function receives constants as parameters, the runtime computes the result once and returns the cached result for each evaluation:

select getDate('2002-05-30T9:00:00.000') from MyEvent
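
The class mycompany.DateUtil is not listed in this section. The sketch below shows one way such a parseDate method could be written; the class name DateUtilSketch, the LocalDateTime return type and the format pattern are assumptions. The point of the sketch is that the method is deterministic: the same constant text always yields an equal result, which is what makes a cached value a safe substitute for repeated evaluation.

```java
import java.time.LocalDateTime;
import java.time.format.DateTimeFormatter;

// Hypothetical sketch of a date-parsing function suitable for the value cache.
final class DateUtilSketch {

    // A single 'H' accepts a one- or two-digit hour, as in '2002-05-30T9:00:00.000'.
    private static final DateTimeFormatter FORMAT =
            DateTimeFormatter.ofPattern("yyyy-MM-dd'T'H:mm:ss.SSS");

    // Deterministic: depends only on the text passed in.
    static LocalDateTime parseDate(String text) {
        return LocalDateTime.parse(text, FORMAT);
    }

    public static void main(String[] args) {
        LocalDateTime first = parseDate("2002-05-30T9:00:00.000");
        LocalDateTime second = parseDate("2002-05-30T9:00:00.000");
        System.out.println(first.equals(second)); // prints true
    }
}
```

A function that reads the current time or other changing state would not be a good candidate for the value cache.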

20.2.4. Single-Row Functions in Filter Predicate Expressions

Your EPL may use plug-in single row functions among the predicate expressions as part of the filters in a stream or pattern.

For example, the EPL below uses the function computeHash as part of a predicate expression:

select * from MyEvent(computeHash(field) = 100)

When you have many statements or many context partitions that refer to the same function, event type and parameters in a predicate expression, the compiler may optimize evaluation: The function gets evaluated only once per event.

While the optimization is enabled by default for all plug-in single row functions, you can also disable the optimization for a specific single-row function. By disabling the optimization for a single-row function the runtime may use less memory to identify reusable function footprints but may cause the runtime to evaluate each function more frequently than necessary.

The following configuration XML disables the filter optimization for a single-row function (by default it is enabled):

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <compiler>
    <plugin-singlerow-function name="computeHash" 
      function-class="mycompany.HashUtil" function-method="computeHash"
      filter-optimizable="disabled" />
  </compiler>
</esper-configuration>

20.2.5. Single-Row Functions Taking Events as Parameters

EPL allows parameters to a single-row function to be events. In this case, declare the method parameter type to either take EventBean, Collection<EventBean> or the underlying class as a parameter.

Sample method footprints are:

public static double doCompute(EventBean eventBean) {...}
public static boolean doCheck(MyEvent myEvent, String text) {...}
public static String doSearch(Collection<EventBean> events) {...}

To pass the event, specify the stream alias, or wildcard (*) or the tag name when used in a pattern.

The EPL below shows example uses:

select * from MyEvent(doCompute(me) = 100) as me

select * from MyEvent where doCompute(*) = 100

select * from pattern[a=MyEvent -> MyEvent(doCheck(a, 'sometext'))]

select * from MyEvent#time(1 min) having doCompute(last(*))

select * from MyEvent#time(1 min) having doSearch(window(*))

Declare the method parameter as Collection<EventBean> if the method expects an expression result that returns multiple events.

Declare the method parameter as EventBean if the method expects an expression result that returns a single event.

20.2.6. Single-Row Functions Returning Events

A single-row function may return events. Please declare your single-row function method to return Collection<EventBean> or EventBean[] and configure the event type name.

For example, assuming there is a MyItem event type such as created via create schema MyItem(id string):

public static EventBean[] myItemProducer(String string, EPLMethodInvocationContext context) {
  String[] split = string.split(",");
  EventBean[] events = new EventBean[split.length];
  for (int i = 0; i < split.length; i++) {
    events[i] = context.getEventBeanService().adapterForMap(Collections.singletonMap("id", split[i]), "MyItem");
  }
  return events;
}

The sample EPL queries items filtering those items that have a given value for the id field:

select myItemProducer(ordertext).where(v => v.id in ('id1', 'id3')) as c0 from Order

This sample code registers the myItemProducer function as a single-row function with an event type name:

ConfigurationCompilerPlugInSingleRowFunction entry = new ConfigurationCompilerPlugInSingleRowFunction();
entry.setName("myItemProducer");
entry.setFunctionClassName(...);
entry.setFunctionMethodName(...);
entry.setEventTypeName("MyItem");
Configuration configuration = new Configuration();
configuration.getCompiler().addPlugInSingleRowFunction(entry);

If your single row function returns EventBean[] and is used with enumeration methods the configuration must provide an event type name.

20.2.7. Receiving a Context Object

A sample method footprint and EPL are shown below:

public static double computeSomething(double number, EPLMethodInvocationContext context) {...}

select computeSomething(10) from MyEvent

20.2.8. Exception Handling

By default the runtime logs any exceptions thrown by the single row function and returns a null value. To have exceptions be re-thrown instead, which makes exceptions visible to any registered exception handler, please configure as discussed herein.

Set the rethrow-exceptions flag in the XML configuration or the rethrowExceptions flag in the API when registering the single row function to have the runtime re-throw any exceptions that the single row function may throw.

20.3. Virtual Data Window

Use a virtual data window if you have a (large) external data store that you want to access as a named window. The access is transparent: There is no need to use special syntax or join syntax. All regular queries including subqueries, joins, on-merge, on-select, on-insert, on-delete, on-update and fire-and-forget are supported with virtual data windows.

There is no need to keep any data or events in memory with virtual data windows. The only requirement for virtual data windows is that all data rows returned are EventBean instances.

When implementing a virtual data window it is not necessary to send any events into the runtime or to use insert-into. The event content is simply assumed to exist and be accessible to the runtime via the API implementation you provide.

The distribution ships with a sample virtual data window in the examples folder under the name virtualdw. The code snippets below are extracts from the example.

We use the term store here to mean a source set of data that is managed by the virtual data window. We use the term store row or just row to mean a single data item provided by the store. We use the term lookup to mean a read operation against the store returning zero, one or many rows.

Virtual data windows allow high-performance low-latency lookup by exposing all relevant statement access path information. This makes it possible for the virtual data window to choose the desired access method into its store.

The following steps are required to develop and use a virtual data window:

Implement the interface com.espertech.esper.common.client.hook.vdw.VirtualDataWindowForge. This class is used by the compiler.
Implement the interface com.espertech.esper.common.client.hook.vdw.VirtualDataWindowFactoryFactory. This class is referred to, by class name, by the compiler. It is used at runtime.
Implement the interface com.espertech.esper.common.client.hook.vdw.VirtualDataWindowFactory (used at runtime only).
Implement the interface com.espertech.esper.common.client.hook.vdw.VirtualDataWindow (used at runtime only).
Implement the interface com.espertech.esper.common.client.hook.vdw.VirtualDataWindowLookup (used at runtime only).
Register the factory class in the configuration.

Once you have completed the above steps, the virtual data window is ready to use in statements.

From a threading perspective, virtual data window implementation classes must be thread-safe if objects are shared between multiple named windows. If no objects are shared between different named windows, so that each object is only used for one named window and other named windows receive a separate instance, it is not necessary that the implementation classes are thread-safe.

20.3.1. How to Use

Your application must first register the virtual data window factory as part of configuration:

Configuration config = new Configuration();
config.getCompiler().addPlugInVirtualDataWindow("sample", "samplevdw", 
    SampleVirtualDataWindowForge.class.getName());

Your application may then create a named window backed by a virtual data window.

For example, assume that the SampleEvent event type is declared as follows:

create schema SampleEvent as (key1 string, key2 string, value1 int, value2 double)

The next statement creates a named window MySampleWindow that provides SampleEvent events and is backed by a virtual data window:

create window MySampleWindow.sample:samplevdw() as SampleEvent

You may then access the named window, same as any other named window, for example by subquery, join, on-action, fire-and-forget query or by consuming its insert and remove stream. While this example uses Map-type events, the example code is the same for POJO or other events.

Your application may obtain a reference to the virtual data window from the runtime context.

This code snippet looks up the virtual data window by the named window name:

try {
  return (VirtualDataWindow) runtime.getContext().lookup("/virtualdw/MySampleWindow");
}
catch (NamingException e) {
  throw new RuntimeException("Failed to look up virtual data window, is it created yet?", e);
}

20.3.1.1. Query Access Path

When your application registers a subquery, join or on-action query or executes a fire-and-forget query against a virtual data window the runtime interacts with the virtual data window. The interaction is a two-step process.

At time of deployment (once), the runtime uses the information the compiler collected by analyzing the EPL where-clause, if present. It then creates a list of hash-index and binary tree (btree, i.e. sorted) index properties. It passes the property names that are queried as well as the operators (i.e. =, >, range etc.) to the virtual data window. The virtual data window returns a lookup strategy object to the runtime.

At time of statement execution (repeatedly as triggered), the runtime uses that lookup strategy object to execute a lookup. It passes to the lookup all actual key values (hash, btree including ranges) to make fast and efficient lookup achievable.

To explain in detail, assume that your application creates a statement with a subquery as follows:

select (select * from MySampleWindow where key1 = 'A1') from OtherEvent

At the time of compilation of the statement above the compiler analyzes the statement. It determines that the subquery queries a virtual data window. It determines from the where-clause that the lookup uses property key1 and hash-equals semantics. The runtime then provides this information as part of VirtualDataWindowLookupContext passed to the getLookup method. Your application may inspect hash and btree properties and may determine the appropriate store access method to use.

The hash and btree property lookup information is for informational purposes, to enable fast and performant queries that return the smallest number of rows possible. Your implementation classes may use some or none of the information provided and may also instead return some or perhaps even all rows, as is practical to your implementation. The where-clause still remains in effect and gets evaluated on all rows that are returned by the lookup strategy.

Following the above example, the sub-query executes once when an OtherEvent event arrives. At time of execution the runtime delivers the string value A1 to the VirtualDataWindowLookup lookup implementation provided by your application. The lookup object queries the store and returns store rows as EventBean instances.

As a second example, consider an EPL join statement as follows:

select * from MySampleWindow, MyTriggerEvent where key1 = trigger1 and key2 = trigger2

The compiler analyzes the statement and the runtime passes to the virtual data window the information that the lookup occurs on properties key1 and key2 under hash-equals semantics. When a MyTriggerEvent arrives, the runtime passes the actual value of the trigger1 and trigger2 properties of the current MyTriggerEvent to the lookup.

As a last example, consider a fire-and-forget query as follows:

select * from MySampleWindow where key1 = 'A2' and value1 between 0 and 1000

The compiler analyzes the statement and the runtime passes to the virtual data window the lookup information. The lookup occurs on property key1 under hash-equals semantics and on property value1 under btree-open-range semantics. When your application executes the fire-and-forget query, the runtime passes A2 and the range endpoints 0 and 1000 to the lookup.
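
A store that supports both a hash key and a sorted range could be modeled as a hash map of sorted maps. The sketch below is plain Java and does not use Esper classes; HashAndRangeIndexSketch is a hypothetical name, and treating both range endpoints as inclusive is an assumption of this sketch. Since the where-clause is still evaluated on all returned rows, returning a slightly larger set than strictly required is harmless.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.NavigableMap;
import java.util.TreeMap;

// Hypothetical store combining a hash index on key1 with a sorted index on value1.
class HashAndRangeIndexSketch {
    private final Map<String, NavigableMap<Integer, List<String>>> index = new HashMap<>();

    void put(String key1, int value1, String rowId) {
        index.computeIfAbsent(key1, k -> new TreeMap<>())
             .computeIfAbsent(value1, v -> new ArrayList<>())
             .add(rowId);
    }

    // Assumption: both endpoints are included; rows come back in value1 order.
    List<String> lookup(String key1, int low, int high) {
        List<String> result = new ArrayList<>();
        NavigableMap<Integer, List<String>> sorted = index.get(key1);
        if (sorted == null || low > high) {
            return result;   // this sketch returns no rows for an unknown key or a reversed range
        }
        for (List<String> rows : sorted.subMap(low, true, high, true).values()) {
            result.addAll(rows);
        }
        return result;
    }
}
```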

For more information, please consult the JavaDoc API documentation for class VirtualDataWindow, VirtualDataWindowLookupContext or VirtualDataWindowLookupFieldDesc.

20.3.2. Implementing the Forge

For each named window that refers to the virtual data window, the compiler instantiates one instance of the forge at compile-time.

A virtual data window forge class is responsible for the following functions:

Implement the initialize method that accepts a virtual data window forge context object as a parameter.
Implement the getFactoryMode method that returns information on how to initialize the factory-factory class (the class that acts as a factory for virtual data window factories).
Implement the getUniqueKeyPropertyNames method that can return the set of property names that are unique keys, for the purpose of query planning.

The compiler instantiates a VirtualDataWindowForge instance for each named window created by create window. The compiler invokes the initialize method once in respect to the named window being created passing a VirtualDataWindowForgeContext context object.

The sample code shown here can be found among the examples in the distribution under virtualdw:

public class SampleVirtualDataWindowForge implements VirtualDataWindowForge {

    public void initialize(VirtualDataWindowForgeContext initializeContext) {
    }

    public VirtualDataWindowFactoryMode getFactoryMode() {
        // The injection strategy defines how to obtain and configure the factory-factory.
        InjectionStrategy injectionStrategy = new InjectionStrategyClassNewInstance(SampleVirtualDataWindowFactoryFactory.class);
        
        // The managed-mode is the default. It uses the provided injection strategy.
        VirtualDataWindowFactoryModeManaged managed = new VirtualDataWindowFactoryModeManaged();
        managed.setInjectionStrategyFactoryFactory(injectionStrategy);
        
        return managed;
    }

    public Set<String> getUniqueKeyPropertyNames() {
        // let's assume there are no unique key property names
        return null;
    }
}

Your forge class must implement the getFactoryMode method which instructs the compiler how to obtain a factory class that returns a factory for creating virtual data window instances (a factory-factory). The class acting as the factory-factory will be SampleVirtualDataWindowFactoryFactory.

20.3.3. Implementing the Factory-Factory

At deployment time, the runtime instantiates the factory-factory and obtains a factory for virtual data windows.

A virtual data window factory-factory class is responsible for the following functions:

Implement the createFactory method that accepts a factory-factory context and that returns the virtual data window factory.

The sample code shown here can be found among the examples in the distribution under virtualdw:

public class SampleVirtualDataWindowFactoryFactory implements VirtualDataWindowFactoryFactory {

    public VirtualDataWindowFactory createFactory(VirtualDataWindowFactoryFactoryContext ctx) {
        return new SampleVirtualDataWindowFactory();
    }
}

20.3.4. Implementing the Factory

For each named window that refers to the virtual data window, the runtime instantiates one instance of the factory.

A virtual data window factory class is responsible for the following functions:

Implement the initialize method that accepts a virtual data window factory context object as a parameter.
Implement the create method that accepts a virtual data window context object as a parameter and returns a VirtualDataWindow implementation.
Implement the destroy method that gets called once when the named window is undeployed.

The runtime instantiates a VirtualDataWindowFactory instance for each named window created via create window. The runtime invokes the initialize method once in respect to the named window being created passing a VirtualDataWindowFactoryContext context object.

If not using contexts, the runtime calls the create method once after calling the initialize method. If using contexts, the runtime calls the create method every time it allocates a context partition. If using contexts and your virtual data window implementation operates thread-safe, you may return the same virtual data window implementation object for each context partition. If using contexts and your implementation object is not thread safe, return a separate thread-safe implementation object for each context partition.

The runtime invokes the destroy method once when the named window is undeployed. If not using contexts, the runtime calls the destroy method of the virtual data window implementation object before calling the destroy method on the factory object. If using contexts, the runtime calls the destroy method on each instance associated with a context partition at the time the associated context partition terminates.
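
The choice between one shared instance and one instance per context partition can be modeled in plain Java as follows. This is a sketch only and not the Esper API: WindowSketch and WindowFactorySketch are hypothetical classes, and the integer partition id stands in for whatever identifies a context partition.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a virtual data window implementation; not an Esper class.
class WindowSketch {
    final int partitionId;
    int destroyCalls;

    WindowSketch(int partitionId) {
        this.partitionId = partitionId;
    }

    void destroy() {
        destroyCalls++;
    }
}

// Hypothetical factory that mirrors the initialize/create/destroy life cycle.
class WindowFactorySketch {
    private final boolean shareInstance;   // sensible only if the window object is thread-safe
    private WindowSketch shared;
    final List<String> log = new ArrayList<>();

    WindowFactorySketch(boolean shareInstance) {
        this.shareInstance = shareInstance;
    }

    void initialize() {
        log.add("initialize");
    }

    // Called once without contexts, or once per allocated context partition.
    WindowSketch create(int partitionId) {
        log.add("create:" + partitionId);
        if (!shareInstance) {
            return new WindowSketch(partitionId);
        }
        if (shared == null) {
            shared = new WindowSketch(partitionId);
        }
        return shared;
    }

    void destroy() {
        log.add("destroy");
    }
}
```

A shared instance would presumably see one destroy call per terminating context partition, so a shared implementation should tolerate repeated destroy calls.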

The sample code shown here can be found among the examples in the distribution under virtualdw:

public class SampleVirtualDataWindowFactory implements VirtualDataWindowFactory {

    public void initialize(VirtualDataWindowFactoryContext factoryContext) {
    }

    public VirtualDataWindow create(VirtualDataWindowContext context) {
        return new SampleVirtualDataWindow(context);
    }

    public void destroy() {
        // cleanup can be performed here
    }

    public Set<String> getUniqueKeyPropertyNames() {
        // let's assume there are no unique key property names
        return null;
    }
}

Your factory class must implement the create method which receives a VirtualDataWindowContext object. This method is called once for each EPL that creates a virtual data window (see example create window above).

The VirtualDataWindowContext provides to your application:

String namedWindowName;  // Name of named window being created.
Object[] parameters;  // Any optional parameters provided as part of create-window.
EventType eventType;  // The event type of events.
EventBeanFactory eventFactory;  // A factory for creating EventBean instances from store rows.
VirtualDataWindowOutStream outputStream;  // For stream output to consuming statements.
AgentInstanceContext agentInstanceContext;  // Other statement information in statement context.

When using contexts you can decide whether your factory returns a new virtual data window for each context partition or returns the same virtual data window instance for all context partitions. Your extension code may refer to the named window name to identify the named window and may refer to the agent instance context that holds the agent instance id which is the id of the context partition.

20.3.5. Implementing the Virtual Data Window

A virtual data window implementation is responsible for the following functions:

Accept the lookup context object as a parameter and return the VirtualDataWindowLookup implementation.
Optionally, post insert and remove stream data.
Implement the destroy method, which the runtime calls for each context partition when the named window is stopped or destroyed, or once when a context partition is ended/terminated.

The sample code shown here can be found among the examples in the distribution under virtualdw.

The implementation class must implement the VirtualDataWindow interface like so:

public class SampleVirtualDataWindow implements VirtualDataWindow {

  private final VirtualDataWindowContext context;
  
  public SampleVirtualDataWindow(VirtualDataWindowContext context) {
    this.context = context;
  } ...

When the compiler compiles a statement and detects a virtual data window, the compiler compiles access path information and the runtime invokes the getLookup method indicating hash and btree access path information by passing a VirtualDataWindowLookupContext context. The lookup method must return a VirtualDataWindowLookup implementation that the statement uses for all lookups until the statement is stopped or destroyed.

The sample implementation does not use the hash and btree access path information and simply returns a lookup object:

public VirtualDataWindowLookup getLookup(VirtualDataWindowLookupContext desc) {

  // Place any code that interrogates the hash-index and btree-index fields here.

  // Return the lookup strategy.
  return new SampleVirtualDataWindowLookup(context);
}

The runtime calls the update method when data changes because of on-merge, on-delete, on-update or insert-into. For example, if you have an on-merge statement that is triggered and that updates the virtual data window, the newData parameter receives the new (updated) event and the oldData parameter receives the event prior to the update. Your code may use these events to update the store or delete from the store, if needed.
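
How an update callback might keep an external store in sync can be sketched in plain Java. The classes RowSketch and StoreSyncSketch are hypothetical and do not use EventBean or any other Esper type; the store is modeled as a simple map from key to value.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical event stand-in with a key and a value; not an Esper EventBean.
class RowSketch {
    final String key;
    final int value;

    RowSketch(String key, int value) {
        this.key = key;
        this.value = value;
    }
}

// Hypothetical external store kept in sync from an update(newData, oldData) callback.
class StoreSyncSketch {
    final Map<String, Integer> store = new HashMap<>();

    void update(RowSketch[] newData, RowSketch[] oldData) {
        // Remove prior versions first so that an update (old and new row for the
        // same key) ends with the new value in the store.
        if (oldData != null) {
            for (RowSketch old : oldData) {
                store.remove(old.key);
            }
        }
        if (newData != null) {
            for (RowSketch row : newData) {
                store.put(row.key, row.value);
            }
        }
    }
}
```

Processing oldData before newData means an insert, an update and a delete all go through the same code path.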

If your application plans to consume data from the virtual data window, for example via select * from MySampleWindow, then the code must implement the update method to forward insert and remove stream events, as shown below, to receive the events in consuming statements. To post insert and remove stream data, use the VirtualDataWindowOutStream provided by the context object as follows.

public void update(EventBean[] newData, EventBean[] oldData) {
  // This sample simply posts into the insert and remove stream what is received.
  context.getOutputStream().update(newData, oldData);
}

Your application should not use VirtualDataWindowOutStream to post new events that originate from the store. The object is intended for use with on-action statements. Use insert-into instead for any new events that originate from the store.

20.4. Data Window View and Derived-Value View

Views in EPL are used to derive information from an event stream, and to represent data windows onto an event stream. This chapter describes how to plug in a new, custom view.

The following steps are required to develop and use a custom view.

Implement a view forge class. View forges are compile-time classes that accept and check view parameters and refer to the appropriate view factory for the runtime.
Implement a view factory class. View factories are classes that instantiate the appropriate view class at runtime.
Implement a view class. A view class commonly represents a data window or derives new information from a stream at runtime.
Configure the view forge class supplying a view namespace and name in the compiler configuration.

The example view factory and view class that are used in this chapter can be found in the examples source folder in the OHLC (open-high-low-close) example. The class names are OHLCBarPlugInViewForge, OHLCBarPlugInViewFactory and OHLCBarPlugInView.

Views can make use of the runtime services available via StatementContext, for example:

The SchedulingService interface allows views to schedule timer callbacks to a view.

Section 20.4.4, “View Contract” outlines the requirements for correct behavior of your custom view within the runtime.

Note that custom views may use runtime services and APIs that can be subject to change between major releases. The runtime services discussed above and view APIs are considered part of the runtime internal API and offer only limited stability. Please also consider contributing your custom view to the project by submitting the view code.

20.4.1. Implementing a View Forge

A view forge class is a compile-time class and is responsible for the following functions:

Accept zero, one or more view parameters. View parameters are themselves expressions. The view forge must validate the expressions.
Build the view factory class. At deployment-time this code executes and builds the view factory.
Provide information about the event type of events posted by the view.

View forge classes must implement the ViewFactoryForge interface. Additionally a view forge class must implement the DataWindowViewForge interface if the view is a data window (retains events provided to it).

public class OHLCBarPlugInViewForge implements ViewFactoryForge { ...

Your view forge class must implement the setViewParameters method to accept view parameters and the attach method to attach the view to a stream:

public class OHLCBarPlugInViewForge implements ViewFactoryForge {
    private List<ExprNode> viewParameters;
    private ExprNode timestampExpression;
    private ExprNode valueExpression;
    private EventType eventType;

    public void setViewParameters(List<ExprNode> parameters, ViewForgeEnv viewForgeEnv, int streamNumber) throws ViewParameterException {
        this.viewParameters = parameters;
    }

    public void attach(EventType parentEventType, int streamNumber, ViewForgeEnv env) throws ViewParameterException {
        if (viewParameters.size() != 2) {
            throw new ViewParameterException("View requires two parameters: the expression returning timestamps and the expression supplying OHLC data points");
        }
        ExprNode[] validatedNodes = ViewForgeSupport.validate("OHLC view", parentEventType, viewParameters, false, env, streamNumber);

        timestampExpression = validatedNodes[0];
        valueExpression = validatedNodes[1];

        if ((timestampExpression.getForge().getEvaluationType() != long.class) && (timestampExpression.getForge().getEvaluationType() != Long.class)) {
            throw new ViewParameterException("View requires long-typed timestamp values in parameter 1");
        }
        if ((valueExpression.getForge().getEvaluationType() != double.class) && (valueExpression.getForge().getEvaluationType() != Double.class)) {
            throw new ViewParameterException("View requires double-typed values in parameter 2");
        }
        ....

After the compiler has supplied view parameters to the forge, the compiler asks the view to attach to its parent and validate any parameter expressions against the parent view's event type. If the view will be generating events of a different type than the events generated by the parent view, then the view factory can allocate the new event type.

Finally, the compiler asks the view forge to generate code that initializes the view factory:

public CodegenExpression make(CodegenMethodScope parent, SAIFFInitializeSymbol symbols, CodegenClassScope classScope) {
    return new SAIFFInitializeBuilder(OHLCBarPlugInViewFactory.class, this.getClass(), "factory", parent, symbols, classScope)
                .exprnode("timestampExpression", timestampExpression)
                .exprnode("valueExpression", valueExpression)
                .build();
}

Use the internal SAIFFInitializeBuilder to build your view factory providing it the expressions and other values it needs.

20.4.2. Implementing a View Factory

A view factory class is responsible for the following functions:

Implement initialization code when required.
Instantiate the actual view class.
Provide information about the event type of events posted by the view.

View factory classes implement the ViewFactory interface. Additionally a view factory class must implement the DataWindowViewFactory interface if the view is a data window (retains events provided to it).

public class OHLCBarPlugInViewFactory implements ViewFactory { ...

The runtime initializes a view factory by calling its init method.

The runtime asks the view factory to create a view instance, and asks for the type of event generated by the view:

public View makeView(AgentInstanceViewFactoryChainContext agentInstanceViewFactoryContext) {
    return new OHLCBarPlugInView(this, agentInstanceViewFactoryContext);
}

public EventType getEventType() {
    return eventType;
}

20.4.3. Implementing a View

A view class is responsible for:

The update method receives insert stream and remove stream events from its parent view
The iterator method supplies an (optional) iterator to allow an application to pull or request results from an EPStatement

View classes subclass ViewSupport. Additionally a view class must implement the DataWindowView interface if the view is a data window (retains events provided to it).

public class OHLCBarPlugInView extends ViewSupport { ...

Your view's update method will be processing incoming (insert stream) and outgoing (remove stream) events posted by the parent view (if any), as well as providing incoming and outgoing events to child views. The convention required of your update method implementation is that the view releases any insert stream events (EventBean object references) which the view generates as reference-equal remove stream events (EventBean object references) at a later time.

The view implementation must call child.update(...) to post outgoing insert and remove stream events. Similar to the update method, the child.update takes insert and remove stream events as parameters.

A sample update method implementation is provided in the OHLC example.
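
For readers without the distribution at hand, the kind of derived value that an open-high-low-close view computes can be sketched in plain Java. This is not the code of the OHLC example: BarSketch and OhlcSketch are hypothetical classes, bars are assumed to span one minute, and a bar is completed here when a value for a later minute arrives rather than by a timer callback.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical bar value object; the example in the distribution defines its own.
class BarSketch {
    final long minute;
    final double open;
    double high;
    double low;
    double close;

    BarSketch(long minute, double first) {
        this.minute = minute;
        this.open = first;
        this.high = first;
        this.low = first;
        this.close = first;
    }
}

// Hypothetical derived-value computation: one open-high-low-close bar per minute.
class OhlcSketch {
    private BarSketch current;
    final List<BarSketch> completed = new ArrayList<>();

    // Models the insert-stream side of update(): one (timestamp, value) pair per event.
    void update(long timestampMillis, double value) {
        long minute = timestampMillis / 60_000L;
        if (current != null && minute != current.minute) {
            completed.add(current);   // a real view would post this bar to its child view
            current = null;
        }
        if (current == null) {
            current = new BarSketch(minute, value);
            return;
        }
        current.high = Math.max(current.high, value);
        current.low = Math.min(current.low, value);
        current.close = value;
    }
}
```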

20.4.4. View Contract

The update method must adhere to the following conventions, to prevent memory leaks and to enable correct behavior within the runtime:

A view implementation that posts events to the insert stream must post unique EventBean object references as insert stream events, and cannot post the same EventBean object reference multiple times. The underlying event to the EventBean object reference can be the same object reference, however the EventBean object reference posted by the view into the insert stream must be a new instance for each insert stream event.
If the custom view posts a continuous insert stream, then the view must also post a continuous remove stream (second parameter to the child.update method). If the view does not post remove stream events, it assumes unbound keep-all semantics.
EventBean events posted as remove stream events must be the same object reference as the EventBean events posted as insert stream by the view. Thus remove stream events posted by the view must be reference-equal to insert stream events posted by the view as part of an earlier invocation of the update method, or the same invocation of the update method. This requirement concerns the EventBean instances and does not affect the underlying representation.
EventBean events represent a unique observation. The values of the observation can be the same, thus the underlying representation of an EventBean event can be reused, however event property values must be kept immutable and not be subject to change.
Array elements of the insert and remove stream events must not carry null values. Array size must match the number of EventBean instances posted. It is recommended to use a null value for no insert or remove stream events rather than an empty zero-size array.
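
The conventions listed above can be seen together in a small plain-Java model of a length window. This is a sketch that does not use Esper types: BeanSketch stands in for EventBean, ChildSketch stands in for the child view, and LengthWindowSketch is a hypothetical view written only for this illustration.

```java
import java.util.ArrayDeque;
import java.util.Deque;

// Hypothetical wrapper standing in for EventBean: each instance is one observation.
class BeanSketch {
    final Object underlying;

    BeanSketch(Object underlying) {
        this.underlying = underlying;
    }
}

// Hypothetical child that records what was posted to it.
class ChildSketch {
    BeanSketch[] lastNew;
    BeanSketch[] lastOld;

    void update(BeanSketch[] newData, BeanSketch[] oldData) {
        lastNew = newData;
        lastOld = oldData;
    }
}

// Hypothetical length window that follows the conventions listed above.
class LengthWindowSketch {
    private final int size;
    private final Deque<BeanSketch> window = new ArrayDeque<>();
    private final ChildSketch child;

    LengthWindowSketch(int size, ChildSketch child) {
        this.size = size;
        this.child = child;
    }

    void update(Object underlying) {
        // A new wrapper per insert stream event, even if the underlying object repeats.
        BeanSketch inserted = new BeanSketch(underlying);
        window.addLast(inserted);

        // The expired event is the very same wrapper instance that was posted earlier.
        BeanSketch[] removed = null;   // null, not an empty array, when nothing expires
        if (window.size() > size) {
            removed = new BeanSketch[] {window.removeFirst()};
        }
        child.update(new BeanSketch[] {inserted}, removed);
    }
}
```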

Your view implementation must implement the AgentInstanceStopCallback interface to receive a callback when the view gets destroyed.

Please refer to the sample views for a code sample on how to implement the iterator method.

In terms of multiple threads accessing view state, there is no need for your custom view factory or view implementation to perform any synchronization to protect internal state. The iterator of the custom view implementation also does not need to be thread-safe. The runtime ensures the custom view executes in the context of a single thread at a time. If your view uses shared external state, such external state must still be considered for synchronization when using multiple threads.

20.4.5. Configuring View Namespace and Name

The view forge class name as well as the view namespace and name for the new view must be added to the compiler configuration. The configuration shown below is XML however the same options are available through the configuration API:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <compiler>
    <plugin-view namespace="custom" name="ohlc" 
        forge-class="com.espertech.esper.example.ohlc.OHLCBarPlugInViewForge" />
  </compiler>
</esper-configuration>

The new view is now ready to use in a statement:

select * from StockTick.custom:ohlc(timestamp, price)

Note that the view must implement additional interfaces if it acts as a data window view, or works in a grouping context, as discussed in detail below.

20.4.6. Requirement for Data Window Views

Your custom view may represent an expiry policy and may retain events and thus act as a data window view. In order to allow the compiler to validate that your view can be used with named windows, which allow only data window views, this section documents any additional requirement that your classes must fulfill.

Your view forge class must implement the DataWindowViewForge interface. This marker interface (no methods required) indicates that your views are data window views.

Your view factory class must implement the DataWindowViewFactory interface. This marker interface (no methods required) indicates that your views are data window views.

Your view class must implement the DataWindowView interface. This interface indicates that your view is a data window view and therefore eligible to be used in any construct that requires a data window view. The DataWindowView interface extends the ViewDataVisitable interface. Please provide an empty implementation method for the visitView method as required by ViewDataVisitable (the default behavior is sufficient).

20.4.7. Requirement for Derived-Value Views

Your custom view may compute derived information from the arriving stream, instead of retaining events, and thus act as a derived-value view.

Your view class should implement the DerivedValueView interface. This marker interface indicates that your view is a derived-value view, affecting correct behavior of the view when used in joins.

20.5. Aggregation Function

Aggregation functions are stateful functions that aggregate events, event property values or expression results. Examples of built-in aggregation functions are count(*), sum(price * volume), window(*) or maxby(volume).

EPL allows two different ways for your application to provide aggregation functions. We use the name aggregation single-function and aggregation multi-function for the two independent extension APIs for aggregation functions.

The aggregation single-function API is simple to use however it imposes certain restrictions on how expressions that contain aggregation functions share state and how they are evaluated.

The aggregation multi-function API is more powerful and provides control over how expressions that contain aggregation functions share state and are evaluated.

The next table compares the two aggregation function extension APIs:

Table 20.1. Aggregation Function Extension APIs

	Single-Function	Multi-Function
Return Value	Can only return a single value or object. Cannot return an EventBean event, collection of EventBean events or collection or array of values for use with enumeration methods, for example.	Can return an EventBean event, a collection of EventBean events or a collection or array of objects for use with enumeration methods or to access event properties.
Complexity of API	Simple (consists of 2 interfaces).	More complex (consists of 6 interfaces).
State Sharing	State and parameter evaluation shared if multiple aggregation functions of the same name in the same statement (and context partition) take the exact same parameter expressions.	State and parameter evaluation sharable when multiple aggregation functions of a related name (related thru configuration) for the same statement (and context partition) exist, according to a sharing-key provided by your API implementation.
Function Name	Each aggregation function expression receives its own factory object.	Multiple related aggregation function expressions share a single factory object.
Distinct Keyword	Handled by the runtime transparently depending on mode.	Indicated to the API implementation only.

The following sections discuss developing an aggregation single-function first, followed by the subject of developing an aggregation multi-function.

Note

The aggregation multi-function API is a powerful and lower-level API to extend the runtime. Any classes that are not part of the client package should be considered unstable and are subject to change between minor and major releases.

20.5.1. Aggregation Single-Function Development

This section describes the aggregation single-function extension API for providing aggregation functions.

The following steps are required to develop and use a custom aggregation single-function.

Implement an aggregation function forge by implementing the interface com.espertech.esper.common.client.hook.aggfunc.AggregationFunctionForge. This class provides compile-time information.
Implement an aggregation function factory by implementing the interface com.espertech.esper.common.client.hook.aggfunc.AggregationFunctionFactory (used at runtime).
Implement an aggregation function by implementing the interface com.espertech.esper.common.client.hook.aggfunc.AggregationFunction (used at runtime).
Register the aggregation single-function forge class with the compiler by supplying a function name, via the compiler configuration.

Custom aggregation functions can also be passed multiple parameters, as further described in Section 20.5.1.5, “Aggregation Single-Function: Accepting Multiple Parameters”. In the example below the aggregation function accepts a single parameter.

The code for the example aggregation function as shown in this chapter can be found in the runtime configuration example in the package com.espertech.esper.example.runtimeconfig by the name MyConcatAggregationFunction. The sample function simply concatenates string-type values.

20.5.1.1. Implementing an Aggregation Single-Function Forge

An aggregation function forge class is only used at compile-time and is responsible for the following functions:

Implement a setFunctionName method that receives the function name.
Implement a validate method that validates the value type of the data points that the function must process.
Implement a getValueType method that returns the type of the aggregation value generated by the aggregation function instances. For example, the built-in count aggregation function returns Long.class as it generates long-typed values.
Implement a getAggregationFunctionMode method that provides information about the factory class to the compiler.

Aggregation forge classes implement the interface AggregationFunctionForge:

public class MyConcatAggregationFunctionForge implements AggregationFunctionForge { ...

The compiler constructs one instance of the aggregation function forge class for each time the function is listed in a statement, however the compiler may decide to reduce the number of aggregation forge instances if it finds equivalent aggregations.

The aggregation function forge instance receives the aggregation function name via the setFunctionName method.

The sample concatenation function forge provides an empty setFunctionName method:

public void setFunctionName(String functionName) {
  // no action taken
}

An aggregation function forge must provide an implementation of the validate method that is passed an AggregationFunctionValidationContext validation context object. Within the validation context you find the result type of each of the parameter expressions to the aggregation function as well as information about constant values and data window use. Please see the JavaDoc API documentation for a comprehensive list of validation context information.

Since the example concatenation function requires string types it implements a type check:

public void validate(AggregationFunctionValidationContext validationContext) {
  if ((validationContext.getParameterTypes().length != 1) ||
    (validationContext.getParameterTypes()[0] != String.class)) {
    throw new IllegalArgumentException("Concat aggregation requires a single parameter of type String");
  }
}

In order for the compiler to validate the type returned by the aggregation function against the types expected by enclosing expressions, the getValueType method must return the result type of any values produced by the aggregation function:

public Class getValueType() {
  return String.class;
}

Finally the forge implementation must provide a getAggregationFunctionMode method that returns information about the factory. The compiler uses this information to build the aggregation function factory.

public AggregationFunctionMode getAggregationFunctionMode() {
    // Inject a factory by using "new"
    InjectionStrategy injectionStrategy = new InjectionStrategyClassNewInstance(MyConcatAggregationFunctionFactory.class);
    
    // The managed mode means there is no need to write code that generates code
    AggregationFunctionModeManaged mode = new AggregationFunctionModeManaged();
    mode.setInjectionStrategyAggregationFunctionFactory(injectionStrategy);
        
    return mode;
}

20.5.1.2. Implementing an Aggregation Single-Function Factory

An aggregation function factory class is responsible for the following functions:

Implement a newAggregator method that instantiates and returns an aggregation function instance.

Aggregation function factory classes implement the interface AggregationFunctionFactory:

public class MyConcatAggregationFunctionFactory implements AggregationFunctionFactory { ...

The runtime constructs the aggregation function factory at time of deployment.

The factory must provide a newAggregator method that returns instances of AggregationFunction. The runtime invokes this method for each new aggregation state to be allocated.

public AggregationFunction newAggregator() {
  return new MyConcatAggregationFunction();
}
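
The relationship between a factory and the aggregation states it allocates can be modeled in plain Java. The sketch below does not implement the Esper interfaces: CountSketch and PerGroupStateSketch are hypothetical classes, and the idea of one aggregation state per group key is an assumption chosen to give "each new aggregation state" a concrete meaning.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.Supplier;

// Hypothetical aggregator that counts data points; not the Esper AggregationFunction interface.
class CountSketch {
    long count;

    void enter(Object value) {
        count++;
    }
}

// Hypothetical holder that asks a factory for a fresh aggregator per group key.
class PerGroupStateSketch {
    private final Supplier<CountSketch> factory;
    private final Map<Object, CountSketch> states = new HashMap<>();
    int allocations;

    PerGroupStateSketch(Supplier<CountSketch> factory) {
        this.factory = factory;
    }

    void enter(Object groupKey, Object value) {
        CountSketch state = states.get(groupKey);
        if (state == null) {
            state = factory.get();   // plays the role of newAggregator()
            allocations++;
            states.put(groupKey, state);
        }
        state.enter(value);
    }

    long count(Object groupKey) {
        CountSketch state = states.get(groupKey);
        return state == null ? 0 : state.count;
    }
}
```

Because every state comes from a separate factory call, an aggregation function instance never needs to separate the values of different groups itself.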

20.5.1.3. Implementing an Aggregation Single-Function

An aggregation function class is responsible for the following functions:

Implement an enter method that the runtime invokes to add a data point into the aggregation, when an event enters a data window
Implement a leave method that the runtime invokes to remove a data point from the aggregation, when an event leaves a data window
Implement a getValue method that returns the current value of the aggregation.
Implement a clear method that resets the current value.

Aggregation function classes implement the interface AggregationFunction:

public class MyConcatAggregationFunction implements AggregationFunction { ...

The class that provides the aggregation and implements AggregationFunction does not have to be threadsafe.

The constructor initializes the aggregation function:

public class MyConcatAggregationFunction implements AggregationFunction {
  private final static char DELIMITER = ' ';
  private StringBuilder builder;
  private String delimiter;

  public MyConcatAggregationFunction() {
    builder = new StringBuilder();
    delimiter = "";
  }
  ...

The enter method adds a datapoint to the current aggregation value. The example enter method shown below adds a delimiter and the string value to a string buffer:

public void enter(Object value) {
  if (value != null) {
    builder.append(delimiter);
    builder.append(value.toString());
    delimiter = String.valueOf(DELIMITER);
  }
}

Conversely, the leave method removes a datapoint from the current aggregation value. The example leave method removes from the string buffer:

public void leave(Object value) {
  if (value != null) {
    builder.delete(0, value.toString().length() + 1);
  }
}

Finally, the runtime obtains the current aggregation value by means of the getValue method:

public Object getValue() {
  return builder.toString();
}

For on-demand queries the aggregation function must support resetting its value to empty or start values. Implement the clear method to reset the value as shown below:

public void clear() {
  builder = new StringBuilder();
  delimiter = "";
}
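The four methods can be exercised outside the runtime. The following is a minimal sketch and not the class from the distribution: ConcatSketch is a hypothetical standalone class that mirrors enter, leave, getValue and clear without implementing AggregationFunction. It assumes that values leave in the order in which they entered (as with a length window) and it resets the delimiter once the buffer is empty, so that a value entering later does not start with a space.

```java
// Hypothetical standalone class; it does not implement Esper's AggregationFunction.
final class ConcatSketch {
    private static final char DELIMITER = ' ';
    private StringBuilder builder = new StringBuilder();
    private String delimiter = "";

    // Called when a value enters the data window.
    void enter(Object value) {
        if (value != null) {
            builder.append(delimiter).append(value);
            delimiter = String.valueOf(DELIMITER);
        }
    }

    // Called when the oldest value leaves the data window (assumes first-in, first-out order).
    void leave(Object value) {
        if (value != null) {
            // StringBuilder.delete clamps the end index to the current length
            builder.delete(0, value.toString().length() + 1);
            if (builder.length() == 0) {
                delimiter = "";
            }
        }
    }

    // Returns the current aggregation value.
    Object getValue() {
        return builder.toString();
    }

    // Resets the aggregation to its start value.
    void clear() {
        builder = new StringBuilder();
        delimiter = "";
    }
}
```

Entering IBM, MSFT and ORCL and then letting IBM leave produces the value MSFT ORCL in this sketch.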

20.5.1.4. Configuring the Aggregation Single-Function Name

The aggregation function class name as well as the function name for the new aggregation function must be added to the compiler configuration. The configuration shown below is XML; however, the same options are available through the configuration API:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <compiler>
    <plugin-aggregation-function name="concat" 
      forge-class="com.espertech.esper.example.runtimeconfig.MyConcatAggregationFunctionFactory" />
  </compiler>
</esper-configuration>

The new aggregation function is now ready to use in a statement:

select concat(symbol) from StockTick#length(3)

20.5.1.5. Aggregation Single-Function: Accepting Multiple Parameters

Your plug-in aggregation function may accept multiple parameters. In that case the getAggregationFunctionMode method must return a different mode, AggregationFunctionModeMultiParam:

    public AggregationFunctionMode getAggregationFunctionMode() {
        InjectionStrategy injectionStrategy = new InjectionStrategyClassNewInstance(SupportCountBackAggregationFunctionFactory.class);

        AggregationFunctionModeMultiParam multiParam = new AggregationFunctionModeMultiParam();
        multiParam.setInjectionStrategyAggregationFunctionFactory(injectionStrategy);
        
        return multiParam;
    }

For instance, assume an aggregation function rangeCount that counts all values that fall into a range of values. The EPL that calls this function and provides lower and upper bounds of 1 and 10 is:

select rangeCount(1, 10, myValue) from MyEvent

The enter method of the plug-in aggregation function may look as follows:

public void enter(Object value)  {
  Object[] params = (Object[]) value;
  int lower = (Integer) params[0];
  int upper = (Integer) params[1];
  int val = (Integer) params[2];
  if ((val >= lower) && (val <= upper)) {
    count++;
  }
}
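The enter method above shows only one of the aggregation methods. As a hedged sketch of how the remaining methods might look, the hypothetical standalone class RangeCountSketch below adds a leave, getValue and clear method. The class name, the leave logic and the decision to skip null values are assumptions for illustration and are not taken from the distribution.

```java
// Hypothetical standalone class; not part of the Esper API.
final class RangeCountSketch {
    private int count;

    // value is assumed to be an Object[] holding {lower, upper, val} in EPL parameter order.
    void enter(Object value) {
        if (inRange((Object[]) value)) {
            count++;
        }
    }

    // Assumed counterpart of enter: a value that counted on entry is subtracted on leave.
    void leave(Object value) {
        if (inRange((Object[]) value)) {
            count--;
        }
    }

    Object getValue() {
        return count;
    }

    void clear() {
        count = 0;
    }

    private static boolean inRange(Object[] params) {
        if (params[2] == null) {
            return false; // assumption: a null value never counts
        }
        int lower = (Integer) params[0];
        int upper = (Integer) params[1];
        int val = (Integer) params[2];
        return val >= lower && val <= upper;
    }
}
```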

Your plug-in aggregation function may want to validate parameter types or may want to know which parameters are constant-value expressions. Constant-value expressions are evaluated only once by the runtime and could therefore be cached by your aggregation function for performance reasons. The runtime provides constant-value information as part of the AggregationValidationContext passed to the validate method.

20.5.1.6. Aggregation Single-Function: The Filter Parameter

When using AggregationFunctionModeManaged the runtime already takes care of filters.

When using AggregationFunctionModeMultiParam, the compiler treats the expression of the filter named parameter as a boolean-type value, and the runtime provides that value to your enter method as the last element of the parameter array.

For instance, assume an aggregation function concat that receives a word value and that has a filter expression as parameters:

select concat(word, filter: word not like '%jim%') from MyWordEvent

The enter method of the plug-in aggregation function may look as follows:

public void enter(Object value)  {
  Object[] arr = (Object[]) value;
  Boolean pass = (Boolean) arr[1];
  if (pass != null && pass) {
    buffer.append(arr[0].toString());
  }
}

Your code can obtain the actual filter expression from the AggregationValidationContext that is passed to the validate method and that returns the named parameters via getNamedParameters.

20.5.1.7. Aggregation Single-Function: Distinct

When using AggregationFunctionModeManaged the runtime already takes care of distinct.

When using AggregationFunctionModeMultiParam your application code must determine and process distinct.
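One possible way to process distinct in application code is reference counting: a value contributes to the aggregation only while at least one copy of it is in the data window. The sketch below is an assumption-laden illustration, not an Esper API: DistinctSumSketch is a hypothetical class that sums distinct Integer values, and it does not show how your code determines that distinct was requested.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical standalone class; sums each distinct value once.
final class DistinctSumSketch {
    // Number of copies of each value currently in the data window.
    private final Map<Integer, Integer> refCounts = new HashMap<>();
    private long sum;

    void enter(Object value) {
        if (value == null) {
            return;
        }
        Integer v = (Integer) value;
        // Add to the sum only when this is the first copy of the value.
        if (refCounts.merge(v, 1, Integer::sum) == 1) {
            sum += v;
        }
    }

    void leave(Object value) {
        if (value == null) {
            return;
        }
        Integer v = (Integer) value;
        Integer copies = refCounts.get(v);
        if (copies == null) {
            return; // value never entered
        }
        if (copies == 1) {
            // Last copy leaves: the value no longer contributes.
            refCounts.remove(v);
            sum -= v;
        } else {
            refCounts.put(v, copies - 1);
        }
    }

    Object getValue() {
        return sum;
    }
}
```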

20.5.1.8. Aggregation Single-Function: Dot-Operator Use

When the custom aggregation function returns an object as a return value, the EPL can use parentheses and the dot-operator to invoke methods on the return value.

The following example assumes that the myAggregation custom aggregation function returns an object that has getValueOne and getValueTwo methods:

select (myAggregation(myValue)).getValueOne(),  (myAggregation(myValue)).getValueTwo() from MyEvent

Since the above EPL aggregates the same value, the runtime internally uses a single aggregation to represent the current value of myAggregation (and not two instances of the aggregation, even though myAggregation is listed twice).

20.5.2. Aggregation Multi-Function Development

This section introduces the aggregation multi-function API. Please refer to the JavaDoc for more complete class and method-level documentation.

The examples include Cycle-Detect, which demonstrates the aggregation multi-function API. Cycle-Detect takes incoming transaction events that have from-account and to-account fields. The example detects a cycle in the transactions between accounts in order to detect a possible transaction fraud. Please note that the graph and cycle detection logic of the example is not part of the distribution: the example uses the jgrapht library.

In the Cycle-Detect example, the vertices of a graph are the account numbers, for example Acct-1, Acct-2 and Acct-3. The edges of the graph are transaction events that identify a from-account and a to-account. An example edge is {from:Acct-1, to:Acct-2}. An example cycle is therefore formed by the three transactions {from:Acct-1, to:Acct-2}, {from:Acct-2, to:Acct-3} and {from:Acct-3, to:Acct-1}.
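To make the notion of a cycle concrete, the following sketch detects a cycle in a directed graph of accounts with a plain depth-first search. It deliberately does not use jgrapht, which the actual example relies on; CycleSketch and its methods are hypothetical names introduced only for this illustration.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical standalone class; the Cycle-Detect example itself uses jgrapht.
final class CycleSketch {
    // Adjacency list: from-account to the list of to-accounts.
    private final Map<String, List<String>> edges = new HashMap<>();

    void addEdge(String from, String to) {
        edges.computeIfAbsent(from, k -> new ArrayList<>()).add(to);
    }

    boolean hasCycle() {
        Set<String> visited = new HashSet<>();
        Set<String> onPath = new HashSet<>();
        for (String start : edges.keySet()) {
            if (visit(start, visited, onPath)) {
                return true;
            }
        }
        return false;
    }

    private boolean visit(String node, Set<String> visited, Set<String> onPath) {
        if (onPath.contains(node)) {
            return true; // reached a vertex that is on the current path: cycle
        }
        if (!visited.add(node)) {
            return false; // explored earlier without finding a cycle
        }
        onPath.add(node);
        for (String next : edges.getOrDefault(node, Collections.emptyList())) {
            if (visit(next, visited, onPath)) {
                return true;
            }
        }
        onPath.remove(node);
        return false;
    }
}
```

With the edges Acct-1 to Acct-2 and Acct-2 to Acct-3 the sketch reports no cycle; adding the edge Acct-3 to Acct-1 closes the cycle.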

The code for the example aggregation multi-function as shown in this chapter can be found in the Cycle-Detect example in the package com.espertech.esper.example.cycledetect. The example provides two aggregation functions named cycledetected and cycleoutput:

The cycledetected function returns a boolean value that indicates whether a graph cycle is found.
The cycleoutput function outputs the vertices (account numbers) that are part of the graph cycle.

In the Cycle-Detect example, the following statement utilizes the two functions cycledetected and cycleoutput that share the same graph state to detect a cycle among the last 1000 events:

@Name('CycleDetector') select cycleoutput() as cyclevertices
from TransactionEvent#length(1000)
having cycledetected(fromAcct, toAcct)

If instead the goal is to run graph cycle detection every 1 second (and not upon arrival of a new event), this sample statement uses a pattern to trigger cycle detection:

@Name('CycleDetector')
select (select cycleoutput(fromAcct, toAcct) from TransactionEvent#length(1000)) as cyclevertices
from pattern [every timer:interval(1)]

The following steps are required to develop and use a custom aggregation multi-function.

Implement an aggregation multi-function forge by implementing the interface com.espertech.esper.common.client.hook.aggmultifunc.AggregationMultiFunctionForge.
Implement one or more handlers for aggregation functions by implementing the interface com.espertech.esper.common.client.hook.aggmultifunc.AggregationMultiFunctionHandler.
Implement an aggregation state key by implementing the interface com.espertech.esper.common.client.hook.aggmultifunc.AggregationMultiFunctionStateKey.
Implement an aggregation state factory by implementing the interface com.espertech.esper.common.client.hook.aggmultifunc.AggregationMultiFunctionStateFactory.
Implement an aggregation state holder by implementing the interface com.espertech.esper.common.client.hook.aggmultifunc.AggregationMultiFunctionState.
Implement a state accessor factory by implementing the interface com.espertech.esper.common.client.hook.aggmultifunc.AggregationMultiFunctionAccessorFactory.
Implement a state accessor by implementing the interface com.espertech.esper.common.client.hook.aggmultifunc.AggregationMultiFunctionAccessor.
For use with tables, implement an agent factory by implementing the interface com.espertech.esper.common.client.hook.aggmultifunc.AggregationMultiFunctionAgentFactory.
For use with tables, implement an agent by implementing the interface com.espertech.esper.common.client.hook.aggmultifunc.AggregationMultiFunctionAgent.
For use with tables, implement a table reader factory by implementing the interface com.espertech.esper.common.client.hook.aggmultifunc.AggregationMultiFunctionTableReaderFactory.
For use with tables, implement a table reader by implementing the interface com.espertech.esper.common.client.hook.aggmultifunc.AggregationMultiFunctionTableReader.
Register the aggregation multi-function forge class with the compiler by supplying one or more function names, via the compiler configuration file or the runtime and static configuration API.

20.5.2.1. Implementing an Aggregation Multi-Function Forge

An aggregation multi-function forge class is a compile-time class responsible for the following functions:

Implement the addAggregationFunction method that receives an invocation for each aggregation function declared in the statement that matches any of the function names provided at configuration time.
Implement the validateGetHandler method that receives an invocation for each aggregation function to be validated in the statement that matches any of the function names provided at configuration time.

Aggregation multi-function forge classes implement the interface AggregationMultiFunctionForge:

public class CycleDetectorAggregationForge implements AggregationMultiFunctionForge { ...

The compiler constructs a single instance of the aggregation multi-function forge class that is shared for all aggregation function expressions in a statement that have one of the function names provided in the configuration object.

The compiler invokes the addAggregationFunction method at the time it compiles a statement. The method receives a declaration-time context object that provides the function name as well as additional information.

The sample Cycle-Detect forge class provides an empty addAggregationFunction method:

public void addAggregationFunction(AggregationMultiFunctionDeclarationContext declarationContext) {
    // provides an opportunity to inspect where used
}

The compiler invokes the validateGetHandler method at the time of expression validation. It passes an AggregationMultiFunctionValidationContext validation context object that contains the actual parameter expressions. Please see the JavaDoc API documentation for a comprehensive list of validation context information.

The validateGetHandler method must return a handler object that implements the AggregationMultiFunctionHandler interface. Return a handler object for each aggregation function expression according to the aggregation function name and its parameters that are provided in the validation context.

The example cycledetected function takes two parameters that provide the cycle edge (from-account and to-account):

public AggregationMultiFunctionHandler validateGetHandler(AggregationMultiFunctionValidationContext validationContext) {
  if (validationContext.getParameterExpressions().length == 2) {
    fromExpression = validationContext.getParameterExpressions()[0];
    toExpression = validationContext.getParameterExpressions()[1];
  }
  return new CycleDetectorAggregationHandler(this, validationContext);
}

20.5.2.2. Implementing an Aggregation Multi-Function Handler

An aggregation multi-function handler class is a compile-time class that must implement the AggregationMultiFunctionHandler interface and is responsible for the following functions:

Implement the getReturnType method that returns information about the type of return values provided.
Implement the getAggregationStateUniqueKey method that provides a key object used by the compiler to determine which aggregation functions share state.
Implement the getStateMode method that returns information to the compiler that the compiler uses to initialize the state factory at deployment time.
Implement the getAccessorMode method that returns information to the compiler that the compiler uses to initialize the accessor factory at deployment time.
Implement the getAgentMode method that returns information to the compiler that the compiler uses to initialize the agent factory at deployment time, for use with tables.
Implement the getTableReaderMode method that returns information to the compiler that the compiler uses to initialize the table reader factory at deployment time, for use with tables.

In the Cycle-Detect example, the class CycleDetectorAggregationHandler is the handler for all aggregation functions.

public class CycleDetectorAggregationHandler implements AggregationMultiFunctionHandler { ...

The getReturnType method provided by the handler instructs the compiler about the return type of each aggregation accessor. The class EPType holds return type information.

In the Cycle-Detect example the cycledetected function returns a single boolean value. The cycleoutput returns a collection of vertices:

public EPType getReturnType() {
    if (validationContext.getFunctionName().toLowerCase(Locale.ENGLISH).equals(CycleDetectorConstant.CYCLEOUTPUT_NAME)) {
        return EPTypeHelper.collectionOfSingleValue(forge.getFromExpression().getForge().getEvaluationType());
    }
    return EPTypeHelper.singleValue(Boolean.class);
}

The compiler invokes the getAggregationStateUniqueKey method to determine whether multiple aggregation function expressions in the same statement can share the same aggregation state or should receive different aggregation state instances.

The getAggregationStateUniqueKey method must return an instance of AggregationMultiFunctionStateKey. The compiler uses equals-semantics (the hashCode and equals methods) to determine whether multiple aggregation functions share the state object. If the handler returns equal key objects for several aggregation functions, the compiler shares aggregation state between those aggregation functions for the same statement and context partition.

In the Cycle-Detect example the state is shared, which it achieves by simply returning the same key instance:

private static final AggregationMultiFunctionStateKey CYCLE_KEY = new AggregationMultiFunctionStateKey() {};

public AggregationMultiFunctionStateKey getAggregationStateUniqueKey() {
    return CYCLE_KEY;
}
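Returning one shared instance is the simplest way to satisfy equals-semantics. As an alternative, a key class can implement equals and hashCode itself, so that handlers can create separate key instances that still compare as equal. The sketch below is an illustration under assumptions: StateKeySketch is a stand-in for the AggregationMultiFunctionStateKey marker interface so that the code compiles without Esper, and NamedStateKey and its stateName field are hypothetical.

```java
// Stand-in for the marker interface; hypothetical, so the sketch compiles without Esper.
interface StateKeySketch {
}

// Hypothetical key class: two keys with the same state name compare as equal.
final class NamedStateKey implements StateKeySketch {
    private final String stateName;

    NamedStateKey(String stateName) {
        this.stateName = stateName;
    }

    @Override
    public boolean equals(Object other) {
        return other instanceof NamedStateKey
                && stateName.equals(((NamedStateKey) other).stateName);
    }

    @Override
    public int hashCode() {
        return stateName.hashCode();
    }
}
```

Under this design, functions whose handlers return keys with the same name would share state, and functions whose handlers return keys with different names would each receive their own state.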

The compiler invokes the getStateMode method to obtain an instance of AggregationMultiFunctionStateMode. The state mode is responsible for obtaining and configuring an aggregation state factory instance at time of deployment.

In the Cycle-Detect example the method passes the expression evaluators providing the from-account and to-account expressions to the state factory:

public AggregationMultiFunctionStateMode getStateMode() {
    AggregationMultiFunctionStateModeManaged managed = new AggregationMultiFunctionStateModeManaged();
    InjectionStrategyClassNewInstance injection = new InjectionStrategyClassNewInstance(CycleDetectorAggregationStateFactory.class);
    injection.addExpression("from", forge.getFromExpression());
    injection.addExpression("to", forge.getToExpression());
    managed.setInjectionStrategyAggregationStateFactory(injection);
    return managed;
}

The compiler invokes the getAccessorMode method to obtain an instance of AggregationMultiFunctionAccessorMode. The accessor mode is responsible for obtaining and configuring an accessor factory instance at time of deployment.

The getAccessorMode method provides information about the accessor factories according to whether the aggregation function name is cycledetected or cycleoutput:

public AggregationMultiFunctionAccessorMode getAccessorMode() {
    Class accessor;
    if (validationContext.getFunctionName().toLowerCase(Locale.ENGLISH).equals(CycleDetectorConstant.CYCLEOUTPUT_NAME)) {
        accessor = CycleDetectorAggregationAccessorOutputFactory.class;
    }
    else {
        accessor = CycleDetectorAggregationAccessorDetectFactory.class;
    }
    AggregationMultiFunctionAccessorModeManaged managed = new AggregationMultiFunctionAccessorModeManaged();
    InjectionStrategyClassNewInstance injection = new InjectionStrategyClassNewInstance(accessor);
    managed.setInjectionStrategyAggregationAccessorFactory(injection);
    return managed;
}

20.5.2.3. Implementing an Aggregation Multi-Function State Factory

An aggregation multi-function state factory class must implement the AggregationMultiFunctionStateFactory interface and is responsible for the following functions:

Implement the newState method that returns an aggregation state holder.

The runtime invokes the newState method to obtain a new aggregation state instance before applying aggregation state. If using group by in your statement, the runtime invokes the newState method to obtain a state holder for each group.

In the Cycle-Detect example, the class CycleDetectorAggregationStateFactory is the state factory for all aggregation functions:

public class CycleDetectorAggregationStateFactory implements AggregationMultiFunctionStateFactory {

    private ExprEvaluator from;
    private ExprEvaluator to;

    public AggregationMultiFunctionState newState(AggregationMultiFunctionStateFactoryContext ctx) {
        return new CycleDetectorAggregationState(this);
    }

    public void setFrom(ExprEvaluator from) {
        this.from = from;
    }

    public void setTo(ExprEvaluator to) {
        this.to = to;
    }

    public ExprEvaluator getFrom() {
        return from;
    }

    public ExprEvaluator getTo() {
        return to;
    }
}

20.5.2.4. Implementing an Aggregation Multi-Function State

An aggregation multi-function state class must implement the AggregationMultiFunctionState interface and is responsible for the following functions:

Implement the applyEnter method that enters events, event properties or computed values.
Implement the applyLeave method that can remove events or computed values.
Implement the clear method to clear state.

In the Cycle-Detect example, the class CycleDetectorAggregationState is the state for all aggregation functions. Please review the example for more information.

20.5.2.5. Implementing an Aggregation Multi-Function Accessor Factory

An aggregation multi-function accessor factory class must implement the AggregationMultiFunctionAccessorFactory interface and is responsible for the following functions:

Implement the newAccessor method that returns a new accessor.

In the Cycle-Detect example, the class CycleDetectorAggregationAccessorDetectFactory returns the accessor like so:

public class CycleDetectorAggregationAccessorDetectFactory implements AggregationMultiFunctionAccessorFactory {
    public AggregationMultiFunctionAccessor newAccessor(AggregationMultiFunctionAccessorFactoryContext ctx) {
        return new CycleDetectorAggregationAccessorDetect();
    }
}

20.5.2.6. Implementing an Aggregation Multi-Function Accessor

An aggregation multi-function accessor class must implement the AggregationMultiFunctionAccessor interface and is responsible for the following functions:

Implement the Object getValue(AggregationMultiFunctionState state, ...) method that returns a result object for the aggregation state.
Implement the Collection<EventBean> getEnumerableEvents(AggregationMultiFunctionState state, ...) method that returns a collection of events for enumeration, if applicable (or null).
Implement the EventBean getEnumerableEvent(AggregationMultiFunctionState state, ...) method that returns an event, if applicable (or null).
Implement the Collection getEnumerableScalar(AggregationMultiFunctionState state, ...) method that returns a collection of scalar values for enumeration, if applicable (or null).

In the Cycle-Detect example, the class CycleDetectorAggregationAccessorDetect returns state for the cycledetected aggregation function and the CycleDetectorAggregationAccessorOutput returns the state for the cycleoutput aggregation function.

20.5.2.7. Configuring the Aggregation Multi-Function Name

An aggregation multi-function configuration can receive one or multiple function names. You must also set a forge class name.

The sample XML snippet below configures an aggregation multi-function that is associated with the function names cycledetected and cycleoutput.

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <compiler>
    <plugin-aggregation-multifunction 
        function-names="cycledetected,cycleoutput"
        forge-class="com.espertech.esper.example.cycledetect.CycleDetectorAggregationFactory"/>
  </compiler>
</esper-configuration>

The next example uses the configuration API to register the same:

String[] functionNames = new String[] {"cycledetected", "cycleoutput"};
ConfigurationPlugInAggregationMultiFunction config = new ConfigurationPlugInAggregationMultiFunction(functionNames, CycleDetectorAggregationFactory.class.getName());
Configuration configuration = new Configuration();
configuration.getCompiler().addPlugInAggregationMultiFunction(config);

20.5.2.8. Aggregation Multi-Function Thread Safety

The runtime shares an AggregationAccessor instance between threads. Design the accessor to be stateless; the AggregationAccessor implementation should not use locking of any kind unless your implementation uses other state. Because the runtime passes an aggregation state instance to the accessor, the accessor is thread-safe as long as it relies only on the aggregation state passed to it.

The runtime does not share an AggregationState instance between threads. There is no need to use locking of any kind in the AggregationState implementation unless your implementation uses other state.

20.5.2.9. Aggregation Multi-Function Use With Tables

Tables allow columns to hold aggregation state including the state for multi-function aggregations. This section provides API pointers.

When a statement accesses a table column that declares aggregation state of a multi-function aggregation, the AggregationMultiFunctionValidationContext contains an optionalTableColumnRead field that provides information about the table column.

To find out the statement type, such as to determine whether the current statement is a create-table statement, use context.getValidationContext().getExprEvaluatorContext().getStatementType().

To find out whether the statement aggregates into a table, use context.getValidationContext().getIntoTableName() that returns the table name or null if not aggregating into a table.

The compiler uses AggregationMultiFunctionStateKey to determine whether an aggregation function listed with into table is compatible with the aggregation type that a table column declares. The equals method of the object must return true for compatible and false for incompatible.

Your handler must provide agent and table reader modes. Please follow the JavaDoc or inspect the regression test suite.

20.5.2.10. Aggregation Multi-Function Use Filter Expression

The filter expression is passed to you in PlugInAggregationMultiFunctionValidationContext as part of getNamedParameters under the name filter. When used with tables, the filter expression is part of PlugInAggregationMultiFunctionAgentContext.

Your application must invoke the filter expression as the runtime does not evaluate the filter expression for you. For example:

ExprEvaluator filterEval = validationContext.getNamedParameters().get("filter").get(0).getExprEvaluator();

public void applyEnter(EventBean[] eventsPerStream, ExprEvaluatorContext exprEvaluatorContext) {
  Boolean pass = (Boolean) filterEval.evaluate(eventsPerStream, true, exprEvaluatorContext); // note: pass "false" for applyLeave
  if (pass != null && pass) {
    Object value = valueEval.evaluate(eventsPerStream, true, exprEvaluatorContext); // note: pass "false" for applyLeave
    // do something
  }
}
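The same filter check applies when values leave, so that the state only removes what it previously added. The sketch below illustrates this symmetry under stated assumptions: it replaces ExprEvaluator with java.util.function.Function, uses a plain String as the event, and FilteredStateSketch is a hypothetical class rather than an implementation of AggregationMultiFunctionState.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Function;

// Hypothetical standalone class; Function stands in for Esper's ExprEvaluator.
final class FilteredStateSketch {
    private final Function<String, Boolean> filterEval;
    private final Function<String, String> valueEval;
    private final List<String> values = new ArrayList<>();

    FilteredStateSketch(Function<String, Boolean> filterEval, Function<String, String> valueEval) {
        this.filterEval = filterEval;
        this.valueEval = valueEval;
    }

    void applyEnter(String event) {
        Boolean pass = filterEval.apply(event);
        if (pass != null && pass) {
            values.add(valueEval.apply(event));
        }
    }

    // Evaluates the same filter so that only values that were added are removed.
    void applyLeave(String event) {
        Boolean pass = filterEval.apply(event);
        if (pass != null && pass) {
            values.remove(valueEval.apply(event));
        }
    }

    List<String> getValues() {
        return values;
    }
}
```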

20.6. Pattern Guard

Pattern guards are pattern objects that control the lifecycle of the guarded sub-expression, and can filter the events fired by the sub-expression.

The following steps are required to develop and use a custom guard object.

Implement a guard forge class, responsible for compile-time guard information.
Implement a guard factory class, responsible for creating guard object instances at runtime.
Implement a guard class (used at runtime).
Register the guard forge class with the compiler by supplying a namespace and name, via the compiler configuration.

The code for the example guard object as shown in this chapter can be found in the test source folder in the package com.espertech.esper.regressionlib.support.extend.pattern by the name MyCountToPatternGuardForge. The sample guard discussed here counts the number of events occurring up to a maximum number of events, and ends the sub-expression when that maximum is reached.

Some of the APIs that you use to implement a pattern guard are internal APIs and are not stable and may change between releases. The client package contains all the stable interface classes.

20.6.1. Implementing a Guard Forge

A guard forge class is only used by the compiler and is responsible for the following functions:

Implement a setGuardParameters method that takes guard parameters, which are themselves expressions.
Implement a collectSchedule method that collects guard schedule objects if any.
Implement a makeCodegen method that provides the code to construct a guard factory at time of deployment.

Guard forge classes implement the GuardForge interface:

public class MyCountToPatternGuardForge implements GuardForge { ...

The compiler constructs one instance of the guard forge class each time the guard is listed in a statement.

The guard forge class implements the setGuardParameters method that is passed the parameters to the guard as supplied by the statement. It verifies the guard parameters, similar to the code snippet shown next. Our example counter guard takes a single numeric parameter:

public void setGuardParameters(List<ExprNode> guardParameters, MatchedEventConvertorForge convertor, StatementCompileTimeServices services) throws GuardParameterException {
    String message = "Count-to guard takes a single integer-value expression as parameter";
    if (guardParameters.size() != 1) {
        throw new GuardParameterException(message);
    }

    Class paramType = guardParameters.get(0).getForge().getEvaluationType();
    if (paramType != Integer.class && paramType != int.class) {
        throw new GuardParameterException(message);
    }
        
    this.numCountToExpr = guardParameters.get(0);
    this.convertor = convertor;
}

The makeCodegen method is called by the compiler to receive the code that builds a guard factory. Use the SAIFFInitializeBuilder to build factory initialization code:

public CodegenExpression makeCodegen(CodegenMethodScope parent, SAIFFInitializeSymbol symbols, CodegenClassScope classScope) {
    SAIFFInitializeBuilder builder = new SAIFFInitializeBuilder(MyCountToPatternGuardFactory.class, this.getClass(), "guardFactory", parent, symbols, classScope);
    return builder.exprnode("numCountToExpr", numCountToExpr)
                .expression("convertor", convertor.makeAnonymous(builder.getMethod(), classScope))
                .build();
}

20.6.2. Implementing a Guard Factory

A guard factory class is responsible for the following functions:

Implement a makeGuard method that constructs a new guard instance.

Guard factory classes implement the GuardFactory interface:

public class MyCountToPatternGuardFactory implements GuardFactory { ...

The runtime obtains an instance of the guard factory class at time of deployment.

The makeGuard method is called by the runtime to create a new guard instance. The example makeGuard method shown below passes the maximum count of events to the guard instance. It also passes a Quitable implementation to the guard instance. The guard uses Quitable to indicate that the sub-expression contained within must stop (quit) listening for events.

public Guard makeGuard(PatternAgentInstanceContext context, MatchedEventMap beginState, Quitable quitable, Object guardState) {
    EventBean[] events = convertor == null ? null : convertor.convert(beginState);
    Object parameter = PatternExpressionUtil.evaluateChecked("Count-to guard", numCountToExpr, events, context.getAgentInstanceContext());
    if (parameter == null) {
        throw new EPException("Count-to guard parameter evaluated to a null value");
    }

    Integer numCountTo = (Integer) parameter;
    return new MyCountToPatternGuard(numCountTo, quitable);
}

20.6.3. Implementing a Guard Class

A guard class has the following responsibilities:

Provides a startGuard method that initializes the guard.
Provides a stopGuard method that stops the guard, called by the runtime when the whole pattern is stopped, or the sub-expression containing the guard is stopped.
Provides an inspect method that the pattern runtime invokes to determine if the guard lets matching events pass for further evaluation by the containing expression.

Guard classes implement the Guard interface as shown here:

public class MyCountToPatternGuard implements Guard {

The runtime invokes the guard factory class to construct an instance of the guard class for each new sub-expression instance within a statement.

A guard class must provide an implementation of the startGuard method that the runtime invokes to start a guard instance. In our example, the method resets the guard's counter to zero:

public void startGuard() {
  counter = 0;
}

The runtime invokes the inspect method each time the sub-expression indicates a new event result. Our example guard needs to count the number of events matched, and quit if the maximum number is reached:

public boolean inspect(MatchedEventMap matchEvent) {
  counter++;
  if (counter > numCountTo) {
    quitable.guardQuit();
    return false;
  }
  return true;
}

The inspect method returns true for events that pass the guard, and false for events that should not pass the guard.
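The interplay of startGuard, inspect and the quit callback can be tried without the runtime. The following is a minimal sketch under assumptions: QuitableSketch stands in for the Quitable callback, CountToGuardSketch is a hypothetical class that does not implement the Guard interface, and inspect takes no MatchedEventMap argument because the counter logic does not need it.

```java
// Stand-in for the Quitable callback; hypothetical, so the sketch compiles without Esper.
interface QuitableSketch {
    void guardQuit();
}

// Hypothetical standalone class that mirrors the counting logic of the example guard.
final class CountToGuardSketch {
    private final int numCountTo;
    private final QuitableSketch quitable;
    private int counter;

    CountToGuardSketch(int numCountTo, QuitableSketch quitable) {
        this.numCountTo = numCountTo;
        this.quitable = quitable;
    }

    // Resets the counter when the guard starts.
    void startGuard() {
        counter = 0;
    }

    // Returns true while the number of inspected events is within the maximum.
    boolean inspect() {
        counter++;
        if (counter > numCountTo) {
            quitable.guardQuit(); // tell the sub-expression to stop listening
            return false;
        }
        return true;
    }
}
```

With a maximum of 3, the first three calls to inspect return true; the fourth call returns false and invokes guardQuit once.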

20.6.4. Configuring Guard Namespace and Name

The guard forge class name as well as the namespace and name for the new guard must be added to the compiler configuration. The configuration shown below is XML; however, the same options are available through the configuration API:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <compiler>
    <plugin-pattern-guard namespace="myplugin" name="count_to" 
        forge-class="com.espertech.esper.regressionlib.support.extend.pattern.MyCountToPatternGuardForge"/>
  </compiler>
</esper-configuration>

The new guard is now ready to use in a statement. The next pattern statement detects the first 10 MyEvent events:

select * from pattern [(every MyEvent) where myplugin:count_to(10)]

Note that the every keyword was placed within parentheses to ensure the guard controls the repeated matching of events.

20.7. Pattern Observer

Pattern observers are pattern objects that are executed as part of a pattern expression and can observe events or test conditions. Examples of built-in observers are timer:at and timer:interval. Some suggested uses of observer objects are:

Implement custom scheduling logic using the runtime's own scheduling and timer services
Test conditions related to prior events matching an expression

The following steps are required to develop and use a custom observer object within pattern statements:

Implement an observer forge class, which is used by the compiler only and is responsible for validating parameters and for initializing an observer factory.
Implement an observer factory class, responsible for creating observer object instances.
Implement an observer class.
Register the observer forge class with the compiler by supplying a namespace and name, via the compiler configuration file or the configuration API.

The code for the example observer object as shown in this chapter can be found in the test source folder in package com.espertech.esper.regression.client by the name MyFileExistsObserver. The sample observer discussed here very simply checks if a file exists, using the filename supplied by the pattern statement, and via the java.io.File class.

Some of the APIs that you use to implement a pattern observer are internal APIs and are not stable and may change between releases. The client package contains all the stable interface classes.
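The check at the heart of the sample observer is a single call on java.io.File. As a hedged illustration of that check only, FileExistsSketch below is a hypothetical helper and not the MyFileExistsObserver class; treating a null filename as "does not exist" is an assumption.

```java
import java.io.File;

// Hypothetical helper; shows only the file check, not the observer lifecycle.
final class FileExistsSketch {
    private FileExistsSketch() {
    }

    // Returns true when the filename denotes an existing file or directory.
    static boolean fileExists(String filename) {
        return filename != null && new File(filename).exists();
    }
}
```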

20.7.1. Implementing an Observer Forge

An observer forge class is responsible for the following functions:

Implement a setObserverParameters method that takes observer parameters, which are themselves expressions.
Implement a collectSchedule method that collects observer schedule objects if any.
Implement a makeCodegen method that provides the code to construct an observer factory at time of deployment.

Observer forge classes implement the ObserverForge interface:

public class MyFileExistsObserverForge implements ObserverForge { ...

The compiler constructs one instance of the observer forge class for each time the observer is listed in a statement.

The observer forge class implements the setObserverParameters method that is passed the parameters to the observer as supplied by the statement. It verifies the observer parameters, similar to the code snippet shown next. Our example file-exists observer takes a single string parameter:

public void setObserverParameters(List<ExprNode> observerParameters, MatchedEventConvertorForge convertor, ExprValidationContext validationContext) throws ObserverParameterException {
    String message = "File exists observer takes a single string filename parameter";
    if (observerParameters.size() != 1) {
        throw new ObserverParameterException(message);
    }
    if (!(observerParameters.get(0).getForge().getEvaluationType() == String.class)) {
        throw new ObserverParameterException(message);
    }

    this.filenameExpression = observerParameters.get(0);
    this.convertor = convertor;
}

The compiler calls the makeCodegen method to provide code that initializes the observer factory at time of deployment. It uses the SAIFFInitializeBuilder to build the code.

public CodegenExpression makeCodegen(CodegenMethodScope parent, SAIFFInitializeSymbol symbols, CodegenClassScope classScope) {
    SAIFFInitializeBuilder builder = new SAIFFInitializeBuilder(MyFileExistsObserverFactory.class, this.getClass(), "observerFactory", parent, symbols, classScope);
    return builder.exprnode("filenameExpression", filenameExpression)
            .expression("convertor", convertor.makeAnonymous(builder.getMethod(), classScope))
            .build();
}

20.7.2. Implementing an Observer Factory

An observer factory class is responsible for the following functions:

Implement a makeObserver method that returns a new observer instance.

Observer factory classes implement the ObserverFactory interface:

public class MyFileExistsObserverFactory implements ObserverFactory { ...

The runtime obtains an instance of the observer factory class at time of deployment.

The runtime calls the makeObserver method to create a new observer instance. The example makeObserver method shown below passes parameters to the observer instance:

public EventObserver makeObserver(PatternAgentInstanceContext context, MatchedEventMap beginState, ObserverEventEvaluator observerEventEvaluator, Object observerState, boolean isFilterChildNonQuitting) {
    EventBean[] events = convertor == null ? null : convertor.convert(beginState);
    Object filename = PatternExpressionUtil.evaluateChecked("File-exists observer ", filenameExpression, events, context.getAgentInstanceContext());
    if (filename == null) {
        throw new EPException("Filename evaluated to null");
    }
    return new MyFileExistsObserver(beginState, observerEventEvaluator, filename.toString());
}

The ObserverEventEvaluator parameter allows an observer to indicate events, and to indicate a change of truth value to permanently false. Use this interface to indicate when your observer has received or witnessed an event, or changed its truth value to true or permanently false.

The MatchedEventMap parameter provides a Map of all matching events for the expression prior to the observer's start. For example, consider a pattern as below:

a=MyEvent -> myplugin:my_observer(...)

The above pattern tagged the MyEvent instance with the tag "a". The runtime starts an instance of my_observer when it receives the first MyEvent. The observer can query the MatchedEventMap using "a" as a key and obtain the tagged event.

20.7.3. Implementing an Observer Class

An observer class has the following responsibilities:

Provides a startObserve method that starts the observer.
Provides a stopObserve method that stops the observer, called by the runtime when the whole pattern is stopped, or the sub-expression containing the observer is stopped.

Observer classes implement the EventObserver interface as shown here:

public class MyFileExistsObserver implements EventObserver { ...

The runtime invokes the observer factory class to construct an instance of the observer class for each new sub-expression instance within a statement.

An observer class must provide an implementation of the startObserve method that the runtime invokes to start an observer instance. In our example, the observer checks for the presence of a file and indicates the truth value to the remainder of the expression:

public void startObserve() {
  File file = new File(filename);
  if (file.exists()) {
    observerEventEvaluator.observerEvaluateTrue(beginState);
  } 
  else {
    observerEventEvaluator.observerEvaluateFalse(); 
  }
}

Note the observer passes the ObserverEventEvaluator an instance of MatchedEventMap. The observer can also create one or more new events and pass these events through the Map to the remaining expressions in the pattern.
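The truth-value logic of startObserve can be tried without the runtime. The following is a minimal stand-alone sketch, not the Esper observer class itself: the TruthCallback interface and the FileExistsCheck class are hypothetical stand-ins for ObserverEventEvaluator and the observer, introduced only for illustration.

```java
import java.io.File;

// Hypothetical stand-in for the truth-value callbacks of ObserverEventEvaluator.
interface TruthCallback {
    void evaluateTrue();
    void evaluateFalse();
}

// Stand-alone sketch of the file-exists check (assumption: not an Esper class).
class FileExistsCheck {
    private final String filename;
    private final TruthCallback callback;

    FileExistsCheck(String filename, TruthCallback callback) {
        this.filename = filename;
        this.callback = callback;
    }

    // Mirrors startObserve: indicate true if the file exists, false otherwise.
    void start() {
        if (new File(filename).exists()) {
            callback.evaluateTrue();
        } else {
            callback.evaluateFalse();
        }
    }

    // Convenience helper: runs the check once and returns the indicated truth value.
    static boolean check(String filename) {
        final boolean[] result = new boolean[1];
        new FileExistsCheck(filename, new TruthCallback() {
            public void evaluateTrue() { result[0] = true; }
            public void evaluateFalse() { result[0] = false; }
        }).start();
        return result[0];
    }
}
```

Keeping the check behind a callback, rather than returning a boolean directly, reflects how an observer reports its truth value to the remainder of the pattern expression.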

20.7.4. Configuring Observer Namespace and Name

The observer forge class name as well as the namespace and name for the new observer must be added to the compiler configuration via the configuration API or using the XML configuration file. The configuration shown below is XML; however, the same options are available through the configuration API:

<esper-configuration xmlns="http://www.espertech.com/schema/esper">
  <compiler>
    <plugin-pattern-observer namespace="myplugin" name="file_exists" 
      forge-class="com.espertech.esper.regressionlib.support.extend.pattern.MyFileExistsObserverForge" />
  </compiler>
</esper-configuration>

The new observer is now ready to use in a statement. The next pattern statement checks every 10 seconds if the given file exists, and indicates to the listener when the file is found.

select * from pattern [every timer:interval(10 sec) -> myplugin:file_exists("myfile.txt")]

Chapter 21. Examples, Tutorials, Case Studies

21.1. Examples Overview

21.2. Running the Examples

21.3. AutoID RFID Reader

21.4. Runtime Configuration

21.5. JMS Server Shell and Client

21.5.1. Overview
21.5.2. JMS Messages as Events
21.5.3. JMX for Remote Dynamic Statement Management

21.6. Market Data Feed Monitor

21.6.1. Input Events
21.6.2. Computing Rates per Feed
21.6.3. Detecting a Fall-Off
21.6.4. Event generator

21.7. OHLC Plug-In Data Window

21.8. Transaction 3-Event Challenge

21.8.1. The Events
21.8.2. Combined Event
21.8.3. Real-Time Summary Data
21.8.4. Find Problems
21.8.5. Event Generator

21.9. Self-Service Terminal

21.9.1. Events
21.9.2. Detecting Customer Check-In Issues
21.9.3. Absence of Status Events
21.9.4. Activity Summary Data
21.9.5. Sample Application for J2EE Application Server

21.10. Assets Moving Across Zones - An RFID Example

21.11. StockTicker

21.12. MatchMaker

21.13. Named Window Query

21.14. Sample Virtual Data Window

21.15. Sample Cycle Detection

21.16. Quality of Service

21.17. Trivia Geeks Club

21.1. Examples Overview

This chapter outlines the examples that come with the distribution, located in the examples folder. Each sample is in a separate folder that contains all files needed by the example, excluding jar files.

Here is an overview of the examples:

Table 21.1. Examples

Name	Description
Section 21.3, “AutoID RFID Reader”	An array of RFID readers sense RFID tags as pallets are coming within the range of one of the readers. Shows the use of an XSD schema and XML event representation. A single statement shows a rolling time window, a where-clause filter on a nested property and a group-by.
Section 21.6, “Market Data Feed Monitor”	Processes a raw market data feed and reports throughput statistics and detects when the data rate of a feed falls off unexpectedly. Demonstrates a batch time window and a rolling time window with a having-clause. Multi-threaded example with a configurable number of threads and a simulator for generating feed data.
Section 21.12, “MatchMaker”	In the MatchMaker example every mobile user has an X and Y location and the task of the event patterns created by this example is to detect mobile users that are within proximity given a certain range, and for which certain properties match preferences. Uses an overlapping context to find matching mobile users based on mobile user events.
Section 21.13, “Named Window Query”	A mini-benchmark that handles temperature sensor events. The sample creates a named window and fills it with a large number of events. It then executes a large number of pre-compiled statements as well as fire-and-forget queries and reports times. Study this example if you are interested in named windows, Map event type representation, fire-and-forget queries as well as pre-defined statements via on-select, and the performance aspects.
Section 21.14, “Sample Virtual Data Window”	This example demonstrates the use of virtual data window to expose a (large) external data store, without any need to keep events in memory, and without sacrificing statement performance.
Section 21.15, “Sample Cycle Detection”	This example showcases the aggregation multi-function extension API for use with a cycle-detection problem detecting cycles in transactions between accounts.
Section 21.7, “OHLC Plug-In Data Window”	A plug-in custom data window addressing a problem in the financial space: Computes open-high-low-close bars for minute-intervals of events that may arrive late, based on each event's timestamp. A custom plug-in data window based on the extension API can be a convenient and reusable way to express a domain-specific analysis problem as a unit, and this example includes the code for the OHLC data window factory and window as well as simulator to test the window.
Section 22.3, “Using the Performance Kit”	A benchmark that is further described in the performance section of this document under performance kit.
Section 21.16, “Quality of Service”	This example develops some code for measuring quality-of-service levels such as for a service-level agreement (SLA). This example combines patterns with select-statements, shows the use of the timer 'at' operator and the followed-by operator ->, and uses the iterator API to poll for current results.
Section 21.10, “Assets Moving Across Zones - An RFID Example”	An example out of the RFID domain that processes location report events. The example includes a simple Swing-based GUI for visualization that allows moving tags from zone to zone visually. It also contains a comprehensive simulator to generate data for a large number of asset groups and their tracking. The example uses non-overlapping context to detect patterns in the aggregated data to determine when an asset group constraint is violated.
Section 21.4, “Runtime Configuration”	Example code to demonstrate various key compile-time and runtime actions such as adding event types on-the-fly, adding new variables, adding plug-in single-row and aggregation functions and adding variant streams.
Section 21.5, “JMS Server Shell and Client”	The server shell is a Java Messaging Service (JMS) -based server and client that sends and listens to messages on a JMS destination. It also demonstrates a simple Java Management Extension (JMX) MBean for remote statement management. A single statement computes an average duration for each IP address on a rolling time window and outputs a snapshot every 2 seconds.
Section 21.11, “StockTicker”	An example from the financial domain that features event patterns to filter stock tick events based on price and symbol. The example is designed to provide a high volume of events and includes multithreaded unit test code as well as a simulating data generator. The example uses overlapping context to find when price spikes happen based on price limit events received.
Section 21.9, “Self-Service Terminal”	A J2EE-based self-service terminal managing system in an airport that gets a lot of events from connected terminals. Contains a message-driven bean (EJB-MDB) for use in a J2EE container, a client and a simulator, as well as statements for detecting various conditions. A version that runs outside of a J2EE container is also available.
Section 21.17, “Trivia Geeks Club”	Trivia Geeks Club demonstrates EPL for a scoring system computing scores in a trivia game.

21.2. Running the Examples

To compile and run the samples, follow the instructions below:

Make sure Java 1.6 or greater is installed and the JAVA_HOME environment variable is set.
Open a console window and change directory to examples/example_name/etc.
Run "setenv.bat" (Windows) or "setenv.sh" (Unix) to verify your environment settings.
Run "compile.bat" (Windows) or "compile.sh" (Unix) to compile an example.
Now you are ready to run an example. Some examples require mandatory parameters that are also described in the file "readme.txt" in the "etc" folder.
Modify the logger logging level in the "log4j.xml" configuration file changing DEBUG to INFO on a class or package level to control the volume of text output.

Each example also provides Eclipse project .classpath and .project files. The Eclipse projects expect an esper_runtime user library that includes the runtime dependencies.

JUnit tests exist for the example code. The JUnit test source code for the examples can be found in each example's src/test folder. To build and run the example JUnit tests, use the Maven 2 goal test.

21.3. AutoID RFID Reader

In this example an array of RFID readers sense RFID tags as pallets are coming within the range of one of the readers. A reader generates XML documents with observation information such as reader sensor ID, observation time and tags observed. A statement computes the total number of tags per reader sensor ID within the last 60 seconds.

This example demonstrates how XML documents unmarshalled to org.w3c.dom.Node DOM document nodes can natively be processed by the runtime without requiring Java object event representations. The example uses an XPath expression for an event property counting the number of tags observed by a sensor. The XML documents follow the AutoID (http://www.autoid.org/) organization standard.
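To illustrate what an XPath expression that counts tags does, the following minimal sketch evaluates a count() expression against a DOM document using the standard javax.xml.xpath API. The element names (Sensor, Observation, Tag) and the class name TagCounter are assumptions made for illustration and do not reproduce the AutoID schema or the XPath configured by the example.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

class TagCounter {
    // Counts Tag elements in a sensor document.
    // Assumption: the element names are hypothetical, not the AutoID schema.
    static int countTags(String xml) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xml)));
            // XPath count() yields a number, which the API returns as a Double.
            Double count = (Double) XPathFactory.newInstance().newXPath()
                    .evaluate("count(/Sensor/Observation/Tag)", doc, XPathConstants.NUMBER);
            return count.intValue();
        } catch (Exception e) {
            throw new IllegalStateException("Cannot evaluate XPath", e);
        }
    }
}
```

For a document with two Tag elements under Observation, countTags returns 2. In the example itself the runtime evaluates the configured XPath expression when a statement accesses the corresponding event property.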

The classes for this example can be found in package com.espertech.esper.example.autoid. As events are XML documents with no Java object representation, the example does not have event classes.

A simulator that can be run from the command line is also available for this example. The simulator generates a number of XML documents as specified by a command line argument and prints out the totals per sensor. Run "run_autoid.bat" (Windows) or "run_autoid.sh" (Unix) to start the AutoID simulator. Please see the readme file in the same folder for build instructions and command line parameters.

The code snippet below shows the simple statement to compute the total number of tags per sensor. The statement is created by class com.espertech.esper.example.autoid.RFIDTagsPerSensorStmt.

select ID as sensorId, sum(countTags) as numTagsPerSensor
from AutoIdRFIDExample#time(60 seconds)
where Observation[0].Command = 'READ_PALLET_TAGS_ONLY'
group by ID

21.4. Runtime Configuration

This example demonstrates various key runtime configuration options such as adding event types on-the-fly, adding new variables, adding plug-in single-row and aggregation functions and adding variant streams.

The classes for this example live in package com.espertech.esper.example.runtimeconfig.

21.5. JMS Server Shell and Client

21.5.1. Overview

The server shell is a Java Messaging Service (JMS) -based server that listens to messages on a JMS destination, and sends the received events into the runtime. The example also demonstrates a Java Management Extension (JMX) MBean that allows remote dynamic statement management. This server has been designed to run with either Tibco (TM) Enterprise Messaging System (Tibco EMS), or with Apache ActiveMQ, controlled by a properties file.

The server shell has been created as an alternative to the EsperIO Spring JMSTemplate adapter. The server shell is a low-latency processor for byte messages. It employs JMS listeners to process messages in multiple threads; this model reduces thread context switching for many JMS providers. The server is configurable and has been tested with two JMS providers. It consists of only 10 classes and is thus easy to understand.

The server shell sample comes with a client (server shell client) that sends events into the JMS-based server, and that also creates a statement on the server remotely through a JMX MBean proxy class.

The server shell classes for this example live in package com.espertech.esper.example.servershell. Configure the server to point to your JMS provider by changing the properties in the file servershell_config.properties in the etc folder. Make sure your JMS provider (ActiveMQ or Tibco EMS) is running, then run "run_servershell.bat" (Windows) or "run_servershell.sh" (Unix) to start the JMS server.

Start the server shell process first before starting the client, since the client also demonstrates remote statement management through JMX by attaching to the server process.

The client classes to the server shell can be found in package com.espertech.esper.example.servershellclient. The client shares the same configuration file as the server shell. Run "run_servershellclient.bat" (Windows) or "run_servershellclient.sh" (Unix) to start the JMS producer client that includes a JMX client as well.

21.5.2. JMS Messages as Events

The server shell starts a configurable number of JMS MessageListener instances that listen to a given JMS destination. The listeners expect a BytesMessage that contains a String payload. The payload consists of an IP address and a double-typed duration value separated by a comma.

Each listener extracts the payload of a message, constructs an event object and sends the event into the shared runtime instance.
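The payload handling described above can be sketched as follows. This is a minimal illustration under stated assumptions: the class name DurationEvent, its field names and the UTF-8 encoding of the payload bytes are hypothetical and are not taken from the server shell sources.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical event class holding the two values carried by the payload.
class DurationEvent {
    final String ip;
    final double duration;

    DurationEvent(String ip, double duration) {
        this.ip = ip;
        this.duration = duration;
    }

    // Parses a payload of the form "<ip>,<duration>", for example "10.0.0.1,4.25".
    // Assumption: the bytes are UTF-8 encoded text.
    static DurationEvent parse(byte[] payload) {
        String text = new String(payload, StandardCharsets.UTF_8).trim();
        int comma = text.indexOf(',');
        if (comma < 0) {
            throw new IllegalArgumentException("Expected '<ip>,<duration>' but got: " + text);
        }
        String ip = text.substring(0, comma).trim();
        double duration = Double.parseDouble(text.substring(comma + 1).trim());
        return new DurationEvent(ip, duration);
    }
}
```

A listener would read the message bytes, call a method like parse and send the resulting object into the runtime.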

At startup time, the server creates a single statement with the runtime that prints out the average duration per IP address for the last 10 seconds of events, and that specifies an output rate of 2 seconds. By running the server and then the client, you can see the output of the averages every 2 seconds.

The server shell client acts as a JMS producer that sends 1000 events with random IP addresses and durations.

21.5.3. JMX for Remote Dynamic Statement Management

The server shell is also a JMX server providing an RMI-based connector. The server shell exposes a JMX MBean that allows remote statement management. The JMX MBean lets you create a statement remotely, attach a listener to the statement and undeploy a statement remotely.

The server shell client, upon startup, obtains a remote instance of the management MBean exposed by the server shell. It creates a statement through the MBean that filters out all durations greater than the value 9.9. After sending 1000 events, the client then undeploys the statement remotely on the server.

21.6. Market Data Feed Monitor

This example processes a raw market data feed. It reports throughput statistics and detects when the data rate of a feed falls off unexpectedly. A rate fall-off may mean that the data is stale and you want to alert when there is a possible problem with the feed.

The classes for this example live in package com.espertech.esper.example.marketdatafeed. Run "run_mktdatafeed.bat" (Windows) or "run_mktdatafeed.sh" (Unix) in the examples/etc folder to start the market data feed simulator.

21.6.1. Input Events

The input stream consists of 1 event stream that contains 2 simulated market data feeds. Each individual event in the stream indicates the feed that supplies the market data, the security symbol and some pricing information:

String symbol;
FeedEnum feed;
double bidPrice;
double askPrice;

21.6.2. Computing Rates per Feed

For throughput statistics and to detect rapid fall-off, the example calculates a ticks per second rate for each market data feed.

You can use a statement that specifies a data window onto the market data event stream that batches together 1 second of events. You specify the feed and a count of events per feed as output values. To make this data available for further processing, you insert output events into the TicksPerSecond event stream:

insert into TicksPerSecond
select feed, count(*) as cnt 
  from MarketDataEvent#time_batch(1 second) 
 group by feed

21.6.3. Detecting a Fall-Off

We define a rapid fall-off by alerting when the number of ticks per second for any second falls below 75% of the average number of ticks per second over the last 10 seconds.

We can compute the average number of ticks per second over the last 10 seconds simply by using the TicksPerSecond events computed by the prior statement and averaging the last 10 seconds. Next, the example compares the current rate with the moving average and reports any rates that fall below 75% of the average:

select feed, avg(cnt) as avgCnt, cnt as feedCnt 
  from TicksPerSecond#time(10 seconds)
 group by feed 
having cnt < avg(cnt) * 0.75
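The arithmetic of the having-clause can be checked in isolation. The sketch below is plain Java and not part of the example code; the class name FallOffCheck is hypothetical. It assumes the list holds the per-second counts of one feed that are currently in the 10-second window, oldest first, with the latest count included in the average as in the statement above.

```java
import java.util.List;

class FallOffCheck {
    // Returns true when the most recent per-second count is below 75% of the
    // average of all counts in the window (the window includes the latest count).
    static boolean isFallOff(List<Long> countsInWindow) {
        if (countsInWindow.isEmpty()) {
            return false;
        }
        double avg = countsInWindow.stream().mapToLong(Long::longValue).average().orElse(0);
        long latest = countsInWindow.get(countsInWindow.size() - 1);
        return latest < avg * 0.75;
    }
}
```

For nine seconds at 100 ticks followed by one second at 60 ticks, the average is 96 and the threshold is 72, so the check reports a fall-off. With 80 ticks in the last second the average is 98 and the threshold is 73.5, so it does not.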

21.6.4. Event generator

The simulator generates market data events for 2 feeds, feed A and feed B. The first parameter to the simulator is a number of threads. Each thread sends events for each feed in an endless loop. Note that as the Java VM garbage collection kicks in, the example generates rate drop-offs during such pauses.

The second parameter is a rate drop probability. It specifies the probability in percent that the simulator drops the rate for a randomly chosen feed to 60% of the target rate for that second. Thus rate fall-off alerts can be generated.

The third parameter defines the number of seconds to run the example.

21.7. OHLC Plug-In Data Window

This example contains a fully-functional custom data window based on the extension API that computes OHLC open-high-low-close bars for events that provide a long-typed timestamp and a double-typed value.

OHLC bars are a concept from the financial domain. The "Open" refers to the first datapoint and the "Close" to the last datapoint in an interval. The "High" refers to the maximum and the "Low" to the minimum value during each interval. The term "bar" describes the result of these 4 values for each interval.
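As an illustration of these four values, the following minimal sketch computes one bar from the values of a single interval. It is not the plug-in data window of the example: the class name OhlcBar is hypothetical, and the sketch ignores timestamps, interval boundaries and late events.

```java
import java.util.List;

class OhlcBar {
    final double open;
    final double high;
    final double low;
    final double close;

    OhlcBar(double open, double high, double low, double close) {
        this.open = open;
        this.high = high;
        this.low = low;
        this.close = close;
    }

    // Builds one bar from the values of a single interval, given in arrival order.
    static OhlcBar of(List<Double> values) {
        if (values.isEmpty()) {
            throw new IllegalArgumentException("No values in interval");
        }
        double high = values.get(0);
        double low = values.get(0);
        for (double v : values) {
            high = Math.max(high, v);
            low = Math.min(low, v);
        }
        // Open is the first value, close is the last value of the interval.
        return new OhlcBar(values.get(0), high, low, values.get(values.size() - 1));
    }
}
```

For the values 3, 5, 1, 4 arriving in that order, the bar has open 3, high 5, low 1 and close 4.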

The example provides an OHLC data window that is hardcoded to 1-minute bars. It considers the timestamp value carried by each event, and not the system time. The cutoff time after which an event is no longer considered for a bar is hardcoded to 5 seconds.

The window assumes that events arrive in timestamp order: Each event's timestamp value is equal to or higher than the timestamp value provided by the prior event.

The window may also be used together with #groupwin to group per criteria, such as symbol. In this case the assumption of timestamp order applies per symbol.

The window gracefully handles no-event and late-event scenarios. Interval boundaries are defined by system time, thus event timestamp and system time must roughly be in-sync, unless using external timer events.

21.8. Transaction 3-Event Challenge

The classes for this example live in package com.espertech.esper.example.transaction. Run "run_txnsim.bat" (Windows) or "run_txnsim.sh" (Unix) to start the transaction simulator. Please see the readme file in the same folder for build instructions and command line parameters.

21.8.1. The Events

The use case involves tracking three components of a transaction. It's important that the example uses at least three components, since some runtimes have different performance or coding for only two events per transaction. Each component comes to the runtime as an event with the following fields:

Transaction ID
Time stamp

In addition, the example has the following extra fields:

In event A:

Customer ID

In event C:

Supplier ID (the ID of the supplier that the order was filled through)

21.8.2. Combined Event

We need to take in events A, B and C and produce a single, combined event with the following fields:

Transaction ID
Customer ID
Time stamp from event A
Time stamp from event B
Time stamp from event C

What we're doing here is matching the transaction IDs on each event, to form an aggregate event. If all these events were in a relational database, this could be done as a simple SQL join… except that with 10,000 events per second, you will need some serious database hardware to do it.
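The matching by transaction ID can be pictured with a plain hash map keyed by transaction ID. The sketch below is a hand-written illustration, not the approach taken by the example, which uses statements for this purpose. The class and method names are hypothetical, and the sketch never expires incomplete transactions, which a real implementation would need to do.

```java
import java.util.HashMap;
import java.util.Map;

class TxnCombiner {
    // Combined event: one transaction with the time stamps of events A, B and C.
    static final class Combined {
        final String txnId;
        String customerId;
        Long timeA;
        Long timeB;
        Long timeC;

        Combined(String txnId) {
            this.txnId = txnId;
        }

        boolean isComplete() {
            return timeA != null && timeB != null && timeC != null;
        }
    }

    // Transactions for which at least one event is still missing.
    private final Map<String, Combined> pending = new HashMap<>();

    // Each on-method records one event and returns the combined event once A, B
    // and C have all arrived, or null while the transaction is incomplete.
    Combined onA(String txnId, long time, String customerId) {
        Combined c = pending.computeIfAbsent(txnId, Combined::new);
        c.timeA = time;
        c.customerId = customerId;
        return completeOrNull(c);
    }

    Combined onB(String txnId, long time) {
        Combined c = pending.computeIfAbsent(txnId, Combined::new);
        c.timeB = time;
        return completeOrNull(c);
    }

    Combined onC(String txnId, long time) {
        Combined c = pending.computeIfAbsent(txnId, Combined::new);
        c.timeC = time;
        return completeOrNull(c);
    }

    private Combined completeOrNull(Combined c) {
        if (!c.isComplete()) {
            return null;
        }
        pending.remove(c.txnId);
        return c;
    }
}
```

Because the map is keyed by transaction ID, the order in which A, B and C arrive does not matter, which is relevant since the generator delivers events out of order.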

21.8.3. Real-Time Summary Data

Further, the example produces the following:

Min, max and average total latency from the events (difference in time between A and C) over the past 30 minutes.
Min, max and average latency grouped by (a) customer ID and (b) supplier ID. In other words, metrics on the latency of the orders coming from each customer and going to each supplier.
Min, max and average latency between events A/B (time stamp of B minus A) and B/C (time stamp of C minus B).
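The grouped summary values can be illustrated with the JDK's LongSummaryStatistics. This is a minimal sketch in plain Java rather than the statements used by the example; the class names LatencyStats and Sample are hypothetical, and the caller is assumed to pass only the samples that fall within the time range of interest, such as the past 30 minutes.

```java
import java.util.List;
import java.util.LongSummaryStatistics;
import java.util.Map;
import java.util.stream.Collectors;

class LatencyStats {
    // One total latency (time stamp of C minus time stamp of A) for a customer.
    static final class Sample {
        final String customerId;
        final long latencyMillis;

        Sample(String customerId, long latencyMillis) {
            this.customerId = customerId;
            this.latencyMillis = latencyMillis;
        }
    }

    // Computes min, max and average latency per customer ID.
    static Map<String, LongSummaryStatistics> perCustomer(List<Sample> samples) {
        return samples.stream().collect(Collectors.groupingBy(
                (Sample s) -> s.customerId,
                Collectors.summarizingLong((Sample s) -> s.latencyMillis)));
    }
}
```

Grouping by supplier ID works the same way with a different key function.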

21.8.4. Find Problems

The example detects a transaction that did not make it through all three events. In other words, a transaction with events A or B, but not C. Note that, in this case, what you care about is event C. The lack of events A or B could indicate a failure in the event transport and should be ignored. Although the lack of an event C could also be a transport failure, it merits looking into.

21.8.5. Event Generator

To make testing easier and standardized, and to demonstrate how the example works, the example includes an event generator. The generator generates events for a given number of transactions, using the following rules:

One in 5,000 transactions will skip event A
One in 1,000 transactions will skip event B
One in 10,000 transactions will skip event C
Transaction identifiers are randomly generated
Customer and supplier identifiers are randomly chosen from two lists
The time stamp on each event is based on the system time. Between events A and B as well as B and C, between 0 and 999 milliseconds is added to the time. So, you have an expected time difference of around 500 milliseconds between each event
Events are randomly shuffled as described below

To make things harder, the example doesn't have transaction events coming in order. This code ensures that they come completely out of order. To do this, the example fills a bucket with events and, when the bucket is full, it shuffles it. The buckets are sized so that some transactions' events will be split between buckets. So, you have a fairly randomized flow of events, representing the worst case from a big, distributed infrastructure.
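The bucket shuffling described above can be sketched as follows. This is a minimal illustration and not the generator's code; the class and method names are hypothetical. Events are reordered only within their own bucket, so a transaction whose events straddle a bucket boundary is split across two shuffled buckets.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Random;

class BucketShuffler {
    // Splits events into consecutive buckets of bucketSize and shuffles each
    // bucket, so events are only reordered within their own bucket.
    static <T> List<T> shuffleInBuckets(List<T> events, int bucketSize, Random random) {
        if (bucketSize <= 0) {
            throw new IllegalArgumentException("bucketSize must be positive");
        }
        List<T> out = new ArrayList<>(events.size());
        for (int start = 0; start < events.size(); start += bucketSize) {
            int end = Math.min(start + bucketSize, events.size());
            List<T> bucket = new ArrayList<>(events.subList(start, end));
            Collections.shuffle(bucket, random);
            out.addAll(bucket);
        }
        return out;
    }
}
```

Passing a seeded Random makes a run repeatable, which helps when comparing results across test runs.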

The generator lets you change the size of the bucket (small, medium, large, larger, largerer). The larger the bucket size, the more events potentially come in between two events in a given transaction, and so the more structures such as buffers and hashes/indexes are put to the test.

21.9. Self-Service Terminal

The example is about a J2EE-based self-service terminal managing system in an airport that gets a lot of events from connected terminals. The event rate is around 500 events per second. Some events indicate abnormal situations such as 'paper low' or 'terminal out of order'. Other events observe activity as customers use a terminal to check in and print boarding tickets.

21.9.1. Events

Each self-service terminal can publish any of the 6 events below.

Checkin - Indicates a customer started a check-in dialog
Cancelled - Indicates a customer cancelled a check-in dialog
Completed - Indicates a customer completed a check-in dialog
OutOfOrder - Indicates the terminal detected a hardware problem
LowPaper - Indicates the terminal is low on paper
Status - Indicates terminal status, published every 1 minute regardless of activity as a terminal heartbeat

All events provide information about the terminal that published the event, and a timestamp. The terminal information is held in a property named "term" and provides a terminal id. Since all events carry similar information, the example models each event as a subtype of a base class BaseTerminalEvent, which provides the terminal information that all events share. This enables us to treat all terminal events polymorphically, that is, you can treat derived event types just like their parent event types. This helps simplify our statements.

All terminals publish Status events every 1 minute. In normal cases, the Status events indicate that a terminal is alive and online. The absence of status events may indicate that a terminal went offline for some reason and that may need to be investigated.

21.9.2. Detecting Customer Check-In Issues

A customer may be in the middle of a check-in when the terminal detects a hardware problem or when the network goes down. In that situation the example alerts a team member to help the customer. When the terminal detects a problem, it issues an OutOfOrder event. A pattern can find situations where the terminal indicates out-of-order and the customer is in the middle of the check-in process:

select * from pattern [ every a=Checkin -> 
      ( OutOfOrder(term.id=a.term.id) and not 
          (Cancelled(term.id=a.term.id) or Completed(term.id=a.term.id)) )]

21.9.3. Absence of Status Events

Since Status events arrive in regular intervals of 60 seconds, you can make use of temporal pattern matching using timer to find events that didn't arrive. You can use the every operator and timer:interval() to repeat an action every 60 seconds. Then you combine this with a not operator to check for absence of Status events. A 65 second interval during which you look for Status events allows 5 seconds to account for a possible delay in transmission or processing:

select 'terminal 1 is offline' from pattern 
  [every timer:interval(60 sec) -> (timer:interval(65 sec) and not Status(term.id = 'T1'))]
output first every 5 minutes

21.9.4. Activity Summary Data

By presenting statistical information about terminal activity to our staff in real-time you enable them to monitor the system and spot problems. The next example statement simply gives us a count per event type every 1 minute. You could further use this data, available through the CountPerType event stream, to join and compare against a recorded usage pattern, or to just summarize activity in real-time.

insert into CountPerType
select type, count(*) as countPerType 
from BaseTerminalEvent#time(10 minutes) 
group by type
output all every 1 minutes

21.9.5. Sample Application for J2EE Application Server

The example code in the distribution package implements a message-driven enterprise java bean (MDB EJB). The example uses an MDB as a convenient place for processing incoming events via a JMS message queue or topic. The example uses 2 JMS queues: One queue to receive events published by terminals, and a second queue to indicate situations detected via statement and listener back to a receiving process.

This example has been packaged for deployment into a JBoss Java application server (see http://www.jboss.org) with default deployment configuration. JBoss is an open-source application server available under LGPL license. Of course the choice of application server does not indicate a requirement or preference for the use of the compiler and/or runtime in a J2EE container. Other quality J2EE application servers are available and perhaps more suitable to run this example or a similar application.

The complete example code can be found in the "examples/terminalsvc" folder of the distribution. The standalone version that does not require a J2EE container is in "examples/terminalsvc-jse".

21.9.5.1. Running the Example

The pre-built EAR file contains the MDB for deployment to a JBoss application server with default deployment options. The JBoss default configuration provides 2 queues that this example utilizes: queue/A and queue/B. The queue/B is used to send events into the MDB, while queue/A is used to indicate back any data received by listeners to statements.

The application can be deployed by copying the ear file in the "examples/terminalsvc/terminalsvc-ear" folder to your JBoss deployment directory located under the JBoss home directory under "standalone/deployments".

The example contains an event simulator and an event receiver that can be invoked from the command line. See the readme file and the start scripts for Windows and Unix in the "examples/terminalsvc/etc" folder, and the documentation set for further information on the simulator.

21.9.5.2. Building the Example

This example requires Maven 2 to build. To build the example, change directory to the folder "examples/terminalsvc" and type "mvn package". The instructions have been tested with JBoss AS 7.1.1 and Maven 3.0.4.

The Maven build packages the EAR file for deployment to a JBoss application server with default deployment options.

21.9.5.3. Running the Event Simulator and Receiver

The example also contains an event simulator that generates meaningful events. The simulator can be run from the directory "examples/terminalsvc/etc" via the command "run_terminalsvc_sender.bat" (Windows) and "run_terminalsvc_sender.sh" (Linux). The event simulator generates a batch of at least 200 events every 1 second. Randomly, with a chance of 1 in 10 for each batch of events, the simulator generates either an OutOfOrder or a LowPaper event for a random terminal. Each batch the simulator generates 100 random terminal ids and generates a Checkin event for each. It then generates either a Cancelled or a Completed event for each. With a chance of 1 in 1000, it generates an OutOfOrder event instead of the Cancelled or Completed event for a terminal.
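The batch logic described above can be sketched in plain Java as follows. This is a minimal model and not the simulator code shipped in the distribution: the class names, the terminal id range of 1000 and the even split between Cancelled and Completed (and between OutOfOrder and LowPaper) are assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

/** Hypothetical model of one simulator batch; not the code shipped in the distribution. */
class TerminalBatchSketch {

    /** Minimal stand-in for a terminal event: an event type name plus a terminal id. */
    static final class TerminalEvent {
        final String type;
        final String terminalId;

        TerminalEvent(String type, String terminalId) {
            this.type = type;
            this.terminalId = terminalId;
        }
    }

    static List<TerminalEvent> nextBatch(Random random) {
        List<TerminalEvent> batch = new ArrayList<>();

        // With a chance of 1 in 10 per batch: one OutOfOrder or LowPaper event for a random terminal.
        if (random.nextInt(10) == 0) {
            String type = random.nextBoolean() ? "OutOfOrder" : "LowPaper";
            batch.add(new TerminalEvent(type, randomTerminalId(random)));
        }

        // 100 random terminal ids, each receiving a Checkin event ...
        List<String> ids = new ArrayList<>();
        for (int i = 0; i < 100; i++) {
            String id = randomTerminalId(random);
            ids.add(id);
            batch.add(new TerminalEvent("Checkin", id));
        }

        // ... followed by Cancelled or Completed, or OutOfOrder with a chance of 1 in 1000.
        for (String id : ids) {
            String type;
            if (random.nextInt(1000) == 0) {
                type = "OutOfOrder";
            } else {
                type = random.nextBoolean() ? "Cancelled" : "Completed";
            }
            batch.add(new TerminalEvent(type, id));
        }
        return batch;
    }

    // The id range of 1000 terminals is an assumption made for this sketch.
    private static String randomTerminalId(Random random) {
        return "T" + random.nextInt(1000);
    }

    public static void main(String[] args) {
        List<TerminalEvent> batch = nextBatch(new Random());
        System.out.println("Batch size: " + batch.size());
    }
}
```

Each call produces 200 events, or 201 when the 1-in-10 extra event is generated, which matches the "at least 200 events" per batch stated above.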

The event receiver listens to the MDB's outgoing queue for alerts and prints them to the console. The receiver can be run from the directory "examples/terminalsvc/etc" via the command "run_terminalsvc_receiver.bat" (Windows) and "run_terminalsvc_receiver.sh" (Linux). Before running, copy the jboss-client.jar file from your JBoss AS installation bin directory to the "terminalsvc/lib" folder.

The receiver and sender code use "guest" as user and "pass" as password. Add the "guest" user using the JBoss "add-user" script and assign the role "guest". Your JBoss server may need to start with "-c standalone-full.xml" to have the messaging subsystem available.

Add queue configurations to the messaging subsystem configuration as follows:

<jms-queue name="queue_a">
  <entry name="queue_a"/>
  <entry name="java:jboss/exported/jms/queue/queue_a"/>
</jms-queue>
<jms-queue name="queue_b">
  <entry name="queue_b"/>
  <entry name="java:jboss/exported/jms/queue/queue_b"/>
</jms-queue>

Disable persistence in the messaging subsystem for this example so it is not running out of disk space:

<persistence-enabled>false</persistence-enabled>

21.10. Assets Moving Across Zones - An RFID Example

This example from the RFID domain processes location report events. Each location report event indicates an asset id and the current zone of the asset. The example raises an alert when a given set of assets does not move together from zone to zone.

Each asset group is tracked by 2 statements. The two statements to track a single asset group consisting of assets identified by asset ids {1, 2, 3} are as follows:

insert into CountZone_G1
select 1 as groupId, zone, count(*) as cnt
from LocationReport(assetId in (1, 2, 3))#unique(assetId)
group by zone

select Part.zone from pattern [
  every Part=CountZone_G1(cnt in (1,2)) ->
    (timer:interval(10 sec)  and not CountZone_G1(zone=Part.zone, cnt in (0,3)))]

The classes for this example can be found in package com.espertech.esper.example.rfid.

This example provides a Swing-based GUI that can be run from the command line. The GUI allows drag-and-drop of three RFID tags that form one asset group from zone to zone. Each time you move an asset across the screen the example sends an event into the runtime indicating the asset id and current zone. The example detects if within 10 seconds the three assets do not join each other within the same zone, but stay split across zones. Run "run_rfid_swing.bat" (Windows) or "run_rfid_swing.sh" (Unix) to start the example's Swing GUI.
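To make the counting logic of the first statement more concrete, here is a minimal plain-Java model of it. It is a sketch and not Esper API: the class and method names are hypothetical, and the 10-second timer of the pattern is deliberately left out. The map plays the role of #unique(assetId), and the per-zone count corresponds to count(*) grouped by zone.

```java
import java.util.HashMap;
import java.util.Map;

/** Plain-Java model of the per-zone count kept for one asset group; not Esper API. */
class ZoneCountSketch {
    // Latest zone per asset id, the role played by #unique(assetId).
    private final Map<Integer, String> zoneByAsset = new HashMap<>();

    void report(int assetId, String zone) {
        zoneByAsset.put(assetId, zone);
    }

    /** Number of assets whose latest report places them in each zone (group by zone). */
    Map<String, Integer> countPerZone() {
        Map<String, Integer> counts = new HashMap<>();
        for (String zone : zoneByAsset.values()) {
            counts.merge(zone, 1, Integer::sum);
        }
        return counts;
    }

    /** True when some zone holds only 1 or 2 of the 3 assets, which means the group is split. */
    boolean isSplit() {
        for (int cnt : countPerZone().values()) {
            if (cnt == 1 || cnt == 2) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        ZoneCountSketch group = new ZoneCountSketch();
        group.report(1, "A");
        group.report(2, "A");
        group.report(3, "A");
        System.out.println(group.isSplit());   // prints false: all three assets are in zone A
        group.report(3, "B");
        System.out.println(group.isSplit());   // prints true: asset 3 moved on alone
    }
}
```

In the EPL version the pattern additionally waits 10 seconds for the group to rejoin before it reports the split.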

The example also provides a simulator that can be run from the command line. The simulator generates a number of asset groups as specified by a command line argument and starts a number of threads as specified by a command line argument to send location report events into the runtime. Run "run_rfid_sim.bat" (Windows) or "run_rfid_sim.sh" (Unix) to start the RFID location report event simulator. Please see the readme file in the same folder for build instructions and command line parameters.

21.11. StockTicker

The StockTicker example comes from the stock trading domain. The example creates event patterns to filter stock tick events based on price and symbol. When a stock tick event is encountered that falls outside the lower or upper price limit, the example simply displays that stock tick event. The price range itself is dynamically created and changed. This is accomplished by an overlapping context that uses price limit events to determine how to look for price spikes.

The classes StockTick and PriceLimit represent our events. The event patterns are created by the class StockTickerEPLUtil.

Summary:

Good example to learn the API and get started with contexts and patterns.
When price limit events arrive allocates patterns that find the price spike.
Simple, highly-performant filter expressions for event properties in the stock tick event such as symbol and price.

21.12. MatchMaker

In the MatchMaker example every mobile user has an X and Y location, a set of properties (gender, hair color, age range) and a set of preferences (one for each property) to match. The task of the event patterns created by this example is to detect mobile users that are within proximity given a certain range, and for which the properties match preferences.

The event class representing mobile users is MobileUserBean. The MatchMakerEPL class contains the patterns for detecting matches.

Summary:

Uses overlapping context to find matching mobile user events
Uses range matching for X and Y properties of mobile user events

21.13. Named Window Query

This example handles very minimal temperature sensor events which are represented by java.util.Map. It creates a named window and fills it with a large number of events. It then executes a large number of pre-defined statements via on-select as well as performs a large number of fire-and-forget queries against the named window, and reports execution times.

21.14. Sample Virtual Data Window

Virtual data windows are an extension API used to integrate external stores and expose the data therein as a named window.

See the virtualdw folder for example code, compile and run scripts.

21.15. Sample Cycle Detection

The example is also discussed in the section on extension APIs specifically the aggregation multi-function development. The example uses the jgrapht library for a cycle-detection problem detecting cycles in transactions between accounts.

See the examples/cycledetect folder for example code, compile and run scripts.

21.16. Quality of Service

This example develops some code for measuring quality-of-service levels such as for a service-level agreement (SLA). An SLA is a contract between 2 parties that defines service constraints such as maximum latency for service operations or error rates.

The example measures and monitors operation latency and error counts per customer and operation. When one of our operations oversteps these constraints, you want to be alerted right away. Additionally, you would like to have some monitoring in place that checks the health of our service and provides some information on how the operations are used.

Some of the constraints you need to check are:

That the latency (time to finish) of some of the operations is always less than X seconds.
That the latency average is always less than Y seconds over Z operation invocations.
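As an illustration of what these two constraints mean, the following plain-Java sketch checks both for a stream of latency measurements. It is not the code of the example monitors: the class name and the constructor parameters (standing for X, Y and Z) are hypothetical, and the average is only checked once Z invocations have been recorded.

```java
import java.util.ArrayDeque;
import java.util.Deque;

/** Plain-Java model of the two latency constraints; the limits are hypothetical parameters. */
class LatencyConstraintSketch {
    private final double maxLatencySec;      // X: limit for a single invocation
    private final double maxAvgLatencySec;   // Y: limit for the average
    private final int windowSize;            // Z: number of invocations averaged
    private final Deque<Double> window = new ArrayDeque<>();
    private double sum;

    LatencyConstraintSketch(double maxLatencySec, double maxAvgLatencySec, int windowSize) {
        this.maxLatencySec = maxLatencySec;
        this.maxAvgLatencySec = maxAvgLatencySec;
        this.windowSize = windowSize;
    }

    /** Records one invocation and returns true if either constraint is violated. */
    boolean record(double latencySec) {
        window.addLast(latencySec);
        sum += latencySec;
        if (window.size() > windowSize) {
            sum -= window.removeFirst();   // keep only the last Z invocations
        }
        boolean spike = latencySec >= maxLatencySec;
        boolean averageTooHigh = window.size() == windowSize
                && sum / windowSize >= maxAvgLatencySec;
        return spike || averageTooHigh;
    }

    public static void main(String[] args) {
        // X = 5 seconds, Y = 2 seconds, Z = 3 invocations
        LatencyConstraintSketch check = new LatencyConstraintSketch(5, 2, 3);
        double[] latencies = {1, 1, 1, 4, 6};
        for (double latency : latencies) {
            System.out.println(latency + " sec -> violated: " + check.record(latency));
        }
    }
}
```

In the example itself these checks are expressed as statements and patterns over OperationMeasurement events rather than as hand-written Java.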

The com.espertech.esper.example.qos_sla.events.OperationMeasurement event class with its latency and status properties is the main event used for the SLA analysis. The other event LatencyLimit serves to set latency limits on the fly.

The com.espertech.esper.example.qos_sla.monitor.AverageLatencyMonitor creates a statement that computes latency statistics per customer and operation for the last 100 events. The DynaLatencySpikeMonitor uses an event pattern to listen to spikes in latency with dynamically set limits. The ErrorRateMonitor uses the timer 'at' operator in an event pattern that wakes up periodically and polls the error rate within the last 10 minutes. The ServiceHealthMonitor simply alerts when 3 errors occur, and the SpikeAndErrorMonitor alerts when a fixed latency is overstepped or an error status is reported.

Summary:

This example combines event patterns with statements for event stream analysis.
Shows the use of the timer 'at' operator and followed-by operator -> in event patterns.
Outlines basic statements.
Shows how to pull data out of statements rather than subscribing to events a statement publishes.

21.17. Trivia Geeks Club

This example was developed for the DEBS 2011 conference and demonstrates how scoring rules for a trivia game can be implemented in EPL.

The module that implements all scoring rules is located in the etc folder in file trivia.epl. The EPL alone is sufficient to run the solution; no custom functions are required.

The trivia geeks club rules (the requirements) are provided in the etc folder in file trivia_scoring_requirements.htm.

The implementation provided tests the questions, answers and scoring according to the data provided in trivia_test_questions_small.htm and trivia_test_questions_large.htm.

Chapter 22. Performance

22.1. Performance Results

22.2. Performance Tips

22.2.1. Understand How to Tune Your Java Virtual Machine
22.2.2. Input and Output Bottlenecks
22.2.3. Threading
22.2.4. Select the Underlying Event Rather Than Individual Fields
22.2.5. Prefer Stream-Level Filtering Over Where-Clause Filtering
22.2.6. Reduce the Use of Arithmetic in Expressions
22.2.7. Remove Unnecessary Constructs
22.2.8. End Pattern Sub-Expressions
22.2.9. Consider Using EventPropertyGetter for Fast Access to Event Properties
22.2.10. Consider Casting the Underlying Event
22.2.11. Turn Off Logging and Audit
22.2.12. Tune or Disable Delivery Order Guarantees
22.2.13. Use a Subscriber Object to Receive Events
22.2.14. Consider Data Flows
22.2.15. High-Arrival-Rate Streams and Single Statements
22.2.16. Subqueries Versus Joins and Where-Clause and Data Windows
22.2.17. Patterns and Pattern Sub-Expression Instances
22.2.18. Pattern Sub-Expression Instance Versus Data Window Use
22.2.19. The Keep-All Data Window
22.2.20. Statement Design for Reduced Memory Consumption - Diagnosing OutOfMemoryError
22.2.21. Performance, JVM, OS and Hardware
22.2.22. Consider Using Hints
22.2.23. Optimizing Stream Filter Expressions
22.2.24. Statement and Runtime Metric Reporting
22.2.25. Expression Evaluation Order and Early Exit
22.2.26. Large Number of Threads
22.2.27. Filter Evaluation Tuning
22.2.28. Context Partition Related Information
22.2.29. Prefer Constant Variables Over Non-Constant Variables
22.2.30. Prefer Object-Array Events
22.2.31. Composite or Compound Keys
22.2.32. Notes on Query Planning
22.2.33. Query Planning Expression Analysis Hints
22.2.34. Query Planning Index Hints
22.2.35. Measuring Throughput
22.2.36. Do Not Create the Same or Similar Statement X Times
22.2.37. Comparing Single-Threaded and Multi-Threaded Performance
22.2.38. Incremental Versus Recomputed Aggregation for Named Window Events
22.2.39. When Does Memory Get Released
22.2.40. Measure throughput of non-matches as well as matches

22.3. Using the Performance Kit

22.3.1. How to Use the Performance Kit

This section describes performance best practices and explains how to assess runtime performance by using our provided performance kit.

22.1. Performance Results

For a complete understanding of those results, consult the next sections.

Esper exceeds 500 000 events/s on dual-CPU 2GHz Intel-based hardware,
with runtime latency below 3 microseconds average (below 10us with more than
99% predictability) on a VWAP benchmark with 1000 statements registered in the system
- this tops at 70 Mbit/s at 85% CPU usage.

Esper also demonstrates linear scalability from 100 000 to 500 000 events/s on this
hardware, with consistent results across different statements.

Other tests demonstrate equivalent performance results
(straight through processing, match all, match none, no statement registered,
VWAP with time based window or length based windows).

Tests on a laptop demonstrated about 5 times lower performance - that is
between 70 000 events/s and 200 000 events/s - which still leaves room for easy
testing on a small configuration.

22.2. Performance Tips

22.2.1. Understand How to Tune Your Java Virtual Machine

The compiler and runtime run on a JVM and you need to be familiar with JVM tuning. Key parameters to consider include minimum and maximum heap memory and nursery heap sizes. Statements with time-based or length-based data windows can consume large amounts of memory as their size or length can be large.

For time-based data windows, one needs to be aware that the memory consumed depends on the actual event stream input throughput. Event pattern instances also consume memory, especially when using the "every" keyword in patterns to repeat pattern sub-expressions - which again will depend on the actual event stream input throughput.

22.2.2. Input and Output Bottlenecks

Your application receives output events from statements through the UpdateListener interface or via the strongly-typed subscriber POJO object. Such output events are delivered by the application or timer thread(s) that sends an input event into the runtime instance.

The processing of output events that your listener or subscriber performs temporarily blocks the thread until the processing completes, and may thus reduce throughput. It can therefore be beneficial for your application to process output events asynchronously and not block the runtime while an output event is being processed by your listener, especially if your listener code performs blocking IO operations.

For example, your application may want to send output events to a JMS destination or write output event data to a relational database. For optimal throughput, consider performing such blocking operations in a separate thread.
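One possible way to move blocking work off the sending thread is sketched below. It is a minimal illustration using only java.util.concurrent: the class name, the writeToStore method and the use of a single-threaded executor are assumptions for this sketch, and the update(Object[]) method stands in for whatever listener or subscriber method your application uses.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

/** Sketch of a subscriber that hands rows to a worker thread; the class name is hypothetical. */
class AsyncSubscriberSketch {
    // A single worker thread keeps the rows in arrival order.
    private final ExecutorService worker = Executors.newSingleThreadExecutor();
    private final AtomicInteger written = new AtomicInteger();

    /** Called on the thread that sent the input event; returns without waiting for the write. */
    public void update(Object[] row) {
        worker.execute(() -> writeToStore(row));
    }

    // Stand-in for a blocking operation such as a JMS send or a database insert.
    private void writeToStore(Object[] row) {
        written.incrementAndGet();
    }

    int getWritten() {
        return written.get();
    }

    /** Stops accepting rows and waits for the queued rows to be written. */
    boolean shutdown(long timeout, TimeUnit unit) {
        worker.shutdown();
        try {
            return worker.awaitTermination(timeout, unit);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
    }

    public static void main(String[] args) {
        AsyncSubscriberSketch subscriber = new AsyncSubscriberSketch();
        for (int i = 0; i < 1000; i++) {
            subscriber.update(new Object[] {i});
        }
        subscriber.shutdown(10, TimeUnit.SECONDS);
        System.out.println("Rows written: " + subscriber.getWritten());
    }
}
```

Note that the executor created here uses an unbounded queue, so a slow store lets the backlog grow without limit. A bounded queue with a rejection policy may be a better fit when output can fall behind for long periods.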

Additionally, when reading input events from a store or network in a performance test, you may find that the runtime processes events faster than you are able to feed events into the runtime. In such a case you may want to consider an in-memory driver for use in performance testing. Also consider decoupling your read operation from the event processing operation (sendEvent method) by having multiple readers or by pre-fetching your data from the store.

22.2.3. Threading

We recommend using multiple threads to send events into the runtime. There is a test class below. Our test class does not use a blocking queue and thread pool so as to avoid a point of contention.

Sample code for testing performance with multiple threads is provided below:

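import java.util.Random;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

import com.espertech.esper.common.client.EPCompiled;
import com.espertech.esper.common.client.configuration.Configuration;
import com.espertech.esper.compiler.client.CompilerArguments;
import com.espertech.esper.compiler.client.EPCompileException;
import com.espertech.esper.compiler.client.EPCompilerProvider;
import com.espertech.esper.runtime.client.*;

public class SampleClassThreading {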

    public static void main(String[] args) throws InterruptedException {

        int numEvents = 1000000;
        int numThreads = 3;

        Configuration config = new Configuration();
        config.getRuntime().getThreading().setListenerDispatchPreserveOrder(false);
        config.getRuntime().getThreading().setInternalTimerEnabled(false);   // remove thread that handles time advancing
        config.getCommon().addEventType(MyEvent.class);

        String epl = "create context MyContext coalesce by consistent_hash_crc32(id) " +
                     "from MyEvent granularity 64 preallocate;\n" +
                     "@name('result') context MyContext select count(*) from MyEvent group by id;\n";
        EPCompiled compiled;
        try {
            compiled = EPCompilerProvider.getCompiler().compile(epl, new CompilerArguments(config));
        }
        catch (EPCompileException ex) {
            throw new RuntimeException(ex.getMessage(), ex);
        }
                
        EPRuntime runtime = EPRuntimeProvider.getDefaultRuntime(config);
        EPDeployment deployment;
        try {
            deployment = runtime.getDeploymentService().deploy(compiled);
        }
        catch (EPDeployException ex) {
            throw new RuntimeException(ex.getMessage(), ex);
        }
        EPStatement stmt = runtime.getDeploymentService().getStatement(deployment.getDeploymentId(), "result");
        stmt.setSubscriber(new MySubscriber());

        Thread[] threads = new Thread[numThreads];
        CountDownLatch latch = new CountDownLatch(numThreads);

        int eventsPerThreads = numEvents / numThreads;
        for (int i = 0; i < numThreads; i++) {
            threads[i] = new Thread(
              new MyRunnable(latch, eventsPerThreads, runtime.getEventService()));
        }
        long startTime = System.currentTimeMillis();
        for (int i = 0; i < numThreads; i++) {
            threads[i].start();
        }

        latch.await(10, TimeUnit.MINUTES);
        if (latch.getCount() > 0) {
            throw new RuntimeException("Failed to complete in 10 minutes");
        }
        long delta = System.currentTimeMillis() - startTime;
        System.out.println("Took " + delta + " millis");
    }

    public static class MySubscriber {
        public void update(Object[] args) {
        }
    }

    public static class MyRunnable implements Runnable {
        private final CountDownLatch latch;
        private final int numEvents;
        private final EPEventService eventService;

        public MyRunnable(CountDownLatch latch, int numEvents, EPEventService eventService) {
            this.latch = latch;
            this.numEvents = numEvents;
            this.eventService = eventService;
        }

        public void run() {
            Random r = new Random();
            for (int i = 0; i < numEvents; i++) {
                eventService.sendEventBean(new MyEvent(r.nextInt(512)), "MyEvent");
            }
            latch.countDown();
        }
    }

    public static class MyEvent {
        private final int id;

        public MyEvent(int id) {
            this.id = id;
        }

        public int getId() {
            return id;
        }
    }
}

We recommend using Java threads as above, or a blocking queue and thread pool that calls one of the sendEvent methods (such as sendEventBean), or alternatively configuring inbound threading if your application does not already employ threading. The runtime provides the configuration option to use runtime-level queues and thread pools for inbound, outbound and internal executions. See Section 15.8.1, “Advanced Threading” for more information.

We recommend the outbound threading if your listeners are blocking. For outbound threading also see the section below on tuning and disabling listener delivery guarantees.

If enabling advanced threading options keep in mind that the runtime will maintain a queue and thread pool. There is additional overhead associated with entering work units into the queue, maintaining the queue and the hand-off between threads. The Java blocking queues are not necessarily fast on all JVMs. It is not necessarily true that your application will perform better with any of the advanced threading options.

We found scalability to be better on Linux systems when running Java with -server and pinning threads to exclusive CPUs, after making sure those CPUs are available on your system.

We recommend looking at LMAX Disruptor, an inter-thread messaging library, for setting up processing stages. Disruptor, however, is reportedly less suitable for setting up a worker pool.

22.2.3.1. Thread Pool Pattern

The sample code below may help you get started setting up a thread pool of workers with back pressure and consideration for IO threads and clean shutdown.

The sample code starts by setting up a thread factory:

private static class RuntimeThreadFactory implements ThreadFactory {
  private AtomicInteger id = new AtomicInteger(0);

  public Thread newThread(Runnable r) {
    Thread t = new Thread(r, "Event Runtime Thread #" + id.incrementAndGet());
    t.setDaemon(true);
    t.setPriority(Thread.NORM_PRIORITY);
    return t;
  }
}

The sample uses a fixed-size array blocking queue. To handle the situation where the queue is full and accepts no more messages, it uses a rejection handler that counts the number of rejections and retries:

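private class RuntimeRejectionHandler implements RejectedExecutionHandler {
  // AtomicLong (java.util.concurrent.atomic) since several IO threads may be rejected concurrently
  private final AtomicLong spinCount = new AtomicLong();
  
  public long getSpinCount() {
    return spinCount.get();
  }

  public void rejectedExecution(Runnable r, ThreadPoolExecutor executor) {
    spinCount.incrementAndGet();

    try {
      boolean isAccepted = false;
      while (!isAccepted) {
        // wait up to 120 microseconds for queue space, then retry
        isAccepted = executorQueue.offer(r, 120, TimeUnit.MICROSECONDS);
      }
    }
    catch (InterruptedException e) {
      Thread.currentThread().interrupt();  // preserve the interrupt status for the caller
      log.warn("could not queue work entry");
    }
  }
}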

The Runnable that submits an event for processing could look like this:

class Holder implements Runnable {
  private final Object lm;  // the event object to send

  Holder(Object lm) {
    this.lm = lm;
  }

  public void run() {
    // do any stuff needed to "prepare" event which doesn't involve IO
    runtime.getEventService().sendEventBean(lm, "LMEventType");
  }
}

Initialize the queue and worker pool as follows:

  private final static int CAPACITY = 10000;
  private final static int THREAD_COUNT = 4;

  private static EPRuntime runtime;
  private ThreadFactory threadFactory = new RuntimeThreadFactory();
  private RuntimeRejectionHandler rejectionHandler = new RuntimeRejectionHandler();
  private BlockingQueue<Runnable> executorQueue;
  private ThreadPoolExecutor executor;

  public void start() {
    executorQueue = new ArrayBlockingQueue<Runnable>(CAPACITY);
    executor = new ThreadPoolExecutor(THREAD_COUNT, THREAD_COUNT, 0, TimeUnit.SECONDS,
    executorQueue, threadFactory, rejectionHandler);
    executor.allowCoreThreadTimeOut(false);
    while (executor.getPoolSize() < executor.getCorePoolSize()) {
      executor.prestartCoreThread();
    }
  }

To shut down cleanly before destroying the runtime, use code such as the following:

  executor.shutdown();
  while (!executor.isTerminated()) {
    Thread.sleep(100);
  }

The next sample code goes into the IO or input thread(s) such as NIO mapped file, file channel, socket channel, or zmq / nanomsg etc., and submits a work unit to the queue:

  while (programAlive) {
    // deserialize event to POJO, Map, Array, etc.,
    // pass along an event type name when needed
    executor.execute(new Holder(myeventobject));
  }

You could periodically dump the spinCount variable to get an idea of how often the queue fills up. You can tune the size of the executor's pool and the timeout used inside the rejectedExecution method until you 1) get stable performance at the highest level (determined by the optimal number of threads in the pool) and 2) avoid wasting CPU in the IO thread(s) (determined by the optimal wait time between attempts to re-queue rejected events to the thread pool).

22.2.4. Select the Underlying Event Rather Than Individual Fields

By selecting the underlying event in the select-clause you can reduce load on the runtime, since the runtime does not need to generate a new output event for each input event.

For example, the following statement returns the underlying event to update listeners:

// Better performance
select * from RFIDEvent

In comparison, the next statement selects individual properties. This statement requires the runtime to generate an output event that contains exactly the required properties:

// Less good performance
select assetId, zone, xlocation, ylocation from RFIDEvent

22.2.5. Prefer Stream-Level Filtering Over Where-Clause Filtering

The runtime's stream-level filtering is very well optimized, while filtering via the where-clause after any data windows is not optimized.

The same is true for named windows. If your application is only interested in a subset of named window data and such filters are not correlated to arriving events, place the filters into parentheses after the named window name.

22.2.5.1. Examples Without Named Windows

Consider the example below, which performs stream-level filtering:

// Better performance : stream-level filtering
select * from MarketData(ticker = 'GOOG')

The example below is the equivalent (same semantics) statement and performs post-data-window filtering without a data window. The compiler does not optimize statements that filter in the where-clause for the reason that data windows are generally present.

// Less good performance : post-data-window filtering
select * from MarketData where ticker = 'GOOG'

Thus this optimization technique applies to statements without any data window.

When a data window is used, the semantics change. Let's look at an example to better understand the difference: In the next statement only GOOG market events enter the length window:

select avg(price) from MarketData(ticker = 'GOOG')#length(100)

The above statement computes the average price of GOOG market data events for the last 100 GOOG market data events.

Compare the filter position to a filter in the where clause. The following statement is NOT equivalent as all events enter the data window (not just GOOG events):

select avg(price) from MarketData#length(100) where ticker = 'GOOG'

The statement above computes the average price of all market data events for the last 100 market data events, and outputs results only for GOOG.
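The difference in which events each data window retains can be illustrated with a small plain-Java model. This is a sketch and not Esper API: the class and method names are hypothetical, and a length window of 2 is used instead of 100 to keep the example short. It models only the window contents, not the aggregation.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

/** Plain-Java model of a length window fed with and without a stream-level filter; not Esper API. */
class FilterPositionSketch {

    /** Returns the tickers retained by a length window of the given size. */
    static List<String> windowContents(String[] tickers, int size, boolean filterBeforeWindow) {
        Deque<String> window = new ArrayDeque<>();
        for (String ticker : tickers) {
            // A stream-level filter discards non-GOOG events before they reach the window.
            if (filterBeforeWindow && !ticker.equals("GOOG")) {
                continue;
            }
            window.addLast(ticker);
            if (window.size() > size) {
                window.removeFirst();   // the oldest event leaves the window
            }
        }
        return new ArrayList<>(window);
    }

    public static void main(String[] args) {
        String[] arrivals = {"GOOG", "GOOG", "IBM", "MSFT"};
        System.out.println(windowContents(arrivals, 2, true));    // prints [GOOG, GOOG]
        System.out.println(windowContents(arrivals, 2, false));   // prints [IBM, MSFT]
    }
}
```

With the stream-level filter the window always holds the most recent GOOG events. Without it, events for other tickers push GOOG events out of the window, so in this run the second window no longer holds any GOOG event at all.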

22.2.5.2. Examples Using Named Windows

The next two example statements put the account number filter criteria directly into parentheses following the named window name:

// Better performance : stream-level filtering
select * from WithdrawalNamedWindow(accountNumber = '123')

// Better performance : example with subquery
select *, (select * from LoginSucceededWindow(accountNumber = '123'))
from WithdrawalNamedWindow(accountNumber = '123')

22.2.5.3. Common Computations in Where-Clauses

If you have a number of statements performing a given computation on incoming events, consider moving the computation from the where-clause to a plug-in user-defined function that is listed as part of stream-level filter criteria. The compiler optimizes evaluation of user-defined functions in filters such that an incoming event can undergo the computation just once even in the presence of N statements.

// Prefer stream-level filtering with a user-defined function
select * from MarketData(vstCompare(*))

// Less preferable when there are N similar statements: 
// Move the computation in the where-clause to the "vstCompare" function.
select * from MarketData where (VST * RT) - (VST / RT) > 1
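A function such as vstCompare could be implemented as a static Java method along the following lines. This is a sketch under assumptions: the MarketData class shown here, its vst and rt properties and the VstFunctions class name are hypothetical stand-ins, and the step of registering the method as a plug-in single-row function is not shown.

```java
/** Hypothetical holder class for the user-defined function; names are for illustration only. */
class VstFunctions {

    /** Hypothetical event class with the two properties used in the where-clause above. */
    static final class MarketData {
        private final double vst;
        private final double rt;

        MarketData(double vst, double rt) {
            this.vst = vst;
            this.rt = rt;
        }

        public double getVst() {
            return vst;
        }

        public double getRt() {
            return rt;
        }
    }

    /** Same computation as the where-clause: (VST * RT) - (VST / RT) > 1. */
    public static boolean vstCompare(MarketData event) {
        double vst = event.getVst();
        double rt = event.getRt();
        return (vst * rt) - (vst / rt) > 1;
    }

    public static void main(String[] args) {
        System.out.println(vstCompare(new MarketData(2, 2)));   // prints true:  4 - 1 = 3
        System.out.println(vstCompare(new MarketData(1, 1)));   // prints false: 1 - 1 = 0
    }
}
```

In a real application the class and method would likely need to be public so that the compiler can resolve them.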

22.2.6. Reduce the Use of Arithmetic in Expressions

The compiler and runtime do not yet pre-evaluate arithmetic expressions that produce constant results. However, since the compiler generates byte code, JVM byte code optimization takes place and may pre-evaluate certain expressions.

Therefore, a filter expression as below is optimized:

// Better performance : no arithmetic
select * from MarketData(price>40)

While the compiler cannot currently optimize this expression:

// Less good performance : with arithmetic
select * from MarketData(price+10>50)

22.2.7. Remove Unnecessary Constructs

If your statement uses order by to order output events, consider removing order by unless your application does indeed require the events it receives to be ordered.

If your statement specifies group by but does not use aggregation functions, consider removing group by.

If your statement specifies group by but the filter criteria only allows one group, consider removing group by:

// Prefer:
select * from MarketData(symbol = 'GE') having sum(price) > 1000

// Don't use this since the filter specifies a single symbol:
select * from MarketData(symbol = 'GE') group by symbol having sum(price) > 1000

If your statement specifies the grouped data window #groupwin but the window being grouped retains the same set of events regardless of grouping, remove #groupwin, for example:

// Prefer:
create window MarketDataWindow#keepall as MarketDataEventType

// Don't use this, since keeping all events 
// or keeping all events per symbol is the same thing:
create window MarketDataWindow#groupwin(symbol)#keepall as MarketDataEventType

// Don't use this, since keeping the last 1-minute of events 
// or keeping 1-minute of events per symbol is the same thing:
create window MarketDataWindow#groupwin(symbol)#time(1 min) as MarketDataEventType

It is not necessary to specify a data window for each stream.

// Prefer:
select * from MarketDataWindow

// Don't have a data window if just listening to events, prefer the above
select * from MarketDataWindow#lastevent

If your statement specifies a unique data window but the filter criteria allow only one value for the unique key, consider removing the unique data window:

// Prefer:
select * from MarketDataWindow(symbol = 'GE')#lastevent

// Don't have a unique-key data window if your filter specifies a single value
select * from MarketDataWindow(symbol = 'GE')#unique(symbol)

22.2.8. End Pattern Sub-Expressions

In patterns, the every keyword in conjunction with followed by (->) starts a new sub-expression per match.

For example, the following pattern starts a sub-expression looking for a B event for every A event that arrives.

every A -> B

Determine under what conditions a subexpression should end so the runtime can stop looking for a B event. Here are a few generic examples:

every A -> (B and not C)
every A -> B where timer:within(1 sec)
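To see why ending sub-expressions matters, consider the following plain-Java model of the number of sub-expressions that are waiting for a B event. It is a simplified sketch and not Esper API: the class name is hypothetical, time is passed in explicitly as milliseconds, and a limit of zero or less stands for a pattern without timer:within.

```java
import java.util.ArrayDeque;
import java.util.Deque;

/** Plain-Java model of pending "every A -> B" sub-expressions; not Esper API. */
class PendingSubExpressionSketch {
    private final long withinMillis;   // a value <= 0 models a pattern with no time limit
    private final Deque<Long> pendingStartTimes = new ArrayDeque<>();

    PendingSubExpressionSketch(long withinMillis) {
        this.withinMillis = withinMillis;
    }

    /** Each A event starts one sub-expression that waits for a B event. */
    void onA(long nowMillis) {
        expire(nowMillis);
        pendingStartTimes.addLast(nowMillis);
    }

    /** A B event completes every sub-expression that is still waiting; returns the match count. */
    int onB(long nowMillis) {
        expire(nowMillis);
        int matches = pendingStartTimes.size();
        pendingStartTimes.clear();
        return matches;
    }

    /** Number of sub-expressions still waiting for a B event at the given time. */
    int pending(long nowMillis) {
        expire(nowMillis);
        return pendingStartTimes.size();
    }

    // Ends sub-expressions whose time limit has passed, if a limit is configured.
    private void expire(long nowMillis) {
        if (withinMillis <= 0) {
            return;
        }
        while (!pendingStartTimes.isEmpty()
                && nowMillis - pendingStartTimes.peekFirst() >= withinMillis) {
            pendingStartTimes.removeFirst();
        }
    }

    public static void main(String[] args) {
        PendingSubExpressionSketch unbounded = new PendingSubExpressionSketch(0);
        PendingSubExpressionSketch bounded = new PendingSubExpressionSketch(1000);
        for (long t = 0; t < 10_000; t += 100) {   // 100 A events, no B event
            unbounded.onA(t);
            bounded.onA(t);
        }
        System.out.println("without limit: " + unbounded.pending(10_000));
        System.out.println("within 1 sec:  " + bounded.pending(10_000));
    }
}
```

Without a limit, the number of waiting sub-expressions grows with every A event until a B event arrives. With the 1-second limit, only the A events of the last second remain.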

22.2.9. Consider Using EventPropertyGetter for Fast Access to Event Properties

The EventPropertyGetter interface is useful for obtaining an event property value without property name table lookup given an EventBean instance that is of the same event type that the property getter was obtained from.

When compiling a statement, the EPStatement instance lets us know the EventType via the getEventType() method. From the EventType you can obtain EventPropertyGetter instances for named event properties.

To demonstrate, consider the following simple statement:

select symbol, avg(price) from Market group by symbol

After compiling and deploying the module, obtain the EventType and pass the type to the listener:

EPStatement stmt = runtime.getDeploymentService().getStatement(deploymentId, statementName);
MyGetterUpdateListener listener = new MyGetterUpdateListener(stmt.getEventType());

The listener can use the type to obtain fast getters for property values of events for the same type:

public class MyGetterUpdateListener implements StatementAwareUpdateListener {
    private final EventPropertyGetter symbolGetter;
    private final EventPropertyGetter avgPriceGetter;

    public MyGetterUpdateListener(EventType eventType) {
        symbolGetter = eventType.getGetter("symbol");
        avgPriceGetter = eventType.getGetter("avg(price)");
    }

Last, the update method can invoke the getters to obtain event property values:

    public void update(EventBean[] eventBeans, EventBean[] oldBeans, EPStatement epStatement, EPRuntime runtime) {
        String symbol = (String) symbolGetter.get(eventBeans[0]);
        double avgPrice = (Double) avgPriceGetter.get(eventBeans[0]);
        // some more logic here
    }

22.2.10. Consider Casting the Underlying Event

When an application requires the value of most or all event properties, it can often be best to simply select the underlying event via wildcard and cast the received events.

Let's look at the sample statement:

select * from MarketData(symbol regexp 'E[a-z]')

An update listener to the statement may want to cast the received events to the expected underlying event class:

    public void update(EventBean[] eventBeans, EventBean[] oldBeans, EPStatement epStatement, EPRuntime runtime) {
        MarketData md = (MarketData) eventBeans[0].getUnderlying();
        // some more logic here
    }

22.2.11. Turn Off Logging and Audit

Even if you don't have a log4j configuration file in place, the runtime will make sure to minimize execution path logging overhead. For prior versions, and to reduce logging overhead overall, we recommend the "WARN" log level or the "INFO" log level.

Please see the log4j configuration file in "etc/infoonly_log4j.xml" for example log4j settings.

EPL provides the @Audit annotation for statements. For performance testing and production deployment, we recommend removing @Audit.

22.2.12. Tune or Disable Delivery Order Guarantees

If your application is not a multithreaded application, or your application is not sensitive to the order in which result events are delivered to your application listeners, then consider disabling the guarantees the runtime makes towards ordered delivery of results to listeners:

Configuration config = new Configuration();
config.getRuntime().getThreading().setListenerDispatchPreserveOrder(false);

If your application is not a multithreaded application, or your application uses the insert into clause to make results of one statement available for further consuming statements but does not require ordered delivery of results from producing statements to consuming statements, you may disable delivery order guarantees between statements:

Configuration config = new Configuration();
config.getRuntime().getThreading().setInsertIntoDispatchPreserveOrder(false);

If your application declares only stateless statements then the settings described herein are not relevant.

Additional configuration options are available and described in the configuration section that specify timeout values and spin or thread context switching.

The runtime logs the following informational message when guaranteed delivery order to listeners is enabled and spin lock times exceed the default or configured timeout: Spin wait timeout exceeded in listener dispatch. The respective message for delivery from insert into statements to consuming statements is Spin wait timeout exceeded in insert-into dispatch.

If your application sees messages that spin lock times are exceeded, your application has several options: First, disabling preserve order is an option. Second, ensure your listener does not perform (long-running) blocking operations before returning, for example by performing output event processing in a separate thread. Third, change the timeout value to a larger number to block longer without logging the message.
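The second option, handing output event processing to a separate thread, could look roughly like the following plain-Java sketch. It deliberately avoids Esper types: the class name AsyncHandoffSketch, the onResult method and the Consumer-based handler are hypothetical names chosen for illustration, and the listener's update method would call onResult for each result it receives.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.function.Consumer;

/** Plain-Java sketch (hypothetical names): pass results to a worker thread so the callback returns quickly. */
final class AsyncHandoffSketch implements AutoCloseable {
    // A single worker thread processes results in the order they were submitted.
    private final ExecutorService worker = Executors.newSingleThreadExecutor();
    private final Consumer<Object> slowHandler;

    AsyncHandoffSketch(Consumer<Object> slowHandler) {
        this.slowHandler = slowHandler;
    }

    /** Call this from the listener callback; it only enqueues the result and returns. */
    void onResult(Object result) {
        worker.execute(() -> slowHandler.accept(result));
    }

    /** Stops accepting work and waits up to five seconds for queued results to drain. */
    @Override
    public void close() {
        worker.shutdown();
        try {
            worker.awaitTermination(5, TimeUnit.SECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt(); // preserve the interrupt for the caller
        }
    }
}
```

A single-thread executor keeps the original result order; a thread pool would give more throughput at the cost of ordering.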

22.2.13. Use a Subscriber Object to Receive Events

The subscriber object is a technique to receive result data that has performance advantages over the UpdateListener interface. Please refer to Section 15.5.2, “Setting a Subscriber Object”.

22.2.14. Consider Data Flows

Data flows offer a high-performance means to execute EPL select statements and use other built-in data flow operators. The data flow Emitter operator allows sending underlying event objects directly into a data flow. The runtime therefore does not need to wrap each underlying event into an EventBean instance and does not need to match events to statements. Instead, the underlying event applies directly to only that data flow instance that your application submits the event to, and no other statements or data flows see the same event.

Data flows are described in Chapter 19, EPL Reference: Data Flow.

22.2.15. High-Arrival-Rate Streams and Single Statements

A context partition is associated with context partition state that consists of current aggregation values, partial pattern matches and data windows, depending on whether your statement uses such constructs. When a runtime receives events it updates context partition state under locking such that context partition state remains consistent under concurrent multi-threaded access.

For high-volume streams, the locking required to protect context partition state may slow down or introduce blocking for very high arrival rates of events that apply to the very same context partition and its state.

Your first choice should be to utilize a context that allows for multiple context partitions, such as the hash segmented context. The hash segmented context usually performs better compared to the keyed segmented context since in the keyed segmented context the runtime must check whether a partition exists or must be created for a given key.

Your second choice is to split the statement into multiple statements that each perform part of the intended function or that each look for a certain subset of the high-arrival-rate stream. There is very little cost in terms of memory or CPU resources per statement, and the runtime can usually handle a large number of statements as efficiently as single statements.

For example, consider the following statement:

// less effective in a highly threaded environment 
select venue, ccyPair, side, sum(qty)
from CumulativePrice
where side='O'
group by venue, ccyPair, side

The runtime protects state of each context partition by a separate lock for each context partition, as discussed in the API section. In highly threaded applications threads may block on a specific context partition. You would therefore want to use multiple context partitions.

Consider creating either a hash segmented context or a keyed segmented context. In the hash segmented context incoming data is simply assigned to one of the buckets using a small computation. In the keyed segmented context the runtime must check keys to see if a partition already exists or whether a new partition must be allocated. We'll discuss both below. For both types of context, since locking is on the level of context partition, the locks taken by the runtime are very fine grained allowing for highly concurrent processing.

This sample EPL declares a hash segmented context. In a hash segmented context the runtime can pre-allocate context partitions and therefore does not need to check whether a partition exists already. In a hash segmented context the runtime simply assigns events to context partitions based on result of a hash function and modulo operation.

create context MyContext coalesce by consistent_hash_crc32(venue) from CumulativePrice(side='O') granularity 16 preallocate
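To make the "hash function and modulo operation" concrete, the following plain-Java sketch maps a venue value to one of 16 buckets using the JDK's CRC-32 implementation. This is an illustration of the general technique only: the class and method names are hypothetical, and it is an assumption, not a statement from this document, that the runtime's internal computation resembles it.

```java
import java.nio.charset.StandardCharsets;
import java.util.zip.CRC32;

/** Plain-Java sketch of hash-and-modulo bucket assignment (hypothetical, not Esper's internal code). */
final class HashBucketSketch {

    /** Maps a key to one of {@code granularity} buckets using CRC-32 and modulo. */
    static int bucketFor(String key, int granularity) {
        CRC32 crc = new CRC32();
        crc.update(key.getBytes(StandardCharsets.UTF_8));
        // getValue() returns the unsigned 32-bit checksum as a non-negative long,
        // so the modulo result is always in the range 0..granularity-1.
        return (int) (crc.getValue() % granularity);
    }

    public static void main(String[] args) {
        for (String venue : new String[] {"NYSE", "LSE", "TSE"}) {
            System.out.println(venue + " -> bucket " + bucketFor(venue, 16));
        }
    }
}
```

Because the bucket is computed directly from the key, no lookup is needed to decide whether a partition already exists, which is the advantage the text attributes to the hash segmented context.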

This sample EPL declares a keyed segmented context. The keyed segmented context instructs the runtime to employ a context partition per venue, ccyPair, side key combination. The runtime must check for each event whether a partition exists for that combination of venue, ccyPair and side:

create context MyContext partition by venue, ccyPair, side from CumulativePrice(side='O')

After declaring the context using create context, make sure all your statements, including those statements that create named windows and tables, specify that context. This is done by prefixing each statement with context context_name .....

The new statement that refers to the context as created above is below. Note the context MyContext which tells the runtime that this statement executes context partitioned. This must be provided otherwise the statement does not execute context partitioned.

context MyContext select venue, ccyPair, side, sum(qty) from CumulativePrice

For testing purposes or if your application controls concurrency, you may disable context partition locking, see Section 16.6.10.3, “Disable Locking”.

22.2.16. Subqueries Versus Joins and Where-Clause and Data Windows

When joining streams the runtime builds a product of the joined data windows based on the where clause. It analyzes the where clause at time of statement compilation and builds the appropriate indexes and query strategy. Avoid using expressions in the join where clause that require evaluation, such as user-defined functions or arithmetic expressions.

When joining streams and not providing a where clause, consider using the #unique data window or #lastevent data window to join only the last event or the last event per unique key(s) of each stream.

The sample statement below can produce up to 5,000 rows when both data windows are filled and an event arrives for either stream:

// Avoid joins between streams with data windows without where-clause
select * from StreamA#length(100), StreamB#length(50)

Consider using a subquery, consider using separate statements with insert-into and consider providing a where clause to limit the product of rows.

The examples below show different approaches, which are not semantically equivalent, assuming that a MyEvent event type is defined with the properties symbol and value:

// Replace the following statement as it may not perform well
select a.symbol, avg(a.value), avg(b.value) 
from MyEvent#length(100) a, MyEvent#length(50) b

// Join with where-clause
select a.symbol, avg(a.value), avg(b.value) 
from MyEvent#length(100) a, MyEvent#length(50) b 
where a.symbol = b.symbol

// Unidirectional join with where-clause
select a.symbol, avg(b.value) 
from MyEvent a unidirectional, MyEvent#length(50) b 
where a.symbol = b.symbol

// Subquery
select 
  (select avg(value) from MyEvent#length(100)) as avgA, 
  (select avg(value) from MyEvent#length(50)) as avgB,
  a.symbol
from MyEvent a

// Since streams cost almost nothing, use insert-into to populate and a unidirectional join 
insert into StreamAvgA select symbol, avg(value) as avgA from MyEvent#length(100)
insert into StreamAvgB select symbol, avg(value) as avgB from MyEvent#length(50)
select a.symbol, avgA, avgB from StreamAvgA a unidirectional, StreamAvgB#unique(symbol) b
where a.symbol = b.symbol

A join is multidirectionally evaluated: when an event of any of the streams participating in the join arrives, the join gets evaluated, unless using the unidirectional keyword. Consider using a subquery instead when evaluation only needs to take place when a certain event arrives:

// Rewrite this join since you don't need to join when a LoginSucceededWindow arrives
// Also rewrite because the account number always is the value 123.
select * from LoginSucceededWindow as l, WithdrawalWindow as w
where w.accountNumber = '123' and w.accountNumber = l.accountNumber

// Rewritten as a subquery
select *, (select * from LoginSucceededWindow where accountNumber='123') 
from WithdrawalWindow(accountNumber='123') as w

22.2.17. Patterns and Pattern Sub-Expression Instances

The every and repeat operators in patterns control the number of sub-expressions that are active. Each sub-expression can consume memory as it may retain, depending on the use of tags in the pattern, the matching events. A large number of active sub-expressions can reduce performance or lead to out-of-memory errors.

During the design of the pattern statement consider the use of timer:within to reduce the amount of time a sub-expression lives, or consider the not operator to end a sub-expression.

The examples herein assume an AEvent and a BEvent event type that have an id property that may correlate between arriving events of the two event types.

In the following sample pattern the runtime starts, for each arriving AEvent, a new pattern sub-expression looking for a matching BEvent. Since the AEvent is tagged with a, the runtime retains each AEvent until a match is found for delivery to listeners or subscribers:

every a=AEvent -> b=BEvent(b.id = a.id)

One way to end a sub-expression is to attach a time limit for how long it may be active.

The next statement ends sub-expressions looking for a matching BEvent 10 seconds after arrival of the AEvent event that started the sub-expression:

every a=AEvent -> (b=BEvent(b.id = a.id) where timer:within(10 sec))

A second way to end a sub-expression is to use the not operator. You can use the not operator together with the and operator to end a sub-expression when a certain event arrives.

The next statement ends sub-expressions looking for a matching BEvent when, in the order of arrival, the next BEvent that arrives after the AEvent event that started the sub-expression does not match the id of the AEvent:

every a=AEvent -> (b=BEvent(b.id = a.id) and not BEvent(id != a.id))

The every-distinct operator can be used to keep one sub-expression alive per one or more keys. The next pattern demonstrates an alternative to every-distinct. It ends sub-expressions looking for a matching BEvent when an AEvent arrives that matches the id of the AEvent that started the sub-expression:

every a=AEvent -> (b=BEvent(b.id = a.id) and not AEvent(id = a.id))

22.2.18. Pattern Sub-Expression Instance Versus Data Window Use

For some use cases you can either specify one or more data windows as the solution, or you can specify a pattern that also solves your use case.

For patterns, you should understand that the runtime employs a dynamic state machine. For data windows, the runtime employs a delta network and collections. Generally you may find that patterns that require a large number of sub-expression instances consume more memory and more CPU than data windows.

For example, consider the following statement that filters out duplicate transaction ids that occur within 20 seconds of each other:

select * from TxnEvent#firstunique(transactionId)#time(20 sec)

You could also address this solution using a pattern:

select * from pattern [every-distinct(a.transactionId) a=TxnEvent where timer:within(20 sec)]

If you have a fairly large number of different transaction ids to track, you may find that the pattern performs less well than the data window solution, as the pattern asks the runtime to manage a pattern sub-expression per transaction id. The data window solution asks the runtime to manage expiry, which can give better performance in many cases.

22.2.19. The Keep-All Data Window

The #keepall data window is a data window that retains all arriving events. The data window can be useful during the development phase and to implement a custom expiry policy using on-delete and named windows. Take care, however, to remove events from the keep-all data window in a timely manner. Use on-select or fire-and-forget queries to count the number of rows currently held by a named window with keep-all expiry policy.

22.2.20. Statement Design for Reduced Memory Consumption - Diagnosing OutOfMemoryError

This section describes common sources of out-of-memory problems.

If using the keep-all data window please consider the information above. If using pattern statements please consider pattern sub-expression instantiation and lifetime as discussed prior to this section.

When using the group-by clause or #groupwin grouped data windows please consider the hints as described below. Make sure your grouping criteria are fields that don't have an unlimited number of possible values or specify hints otherwise.

The #unique data window can also be a source of error. If your uniqueness criteria include a field whose values never repeat, the memory use of the data window can grow unless your application deletes events.

When using the every-distinct pattern construct parameterized by distinct value expressions that generate an unlimited number of distinct values, consider specifying a time period as part of the parameters to indicate to the runtime how long a distinct value should be considered.

In a match-recognize pattern consider limiting the number of optional events if optional events are part of the data reported in the measures clause. Also when using the partition clause, if your partitioning criteria include a field which is never unique the memory use of the match-recognize runtime can grow.

A further source of memory use is when your application deploys modules but fails to undeploy modules when they are no longer needed.

In your application design you may also want to be conscious when the application listener or subscriber objects retain output data.

A runtime, uniquely identified by a runtime URI, is a relatively heavyweight object. Optimally your application allocates fewer than one thousand (1000) runtime instances per JVM. A statement instance is associated with one runtime instance, is uniquely identified by a statement name and is a medium-weight object. We have seen applications allocate 100,000 statements easily. A statement's context partition instance is associated with one statement, is uniquely identified by a context partition id and is a lightweight object. We have seen applications allocate 5000 context partitions for each of 100 statements easily, i.e. 500,000 context partitions. An aggregation row, data window row, pattern etc. is associated with a statement context partition and is itself a very lightweight object.

The prev, prevwindow and prevtail functions access a data window directly. The runtime does not need to maintain a separate data structure and grouping is based on the use of the #groupwin grouped data window. Compare this to the use of event aggregation functions such as first, window and last which group according to the group by clause. If your statement utilizes both together consider reformulating to use prev instead.

22.2.21. Performance, JVM, OS and Hardware

Performance will also depend on your JVM (Sun HotSpot, BEA JRockit, IBM J9), your operating system and your hardware. A JVM performance index such as specJBB at spec.org can be used. For memory-intensive statements, you may want to consider a 64-bit architecture that can address more than 2GB or 3GB of memory, although a 64-bit JVM usually comes with a slight performance penalty due to more complex pointer address management.

The choice of JVM, OS and hardware depends on a number of factors and therefore a definite suggestion is hard to make. The choice depends on the number of statements and the number of threads. A larger number of threads would benefit from more CPUs and cores. If you have very low latency requirements, you should consider getting more GHz per core, and possibly a soft real-time JVM to enforce GC determinism at the JVM level, or even consider dedicated hardware such as Azul. If your statements utilize large data windows, more RAM and heap space will be utilized, hence you should clearly plan and account for that and possibly consider 64-bit architectures or consider EsperHA.

The number and type of statements is a factor that cannot be generically accounted for. The benchmark kit can help test out some requirements and establish baselines, and for more complex use cases a simulation or proof of concept would certainly work best. EsperTech's experts can be available to help write interfaces in a consulting relationship.

22.2.22. Consider Using Hints

The @Hint annotation provides a single keyword or a comma-separated list of keywords that provide instructions to the compiler and runtime towards statement execution that affect runtime performance and memory-use of statements. Also see Section 5.2.7.9, “@Hint”.

The query planning in general is described in Section 22.2.32, “Notes on Query Planning”.

The hint for influencing query planning expression analysis is described at Section 22.2.33, “Query Planning Expression Analysis Hints”.

The hint for influencing query planning index choice is described at Section 22.2.34, “Query Planning Index Hints”.

Further hints, also related to query planning, for use with joins, outer joins, unidirectional joins, relational and non-relational joins are described in Section 5.12.6, “Hints Related to Joins”.

The hint for use with group by to specify how state for groups is reclaimed is described in Section 5.6.2.1, “Hints Pertaining to Group-By” and Section 13.3.15, “Grouped Data Window (groupwin or std:groupwin)”.

The hint for use with group by to specify aggregation state reclaim for unbound streams and timestamp groups is described in Section 5.6.2.1, “Hints Pertaining to Group-By”.

The hint for use with match_recognize to specify iterate-only is described in Section 8.4.7, “Eliminating Duplicate Matches”.

To tune subquery performance when your subquery selects from a named window, consider the hints discussed in Section 5.11.8, “Hints Related to Subqueries”.

The @NoLock hint to remove context partition locking (also read caution note) is described at Section 15.8, “Runtime Threading and Concurrency”.

The hint to control expansion of filter expressions is described at Section 16.5.8.1, “Filter Service Max Filter Width”.

22.2.23. Optimizing Stream Filter Expressions

Assume your statement invokes a static method in the stream filter as the below statement shows as an example:

select * from MyEvent(MyHelperLibrary.filter(field1, field2, field3, field4*field5))

As a result of starting above statement, the runtime must evaluate each MyEvent event invoking the MyHelperLibrary.filter method and passing certain event properties. The same applies to pattern filters that specify functions to evaluate.

If possible, consider moving some of the checking performed by the function back into the filter, or consider splitting the function into two parts joined by an and conjunction. In general for all expressions, the runtime evaluates expressions left of the and first and can skip evaluation of the further expressions in the conjunction when the first expression returns false. In addition the compiler can determine filter index fields and the runtime can build a filter index for fields provided in stream or pattern filters.

For example, the below statement could be faster to evaluate:

select * from MyEvent(field1="value" and 
  MyHelperLibrary.filter(field1, field2, field3, field4*field5))

22.2.24. Statement and Runtime Metric Reporting

You can use statement and runtime metric reporting as described in Section 15.12, “Runtime and Statement Metrics Reporting” to monitor performance or identify slow statements.

22.2.25. Expression Evaluation Order and Early Exit

The term "early exit" or "short-circuit evaluation" refers to when the runtime can evaluate an expression without a complete evaluation of all sub-expressions.

Consider an expression such as follows:

where expr1 and expr2 and expr3

If expr1 is false the runtime does not need to evaluate expr2 and expr3. Therefore when using the AND logical operator consider reordering expressions placing the most-selective expression first and less selective expressions thereafter.

The same is true for the OR logical operator: If expr1 is true the runtime does not need to evaluate expr2 and expr3. Therefore when using the OR logical operator consider reordering expressions placing the least-selective expression first and more selective expressions thereafter.

The order of expressions (here: expr1, expr2 and expr3) does not make a difference for the join and subquery query planner.

Note that the runtime does not guarantee short-circuit evaluation in all cases. The runtime may rewrite the where-clause or filter conditions into another order of evaluation so that it can perform index lookups.
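The effect of expression order on the amount of work can be illustrated outside of EPL. The following plain-Java sketch counts how many predicate evaluations a two-part and condition needs for each ordering, using Java's own short-circuiting && operator as a stand-in. The class name, the two predicates and their selectivities are hypothetical and chosen only for illustration.

```java
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.IntPredicate;

/** Plain-Java analogy for early exit: counts predicate calls for "first and second". */
final class ShortCircuitSketch {

    /** Wraps a predicate so that each evaluation increments the counter. */
    static IntPredicate counting(IntPredicate p, AtomicInteger counter) {
        return v -> {
            counter.incrementAndGet();
            return p.test(v);
        };
    }

    /** Evaluates "first and second" for the values 0..n-1 and returns the number of predicate calls. */
    static int evaluations(IntPredicate first, IntPredicate second, int n) {
        AtomicInteger calls = new AtomicInteger();
        IntPredicate a = counting(first, calls);
        IntPredicate b = counting(second, calls);
        for (int v = 0; v < n; v++) {
            boolean ignored = a.test(v) && b.test(v); // && skips b when a is false
        }
        return calls.get();
    }

    public static void main(String[] args) {
        IntPredicate selective = v -> v % 100 == 0; // true for 1 in 100 values
        IntPredicate broad = v -> v % 2 == 0;       // true for 1 in 2 values
        System.out.println(evaluations(selective, broad, 1000)); // prints 1010
        System.out.println(evaluations(broad, selective, 1000)); // prints 1500
    }
}
```

With the more selective predicate first, the second predicate runs for only 10 of the 1000 values; with the order reversed it runs for 500.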

22.2.26. Large Number of Threads

When using a large number of threads with the runtime, such as more than 100 threads, you can provide a setting in the configuration that instructs the runtime to reduce the use of thread-local variables. Please see Section 16.6.10, “Runtime Settings Related to Execution of Statements” for more information.

22.2.27. Filter Evaluation Tuning

We offer a switch for tuning evaluation of incoming events against filters. Please see Section 16.6.10, “Runtime Settings Related to Execution of Statements” for more information.

22.2.28. Context Partition Related Information

As the runtime locks on the level of context partition, high concurrency under threading can be achieved by using context partitions.

Generally context partitions require more memory than the more fine-grained grouping that can be achieved by group by or #groupwin.

22.2.29. Prefer Constant Variables Over Non-Constant Variables

The create-variable syntax as well as the APIs can identify a variable as a constant value. When a variable's value is not intended to change it is best to declare the variable as constant.

For example, consider the following two statements that each declares a variable. The first statement declares a constant variable and the second statement declares a non-constant variable:

// declare a constant variable
create constant variable CONST_DEPARTMENT = 'PURCHASING'

// declare a non-constant variable
create variable VAR_DEPARTMENT = 'SALES'

When your application compiles a statement that has filters for events according to variable values, the compiler internally inspects such expressions and performs filter optimizations for constant variables that are more effective in evaluation.

For example, consider the following two statements that each look for events related to persons that belong to a given department:

// prefer the constant
select * from PersonEvent(department=CONST_DEPARTMENT)

// less efficient
select * from PersonEvent(department=VAR_DEPARTMENT)

The runtime can more efficiently evaluate the expression using a variable declared as constant. The same observation can be made for subquery and join query planning.

22.2.30. Prefer Object-Array Events

Object-array events offer the best read access performance for access to event property values. In addition, object-array events use much less memory than Map-type events. They also offer the best write access performance.

A comparison of different event representations is in Section 3.5, “Comparing Event Representations”.

First, we recommend that your application sends object-array events into the runtime, instead of Map-type events. See Appendix F, Event Representation: Object-Array (Object[]) Events for more information.

Second, we recommend that your application sets the compiler configuration of the default event representation to object array, as described in Section 16.4.8.1, “Default Event Representation”. Alternatively you can use the @EventRepresentation(objectarray) annotation with individual statements.

22.2.31. Composite or Compound Keys

If your uniqueness, grouping, sorting or partitioning keys are composite keys or compound keys, this section may apply. A composite key is a key that consists of 2 or more properties or expressions.

In the example below the firstName and lastName expressions are part of a composite key:

... group by firstName, lastName
...#unique(firstName, lastName)...
... order by firstName, lastName

Note

The example above is not a comprehensive discussion where composite or compound keys may be used in EPL. Other places where composite keys may apply are patterns, partitioned contexts and grouped data windows (we may have missed one).

Your application could change the EPL to instead refer to a single value fullName:

... group by fullName
...#unique(fullName)...
... order by fullName

The advantage in using a single expression as the uniqueness, grouping and sorting key is that the runtime does not need to compute multiple expressions and retain a separate data structure in memory that represents the composite key, resulting in reduced memory use and increased throughput.
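As a rough analogy in plain Java, the sketch below groups the same people once by a composite key and once by a single precomputed key. It is not Esper code: the class and method names are hypothetical, and the two-element list merely stands in for whatever structure a runtime would need to represent a composite key.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Plain-Java analogy (hypothetical names): composite grouping key versus a single key. */
final class GroupingKeySketch {

    /** Groups by (firstName, lastName): a separate key object is built and hashed per row. */
    static Map<List<String>, Integer> countByComposite(String[][] people) {
        Map<List<String>, Integer> counts = new HashMap<>();
        for (String[] p : people) {
            counts.merge(List.of(p[0], p[1]), 1, Integer::sum);
        }
        return counts;
    }

    /** Groups by a single precomputed fullName: the existing string is the key. */
    static Map<String, Integer> countBySingle(String[] fullNames) {
        Map<String, Integer> counts = new HashMap<>();
        for (String name : fullNames) {
            counts.merge(name, 1, Integer::sum);
        }
        return counts;
    }

    public static void main(String[] args) {
        String[][] people = {{"Ann", "Lee"}, {"Bob", "Ray"}, {"Ann", "Lee"}};
        String[] fullNames = {"Ann Lee", "Bob Ray", "Ann Lee"};
        System.out.println(countByComposite(people).get(List.of("Ann", "Lee"))); // prints 2
        System.out.println(countBySingle(fullNames).get("Ann Lee"));             // prints 2
    }
}
```

Both variants produce the same counts; the single-key variant avoids allocating an extra key object for every row.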

22.2.32. Notes on Query Planning

Query planning takes place for subqueries, joins (any type), named window and table on-actions (on-select, on-merge, on-insert, on-update, on-delete) and fire-and-forget queries. Query planning affects query execution speed. Enable query plan logging to output query plan information.

For query planning, the compiler draws information from:

The where-clauses, if any are specified. Where-clauses correlate streams, patterns, named windows, tables etc. with more streams, patterns, tables and named windows and are thus the main source of information for query planning.
The data window(s) declared on streams and named windows. The #unique and the #firstunique data window instruct the compiler to retain the last event per unique criteria.
For named windows and tables, the explicit indexes created via create unique index or create index.
For named windows (and not tables), the previously created implicit indexes. The compiler can plan to create implicit indexes automatically if explicit indexes do not match correlation requirements.
Any hints specified for the statement in question and including hints specified during the creation of named windows with create window.

The compiler prefers unique indexes over non-unique indexes.

The compiler prefers hash-based lookups (equals) and combination hash-btree lookups (equals and relational-operator or range) over btree lookups (relational-operator or range) over in-keyword (single and multi-index) lookup plans. This behavior can be controlled by hints that are discussed next.

22.2.33. Query Planning Expression Analysis Hints

The expression analysis hints impact query planning for any statement and fire-and-forget query that performs a join or subquery. They also impact named window and table on-action statements.

This hint instructs the compiler which expressions, operators or streams should be excluded and therefore not considered for query planning. The hint applies to the where-clause and, for outer joins, to the on-clause when present.

The hint takes a single expression as its sole parameter, which is placed in parentheses. The expression must return a boolean value.

When the provided expression returns true for a given combination, that combination will not be considered for the query plan. A combination consists of a from-stream (name or number), a to-stream (name or number), an operator (i.e. equals, relational, in-keyword) and a set of expressions.

Table 22.1. Built-In Properties of the Expression Analysis Hint

Name	Type	Description
exprs	string-array (`String[]`)	Expression texts with minified whitespace.
from_streamname	string	The stream name of the stream providing lookup values as provided by the `as` keyword.
from_streamnum	int	The integer ordinal number of the stream providing lookup values as listed in the from-clause.
opname	string	The operator name. Valid values are `equals`, `relop` (relational operators and ranges) and `inkw` (`in`-keyword).
to_streamname	string	The stream name of the stream providing indexable values as provided by the `as` keyword.
to_streamnum	int	The integer ordinal number of the stream providing indexable values as listed in the from-clause.

Consider two event types A and B. Event type A has a property aprop and event type B has a property bprop. Let's assume A and B are related by aprop and bprop.

An inner join of all A and B events might look like this:

select * from A#keepall as a, B#keepall as b where aprop = bprop

In the default query plan, when an A event comes in, the runtime obtains the value of aprop and performs an index lookup against bprop values to obtain matching B events. Vice versa, when a B event comes in, the runtime obtains the value of bprop and performs an index lookup against aprop values to obtain matching A events.

The compiler evaluates the hint expression for each combination. The table below outlines the two rows provided to the hint expression:

Table 22.2. Built-In Properties of the Expression Analysis Hint

exprs	from_streamname	from_streamnum	opname	to_streamname	to_streamnum
`["aprop", "bprop"]`	`a`	`0`	`equals`	`b`	`1`
`["bprop", "aprop"]`	`b`	`1`	`equals`	`a`	`0`

The following statement with hint causes the analyzer to exclude all combinations since the expression passed in always returns true, in effect causing the query planner to always execute the statement as a full table scan.

@hint('exclude_plan(true)')
select * from A#keepall as a, B#keepall as b where aprop = bprop

This hint instructs the compiler to ignore all equals-operators for query planning:

@hint('exclude_plan(opname="equals")') select ....

The next hint instructs the compiler to ignore the equals-operator for the direction of lookup from A to B:

@hint('exclude_plan(opname="equals" and from_streamname="a")') select ....

Conversely, this hint instructs the compiler to ignore the equals-operator for the direction of lookup from B to A:

@hint('exclude_plan(opname="equals" and from_streamname="b")') select ....

Use the exprs array of expression texts to exclude specific expressions:

@hint('exclude_plan(exprs[0]="aprop")') select ....

For subqueries the stream number zero is the subquery from-clause itself and 1 to N are the enclosing statement's from-clause streams. For named window and table on-action statements the stream number zero is the named window or table and stream number 1 refers to the triggering pattern or event.

To specify multiple expressions, please specify multiple hints. The compiler excludes a specific combination when any of the hint expressions returns true.
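
The evaluation rule described above can be sketched in plain Java. This is only an illustrative model, not the compiler's implementation: the Combination and ExcludePlanModel classes and the isExcluded method are hypothetical names, the fields mirror the built-in properties of Table 22.1, and each hint expression is modeled as a Java predicate.

```java
import java.util.List;
import java.util.function.Predicate;

// Hypothetical model of one combination offered to the hint expression.
final class Combination {
    final List<String> exprs;
    final String fromStreamname;
    final int fromStreamnum;
    final String opname;
    final String toStreamname;
    final int toStreamnum;

    Combination(List<String> exprs, String fromStreamname, int fromStreamnum,
                String opname, String toStreamname, int toStreamnum) {
        this.exprs = exprs;
        this.fromStreamname = fromStreamname;
        this.fromStreamnum = fromStreamnum;
        this.opname = opname;
        this.toStreamname = toStreamname;
        this.toStreamnum = toStreamnum;
    }
}

class ExcludePlanModel {
    // A combination is excluded when ANY of the hint expressions returns true.
    static boolean isExcluded(Combination c, List<Predicate<Combination>> hints) {
        return hints.stream().anyMatch(h -> h.test(c));
    }

    // The two combinations of the A/B inner join example.
    static List<Combination> joinExample() {
        return List.of(
            new Combination(List.of("aprop", "bprop"), "a", 0, "equals", "b", 1),
            new Combination(List.of("bprop", "aprop"), "b", 1, "equals", "a", 0));
    }

    public static void main(String[] args) {
        // Models exclude_plan(opname="equals" and from_streamname="a")
        List<Predicate<Combination>> hints = List.of(
            c -> c.opname.equals("equals") && c.fromStreamname.equals("a"));
        for (Combination c : joinExample()) {
            System.out.println(c.fromStreamname + " -> " + c.toStreamname
                + " excluded=" + isExcluded(c, hints));
        }
    }
}
```

In this model the lookup direction from a to b is excluded while the direction from b to a remains available for planning.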

To inspect values passed to the hint expression, please enable query plan logging. To inspect expression evaluation, please use @Audit.

22.2.34. Query Planning Index Hints

Currently index hints are only supported for the following types of statements:

Named window and table on-action statements (on-select, on-merge, on-insert, on-update).
Statements that have subselects against named windows that have index sharing enabled (the default is disabled).
Statements that have subselects against tables.
Fire-and-forget queries.

For the above statements, you may dictate to the compiler which explicit index (created via create index syntax) to use.

Specify the name of the explicit index in parentheses following @Hint and the index literal.

The following example instructs the compiler to use the UserProfileIndex if possible:

@Hint('index(UserProfileIndex)')

Add the literal bust to instruct the compiler to fail query planning with an exception, and therefore fail statement compilation, if it cannot use the index.

The following example instructs the compiler to use the UserProfileIndex if possible or fail with an exception if the index cannot be used:

@Hint('index(UserProfileIndex, bust)')

Multiple indexes can be listed separated by comma (,).

The next example instructs the compiler to consider the UserProfileIndex and the SessionIndex or fail with an exception if either index cannot be used:

@Hint('index(UserProfileIndex, SessionIndex, bust)')

The literal explicit can be added to instruct the compiler to use only explicitly created indexes.

The final example instructs the compiler to consider any explicitly created index or fail with an exception if any of the explicitly created indexes cannot be used:

@Hint('index(explicit, bust)')

22.2.35. Measuring Throughput

We recommend using System.nanoTime() to measure elapsed time when processing a batch of, for example, 1000 events.

Note that System.nanoTime() provides nanosecond precision, but not necessarily nanosecond resolution.

Therefore don't try to measure the time spent by the runtime processing a single event: The resolution of System.nanoTime() is not sufficient. Also, there are reports that System.nanoTime() can actually go "backwards" and may not always behave as expected under threading. Please check your JVM platform documentation.

In the default configuration, the best way to measure performance is to take nano time, send a large number of events, for example 10,000 events, and take nano time again, reporting on the difference between the two numbers.

If your configuration has inbound threading or other threading options set, you should either monitor the queue depth to determine performance, or disable threading options when measuring performance, or have your application use multiple threads to send events instead.
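
The batch measurement described above might look like the following minimal sketch. The ThroughputMeter class and its methods are hypothetical names, and the Consumer is a stand-in for whatever call your application uses to send an event into the runtime; the sketch assumes the default configuration without threading options.

```java
import java.util.function.Consumer;

class ThroughputMeter {
    // Sends `count` events through `sender` and returns the elapsed nanoseconds
    // for the whole batch (never for a single event).
    static long timeBatchNanos(int count, Consumer<Object> sender) {
        long start = System.nanoTime();
        for (int i = 0; i < count; i++) {
            sender.accept(Integer.valueOf(i)); // stand-in event object
        }
        return System.nanoTime() - start;
    }

    // Converts a batch measurement into events per second.
    static double eventsPerSecond(int count, long elapsedNanos) {
        return count / (elapsedNanos / 1_000_000_000.0);
    }

    public static void main(String[] args) {
        long[] received = {0};
        // Assumption: replace this lambda with your application's send call.
        Consumer<Object> sender = event -> received[0]++;
        int count = 10_000;
        long elapsed = timeBatchNanos(count, sender);
        System.out.printf("%d events in %d ns = %.0f events/s%n",
            count, elapsed, eventsPerSecond(count, elapsed));
    }
}
```

Running several batches and discarding the first ones gives the JVM time to warm up before you record numbers.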

22.2.36. Do Not Create the Same or Similar Statement X Times

It is vastly more efficient to create a statement once and attach multiple listeners than to create the same statement X times.

It is vastly more efficient to use context declarations to factor out commonalities between statements than to create X similar statements.

EPL, the compiler and runtime are optimized for low-latency and high-throughput execution. To accomplish that, the compiler analyzes and query-plans each statement. Certain information within each statement, such as indexes, can effectively be shared in the runtime so that the runtime can remove duplication of processing and thus achieve low latency and high throughput. The tradeoff is that the compiler must, for each statement, perform some upfront analysis.

Since your goal will be to make all test code as realistic, real-world and production-like as possible, we recommend against production code or test code deploying the same exact statement multiple times. Instead consider creating the same statement once and attaching multiple listeners. The compiler and runtime do not try to detect duplicate statements, since that can easily be done by your application.

Let's assume your test statement computes an aggregation over a 1-minute time window, for example select symbol, count(*) from StockTick#time(1 min) group by symbol. If your code creates the same statement 100 times the code instructs the runtime to track 100 logically independent time windows and to track aggregations for each group 100 times. Obviously, this is not a good use of EPL and the design of your statements and code may not be optimal.

Consider the world of relational databases. Your code could attach to a relational database, create the same table with a different name 100 times, and populate each of the 100 different tables with the same row data. A relational database administrator would probably recommend against creating 100 identical tables holding the same row data. Compare a statement to a relational database table in respect to how many there should be. In a good design there is a limited number of statements. The runtime is not specifically designed for a very large number of statements. Similarly a relational database schema design that has 100,000 tables would be something one would seriously question. How many statements fit into memory depends on the statements themselves and there is no general guideline.

EPL allows you the freedom to design your EPL in a way that reuses state and processing. For example, your EPL design could utilize a named window instead of allocating 100 independent time windows. Since named windows are shared, the runtime only needs to track one time window instead of 100. And your EPL design could use an EPL table to maintain aggregations once and in a central place, so that tracking counts per symbol is done once instead of 100 times.

Context declarations can be an efficient way to take commonalities between statements (things that are similar between multiple statements) and factor them out into a context declaration. Instead of creating X similar statements, declare a context and attach one statement to the context, thus having X context partitions. This eliminates compiling and/or deploying X same statements. Using context the compiler only needs to analyze the context declaration and the statement. Your application can send start and stop events to control which context partitions exist and what events each context partition analyzes. Use the context partition administrative API to browse or terminate context partitions.

For example, assume you need to create 100000 similar statements that all filter GeoEvent events:

create schema GeoEvent(id string, value int, marker string)

@name('statement-1') select * from GeoEvent(id = '0001', value between 10 and 20, marker in ('a', 'b'))

@name('statement-N') select * from GeoEvent(id = '0002', value between 20 and 30, marker in ('c', 'd'))

If your application compiles and deploys 100k statements as above, the compiler must analyze and query plan each statement separately, and the runtime must enter each set of filter criteria into a shared filter index tree. Remember that the runtime can process incoming events very fast, with low latency and high throughput, even for 100k statements. However compiling and deploying 100k individual statements does take CPU time.

In this example, the statements have similar filters: id = an_id, value between start_range and end_range and marker in (markers). You could say that statements are similar and look like:

select * from GeoEvent(id=an_id, value between start_range and end_range, marker in (markers))

The an_id, start_range, end_range and markers are essentially parameters to an instance of the filtering statement. Instances of statements are context partitions. Declare a context to refactor and change our design so the common filters are in one place. This approach requires just two statements: the context declaration and the statement with the filters. You may declare two event types: one to allocate new context partitions and one to terminate context partitions.

Start by creating an event type that controls which instances of the filtering statement (the context partitions) are active:

create schema InitEvent(id string, startRange int, endRange int, markers string[])

Next, create an event type that controls when a context partition terminates:

create schema TermEvent(id string)

The context declaration tells the runtime that when an InitEvent arrives you want to have a new instance that is parameterized by the InitEvent properties:

create context GeoEventFilterContext
  initiated by InitEvent as initevent
  terminated by TermEvent(id=initevent.id)

Define the statement that filters:

context GeoEventFilterContext select * from GeoEvent(id = context.initevent.id, 
  value between context.initevent.startRange and context.initevent.endRange, 
  marker in (context.initevent.markers))

Your application can now send InitEvent instances, for example (notation from the online EPL tool):

InitEvent={id='0001', startRange=10, endRange=20, markers={'a', 'b'}}
InitEvent={id='0002', startRange=20, endRange=30, markers={'c', 'd'}}

When the runtime receives an InitEvent instance, it can simply take the id, startRange, endRange and markers values and instantiate the EPL filter statement (aka. allocate a new context partition) and start looking for matching GeoEvent events.

To stop looking for a given id, send a TermEvent, like so:

TermEvent={id='0001'}
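
The lifecycle above can be pictured with a small conceptual model in Java. This is a sketch for intuition only and does not reflect how the runtime implements contexts: the GeoFilterPartitions class and its methods are hypothetical, and as a simplification it keeps at most one partition per id, whereas the context declaration allocates a new partition for each arriving InitEvent.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

class GeoFilterPartitions {
    // Parameters captured from one InitEvent; one instance models one context partition.
    static final class Partition {
        final int startRange;
        final int endRange;
        final List<String> markers;

        Partition(int startRange, int endRange, List<String> markers) {
            this.startRange = startRange;
            this.endRange = endRange;
            this.markers = markers;
        }
    }

    private final Map<String, Partition> partitions = new HashMap<>();

    // InitEvent: allocate a partition parameterized by the event properties.
    void onInit(String id, int startRange, int endRange, List<String> markers) {
        partitions.put(id, new Partition(startRange, endRange, markers));
    }

    // TermEvent: terminate the partition with the given id.
    void onTerm(String id) {
        partitions.remove(id);
    }

    // GeoEvent: true when an active partition's filter accepts the event.
    // The range check is inclusive, like the EPL between operator.
    boolean matches(String id, int value, String marker) {
        Partition p = partitions.get(id);
        return p != null && value >= p.startRange && value <= p.endRange
            && p.markers.contains(marker);
    }

    public static void main(String[] args) {
        GeoFilterPartitions model = new GeoFilterPartitions();
        model.onInit("0001", 10, 20, List.of("a", "b"));
        System.out.println(model.matches("0001", 15, "a")); // partition active
        model.onTerm("0001");
        System.out.println(model.matches("0001", 15, "a")); // partition terminated
    }
}
```

The point of the model is that only the parameters differ between partitions, while the filter logic exists once.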

22.2.37. Comparing Single-Threaded and Multi-Threaded Performance

The Java Virtual Machine optimizes locks such that the time to obtain a read lock, for example, differs widely between single-threaded and multi-threaded applications. We compared code that obtains an unfair ReentrantReadWriteLock read lock 100 million times, without any writer. We measured 3 seconds for a single-threaded application and 15 seconds for an application with 2 threads. It can therefore not be expected that scaling from single-threaded to 2 threads will always double performance. There is a base cost for multiple threads to coordinate.
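
A comparison of this kind could be reproduced with a sketch like the one below. It is not the code used for the measurements quoted above: the ReadLockBenchmark class is a hypothetical name, the iteration counts are reduced, and splitting the total number of acquisitions evenly across threads is an assumption. Timings vary widely by JVM and hardware.

```java
import java.util.concurrent.locks.ReentrantReadWriteLock;

class ReadLockBenchmark {
    // Each of `threads` threads acquires and releases the read lock
    // `iterationsPerThread` times; returns the elapsed wall-clock nanoseconds.
    static long run(int threads, long iterationsPerThread) {
        ReentrantReadWriteLock lock = new ReentrantReadWriteLock(false); // unfair lock
        Thread[] workers = new Thread[threads];
        for (int t = 0; t < threads; t++) {
            workers[t] = new Thread(() -> {
                for (long i = 0; i < iterationsPerThread; i++) {
                    lock.readLock().lock();
                    lock.readLock().unlock();
                }
            });
        }
        long start = System.nanoTime();
        for (Thread w : workers) {
            w.start();
        }
        try {
            for (Thread w : workers) {
                w.join();
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("benchmark interrupted", e);
        }
        return System.nanoTime() - start;
    }

    public static void main(String[] args) {
        long total = 10_000_000L; // reduced from 100 million to keep the run short
        System.out.println("1 thread : " + run(1, total) / 1_000_000 + " ms");
        System.out.println("2 threads: " + run(2, total / 2) / 1_000_000 + " ms");
    }
}
```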

22.2.38. Incremental Versus Recomputed Aggregation for Named Window Events

Whether aggregations of named window rows are computed incrementally or are recomputed from scratch depends on the type of statement.

When the runtime computes aggregation values incrementally, meaning it continuously updates the aggregation value as events enter and leave a named window, it means that the runtime internally subscribes to named window updates and applies these updates as they occur. For some applications this is the desired behavior.

For some applications re-computing aggregation values from scratch when a certain condition occurs, for example when a triggering event arrives or time passes, is beneficial. Re-computing an aggregation can be less expensive if the number of rows to consider is small and/or when the triggering event or time condition triggers infrequently.

The next paragraph assumes that a named window has been created to hold some historical financial data per symbol and minute:

create window HistoricalWindow#keepall as (symbol string, minute int, price double)

insert into HistoricalWindow select symbol, minute, price from HistoricalTick

For statements that simply select from a named window (excludes on-select) the runtime computes aggregation values incrementally, continuously updating the aggregation, as events enter and leave the named window.

For example, the below statement updates the total price incrementally as events enter and leave the named window. If events in the named window already exist at the time the statement gets created, the total price gets pre-computed once when the statement gets created and incrementally updated when events enter and leave the named window:

select sum(price) from HistoricalWindow(symbol='GE')

The same is true for uncorrelated subqueries. For statements that sub-select from a named window, the runtime computes aggregation values incrementally, continuously updating the aggregation, as events enter and leave the named window. This is only true for uncorrelated subqueries that don't have a where-clause.

// Output GE symbol total price, incrementally computed
// Outputs every 15 minutes on the hour.
select (sum(price) from HistoricalWindow(symbol='GE')) 
from pattern [every timer:at([0, 15, 30, 45], *, *, *, *, 0)]

If instead your application uses on-select or a correlated subquery, the runtime recomputes aggregation values from scratch every time the triggering event fires.

For example, the below statement does not incrementally compute the total price (use a plain select or subselect as above instead). Instead the runtime computes the total price from scratch based on the where-clause and matching rows:

// Output GE symbol total price (recomputed from scratch) every 15 minutes on the hour
on pattern [every timer:at([0, 15, 30, 45], *, *, *, *, 0)]
select sum(price) from HistoricalWindow where symbol='GE'

Unidirectional joins against named windows also do not incrementally compute aggregation values.

Joins and outer joins, that are not unidirectional, compute aggregation values incrementally.
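
The difference between the two strategies can be illustrated with a plain Java sketch. It models the idea only and is not the runtime's implementation: the SumStrategies class and its method names are hypothetical, and a list of prices stands in for the named window rows.

```java
import java.util.ArrayList;
import java.util.List;

class SumStrategies {
    private final List<Double> rows = new ArrayList<>(); // stands in for the named window
    private double runningSum = 0;                       // maintained incrementally

    // A row enters the window: the incremental sum is updated immediately.
    void enter(double price) {
        rows.add(price);
        runningSum += price;
    }

    // A row leaves the window: the incremental sum is updated immediately.
    void leave(double price) {
        if (rows.remove(Double.valueOf(price))) {
            runningSum -= price;
        }
    }

    // Incremental: constant cost per read, small cost on every enter and leave.
    double incrementalSum() {
        return runningSum;
    }

    // Recomputed: no cost on enter and leave, cost proportional to row count per read.
    double recomputedSum() {
        double sum = 0;
        for (double price : rows) {
            sum += price;
        }
        return sum;
    }

    public static void main(String[] args) {
        SumStrategies s = new SumStrategies();
        s.enter(10.0);
        s.enter(20.5);
        s.leave(10.0);
        System.out.println(s.incrementalSum() + " " + s.recomputedSum());
    }
}
```

Recomputing tends to pay off when reads are rare or the row count is small, which matches the guidance above for infrequent triggers.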

22.2.39. When Does Memory Get Released

Java Virtual Machines (JVMs) release memory only when a garbage collection occurs. Depending on your JVM settings a garbage collection can occur frequently or infrequently and may consider all or only parts of heap memory.

The runtime is optimized towards latency and throughput. The runtime does not force garbage collection or interfere with garbage collection. For performance-sensitive code areas, the runtime utilizes thread-local buffers such as arrays or ringbuffers that can retain small amounts of recently processed state. The runtime does not try to clean such buffers after every event for performance reasons. It does clean such buffers when destroying the runtime and undeploying. It is therefore normal to see a small non-increasing amount of memory to be retained after processing events that the garbage collector may not free immediately.

22.2.40. Measure Throughput of Non-Matches as Well as Matches

When an event comes in and the event does not match any statement, the runtime can discard that event since the event is a non-match. When measuring throughput, we suggest including non-matching events. The fact that the runtime can discard non-matching events extremely fast is an important aspect of processing.

Many use cases look for a needle-in-a-haystack situation or rarely occurring pattern. For example, a use case looking for security breaches may analyze 10 million events and find only a single situation consisting, for example, of 5 correlated events of the 10 million input events. We'd recommend your benchmark to closely mimic or to play back production data and watch the expected ratio of input and output events. Reducing the number of output events generally increases performance.

For example, assume you have 10 statements:

select * from pattern[A -> B(id = 1)];
select * from pattern[A -> B(id = 2)];
.....
select * from pattern[A -> B(id = 10)];

The above patterns each match once when an A event comes in followed by a B event with a given id between 1 and 10.

We recommend measuring throughput by sending in B events that have a value of minus one (-1) for id, for example, to determine how fast such events are discarded.

22.3. Using the Performance Kit

22.3.1. How to Use the Performance Kit

The benchmark application is basically an event server built with the runtime that listens to remote clients over TCP. Remote clients send MarketData(ticker, price, volume) streams to the event server. The event server is started with 1000 statements of one single kind (unless otherwise written), with one statement per ticker symbol, unless the statement kind does not depend on the symbol. The statement prototype is provided along with the results, with a '$' instead of the actual ticker symbol value. The event server is entirely multithreaded and can leverage the full power of 32bit or 64bit underlying hardware multi-processor multi-core architecture.

The kit also prints out when starting up the event size and the theoretical maximal throughput you can get on a 100 Mbit/s and 1 Gbit/s network. Keep in mind a 100 Mbit/s network will be overloaded at about 400 000 event/s when using our kit despite the small size of events.
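
The theoretical maximum is simple arithmetic: link bandwidth in bits per second divided by the event size in bits. The sketch below shows the calculation. The NetworkCeiling class is a hypothetical name, and the 31-byte event size is not taken from the kit; it is an assumed value chosen only because it lands near the 400 000 event/s figure quoted above. Protocol overhead is ignored.

```java
class NetworkCeiling {
    // Upper bound on events per second for a link, ignoring protocol overhead.
    static long maxEventsPerSecond(long bitsPerSecond, int eventSizeBytes) {
        return bitsPerSecond / (8L * eventSizeBytes);
    }

    public static void main(String[] args) {
        int eventSizeBytes = 31; // assumption for illustration, not the kit's actual size
        System.out.println("100 Mbit/s: " + maxEventsPerSecond(100_000_000L, eventSizeBytes));
        System.out.println("1 Gbit/s  : " + maxEventsPerSecond(1_000_000_000L, eventSizeBytes));
    }
}
```

Under that assumption the 100 Mbit/s link tops out at roughly 403 000 events per second. Use the event size the kit prints at startup for a real estimate.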

Results are posted on our Wiki page at Performance Wiki. Reported results do not represent best ever obtained results. Reported results may help you better compare Esper to other solutions (for latency, throughput and CPU utilization) and also assess your target hardware and JVMs.

The event server, client and statement prototypes are provided in the source repository esper/trunk/examples/benchmark/. Refer to http://www.espertech.com/esper for source access.

If you use the kit you should:

Choose the statement you want to benchmark, add it to etc/statements.properties under your own KEY and use the -mode KEY when you start the event server.
Prepare your runServer.sh/runServer.cmd and runClient.sh/runclient.cmd scripts. You'll need to drop the required jar libraries in lib/ and make sure the classpath configured in those scripts includes ./build and ./etc. The required libraries are Esper (any compatible version; we have tested starting with Esper 1.7.0) and its dependencies. At that stage you should also start to set the min and max JVM heap. A good start is 1GB, as in -Xms1g -Xmx1g.
Write the statement you want to benchmark given that client will send a stream MarketData(String ticker, int volume, double price), add it to etc/statements.properties under your own KEY and use the -mode KEY when you start the event server. Use '$' in the statement to create a prototype. For every symbol, a statement will get registered with all '$' replaced by the actual symbol value (f.e. 'GOOG')
Ensure client and server are using the same -Desper.benchmark.symbol=1000 value. This sets the number of symbols to use (and thus may set the number of statements if you are using a statement prototype), and governs how MarketData events are represented over the network. Basically all events will have the same size over the network to ensure predictability, and symbols will range between S0AA and S999A if you use 1000 as a value here (prefixed with S and padded with A up to a fixed-length string). Volume and price attributes will be randomized.
By default the benchmark registers a subscriber to the statement(s). Use -Desper.benchmark.ul to use an UpdateListener instead. Note that the subscriber contains suitable update(..) methods for the default proposed statement in the etc/statements.properties file but might not be suitable if you change statements due to the strong binding with statement results. Refer to Table 15.2, “Choices For Receiving Statement Results”.
Establish a performance baseline in simulation mode (without clients). Use the -rate 1x5000 option to simulate one client (one thread) sending 5000 evt/s. You can ramp up both the number of simulated client threads and their emission rate to maximize CPU utilization. The right number should mimic the client emission rate you will use in the client/server benchmark and should thus be consistent with what your client machine and network will be able to send. On small hardware, having a lot of threads with a slow rate will not help in getting high throughput in this simulation mode.

Do performance runs with client/server mode. Remove the -rate NxM option from the runServer script or Ant task. Start the server with -help to display the possible server options (listen port, statistics, fan out options etc). On the remote machine, start one or more clients. Use -help to display the possible client options (remote port, host, emission rate). The client will output the actual number of events it is sending to the server. If the server gets overloaded (or if you turned on -queue options on the server) the client will likely not be able to reach its target rate.

Usually you will get better performance by using server side -queue -1 option so as to have each client connection handled by a single thread pipeline. If you change to 0 or more, there will be intermediate structures to pass the event stream in an asynchronous fashion. This will increase context switching, although if you are using many clients, or are using the -sleep xxx (xxx in milliseconds) to simulate a listener delay you may get better performance.

The most important server side option is -stat xxx (xxx in seconds) to print out throughput and latency statistics aggregated over the last xxx seconds (and reset every time). It will produce both internal latency (in nanoseconds) and end-to-end latency (in milliseconds, including network time). If you are measuring end-to-end latency you should make sure your server and client machine(s) have synchronized clocks, for example using ntpd, with good enough precision. The stat format is like:

---Stats - runtime (unit: ns)
  Avg: 2528 #4101107
        0 <    5000:  97.01%  97.01% #3978672
     5000 <   10000:   2.60%  99.62% #106669
    10000 <   15000:   0.35%  99.97% #14337
    15000 <   20000:   0.02%  99.99% #971
    20000 <   25000:   0.00%  99.99% #177
    25000 <   50000:   0.00% 100.00% #89
    50000 <  100000:   0.00% 100.00% #41
   100000 <  500000:   0.00% 100.00% #120
   500000 < 1000000:   0.00% 100.00% #2
  1000000 < 2500000:   0.00% 100.00% #7
  2500000 < 5000000:   0.00% 100.00% #5
  5000000 <    more:   0.00% 100.00% #18
---Stats - endToEnd (unit: ms)
  Avg: -2704829444341073400 #4101609
        0 <       1:  75.01%  75.01% #3076609
        1 <       5:   0.00%  75.01% #0
        5 <      10:   0.00%  75.01% #0
       10 <      50:   0.00%  75.01% #0
       50 <     100:   0.00%  75.01% #0
      100 <     250:   0.00%  75.01% #0
      250 <     500:   0.00%  75.01% #0
      500 <    1000:   0.00%  75.01% #0
     1000 <    more:  24.99% 100.00% #1025000
Throughput 412503 (active 0 pending 0 cnx 4)

This one reads as:

"Throughput is 412 503 event/s with 4 clients connected. No -queue option 
was used, thus no event is pending at the time the statistics are printed. 
The latency average is 2528 ns (that is 2.5 us) for 4 101 107 events 
(which means we have 10 seconds of statistics here). Less than 10us latency 
was achieved for 99.62% of the events. Latency between 5us and 10us 
was achieved for 106 669 events, that is 2.60% of all the events in the interval."

"End to end latency was ... in this case likely due to client clock difference
we ended up with unusable end to end statistics."

Consider the second output paragraph on end-to-end latency:

---Stats - endToEnd (unit: ms)
  Avg: 15 #863396
        0 <       1:   0.75%   0.75% #6434
        1 <       5:   0.99%   1.74% #8552
        5 <      10:   2.12%   3.85% #18269
       10 <      50:  91.27%  95.13% #788062
       50 <     100:   0.10%  95.32% #827
      100 <     250:   4.36%  99.58% #37634
      250 <     500:   0.42% 100.00% #3618
      500 <    1000:   0.00% 100.00% #0
     1000 <    more:   0.00% 100.00% #0

This would read:

"End to end latency average is at 15 milliseconds for the 863 396 events 
considered for this statistic report. 95.13% of the events were handled 
(end to end) below 50ms, and 91.27%, i.e. 788 062 events, were handled between 10ms and 50ms."
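
The two percentage columns of the report can be recomputed from the bucket counts: the first is the share of events in the bucket, the second is the cumulative share up to and including the bucket. The following sketch does this for the end-to-end counts shown above, omitting the two empty trailing buckets. The LatencyHistogram class and its methods are hypothetical names and are not part of the kit.

```java
import java.util.Locale;

class LatencyHistogram {
    static long total(long[] counts) {
        long sum = 0;
        for (long c : counts) {
            sum += c;
        }
        return sum;
    }

    // Share of all events that fall into bucket i, in percent.
    static double bucketPercent(long[] counts, int i) {
        return 100.0 * counts[i] / total(counts);
    }

    // Share of all events in buckets 0..i, in percent.
    static double cumulativePercent(long[] counts, int i) {
        long upTo = 0;
        for (int k = 0; k <= i; k++) {
            upTo += counts[k];
        }
        return 100.0 * upTo / total(counts);
    }

    public static void main(String[] args) {
        long[] upperBoundsMs = {1, 5, 10, 50, 100, 250, 500};
        long[] counts = {6434, 8552, 18269, 788062, 827, 37634, 3618};
        for (int i = 0; i < counts.length; i++) {
            System.out.println(String.format(Locale.ROOT, "< %4d ms: %6.2f%% %6.2f%% #%d",
                upperBoundsMs[i], bucketPercent(counts, i),
                cumulativePercent(counts, i), counts[i]));
        }
    }
}
```

For the 10ms to 50ms bucket this gives 91.27% and a cumulative 95.13%, matching the report.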

Chapter 23. References

23.1. Reference List

Luckham, David. 2002. The Power of Events. Addison-Wesley.
The Stanford Rapide (TM) Project. http://pavg.stanford.edu/rapide.
Arasu, Arvind, et al. 2004. Linear Road: A Stream Data Management Benchmark. Stanford University. http://www.cs.brown.edu/research/aurora/Linear_Road_Benchmark_Homepage.html.

Appendix A. Output Reference and Samples

This section specifies the output of a subset of statements, for two purposes: First, to help application developers understand streaming runtime output in response to incoming events and in response to time passing. Second, to document and standardize output for statements in a testable and trackable fashion.

The section focuses on a subset of features, namely the time window, aggregation, grouping, and output rate limiting. The section does not currently provide examples for many of the other language features, thus there is no example for other data windows (the time window is used here), joins, sub-selects or named windows etc.

Rather than just describe syntax and output, this section provides detailed examples for each of the types of statements presented. The input for each type of statement is always the same set of events, and the same timing. Each event has three properties: symbol, volume and price. The property types are string, long and double, respectively.

The chapters are organized by the type of statement: The presence or absence of aggregation functions, as well as the presence or absence of a group by clause change statement output as described in Section 2.15, “Basic Aggregated Statement Types”.

You will notice that some statements utilize the order by clause for sorting output. The reason is that when multiple output rows are produced at once, the output can be easier to read if it is sorted.

With output rate limiting, the runtime invokes your listener even if there are no results to indicate when the output condition has been reached. Such is indicated as (empty result) in the output result columns.

The output columns show both insert and remove stream events. Insert stream events are delivered as an array of EventBean instances to listeners in the newData parameter, while remove stream events are delivered to the oldData parameter of listeners. Delivery to observers follows similar rules.

A.1. Introduction and Sample Data

For the purpose of illustration and documentation, the example data set demonstrates insert and remove streams based on a time window of a 5.5 second interval. The statement utilizing the time window could look as follows:

select symbol, volume, price from MarketData#time(5.5 sec)

We have picked a time window to demonstrate the output for events entering and leaving a data window with an expiration policy. The time window provides a simple expiration policy based on time: if an event resides in the time window more than 5.5 seconds, the runtime expires the event from the time window.

The input events and their timing are below. The table should be read, starting from top, as "The time starts at 0.2 seconds. Event E1 arrives at 0.2 seconds with properties [S1, 100, 25]. At 0.8 seconds event E2 arrives with properties [S2, 5000, 9.0]" and so on.

                       Input                                 
-----------------------------------------------  
 Time Symbol  Volume   Price
  0.2                          
          S1     100    25.0   Event E1 arrives
  0.8                          
          S2    5000     9.0   Event E2 arrives
  1.0                          
  1.2                          
  1.5                          
          S1     150    24.0   Event E3 arrives
          S3   10000     1.0   Event E4 arrives
  2.0                          
  2.1                          
          S1     155    26.0   Event E5 arrives
  2.2                          
  2.5                          
  3.0                          
  3.2                          
  3.5                          
          S3   11000     2.0   Event E6 arrives
  4.0                          
  4.2                          
  4.3                          
          S1     150    22.0   Event E7 arrives
  4.9                          
          S3   11500     3.0   Event E8 arrives
  5.0                          
  5.2                          
  5.7                          Event E1 leaves the time window
  5.9                          
          S3   10500     1.0   Event E9 arrives
  6.0                          
  6.2                          
  6.3                          Event E2 leaves the time window
  7.0                          Event E3 and E4 leave the time window
  7.2

The event data set assumes a time window of 5.5 seconds. Thus at time 5.7 seconds the first arriving event (E1) leaves the time window.

The data set as above shows times between 0.2 seconds and 7.2 seconds. Only a couple of time points have been picked for the table to keep the set of time points constant between statements, and thus make the test data and output easier to understand.
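
The expiry times in the table follow from adding 5.5 seconds to each arrival time. The sketch below models that rule with a simple queue. It is an illustration only, not the runtime's time window implementation: the TimeWindowModel class and its methods are hypothetical names, and times are kept in milliseconds to avoid floating-point rounding.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

class TimeWindowModel {
    private static final class Entry {
        final String name;
        final long arrivalMs;

        Entry(String name, long arrivalMs) {
            this.name = name;
            this.arrivalMs = arrivalMs;
        }
    }

    private final long windowMs;
    private final Deque<Entry> window = new ArrayDeque<>();

    TimeWindowModel(long windowMs) {
        this.windowMs = windowMs;
    }

    // An event enters the window at the given time (insert stream).
    void arrive(String name, long nowMs) {
        window.addLast(new Entry(name, nowMs));
    }

    // Advances time and returns the events that have left the window (remove stream).
    List<String> advanceTo(long nowMs) {
        List<String> expired = new ArrayList<>();
        while (!window.isEmpty() && window.peekFirst().arrivalMs + windowMs <= nowMs) {
            expired.add(window.removeFirst().name);
        }
        return expired;
    }

    public static void main(String[] args) {
        TimeWindowModel w = new TimeWindowModel(5500);
        w.arrive("E1", 200);
        w.arrive("E2", 800);
        w.arrive("E3", 1500);
        w.arrive("E4", 1500);
        for (long t : new long[] {5200, 5700, 6300, 7000}) {
            System.out.println(t + " ms: expired " + w.advanceTo(t));
        }
    }
}
```

With the sample data, E1 leaves at 5.7 seconds, E2 at 6.3 seconds, and E3 and E4 together at 7.0 seconds, as in the table.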

A.2. Output for Un-Aggregated and Un-Grouped Statements

This chapter provides sample output for statements that do not have aggregation functions and do not have a group by clause.

A.2.1. No Output Rate Limiting

Without an output clause, the runtime dispatches to listeners as soon as events arrive, or as soon as time passes such that events leave data windows.

select irstream symbol, volume, price from MarketData#time(5.5 sec)

With an output clause, the runtime dispatches to listeners when the output condition occurs. Here, the output condition is a 1-second time interval. The runtime thus outputs every 1 second, starting from the first event, even if there are no new events or no expiring events to output.

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
                                                 [IBM, 100, 25.0]                       
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
                                                 [MSFT, 5000, 9.0]                      
  1.0                          
  1.2                          
  1.5                          
         IBM     150    24.0   Event E3 arrives
                                                 [IBM, 150, 24.0]                       
         YAH   10000     1.0   Event E4 arrives
                                                 [YAH, 10000, 1.0]                      
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
                                                 [IBM, 155, 26.0]                       
  2.2                          
  2.5                          
  3.0                          
  3.2                          
  3.5                          
         YAH   11000     2.0   Event E6 arrives
                                                 [YAH, 11000, 2.0]                      
  4.0                          
  4.2                          
  4.3                          
         IBM     150    22.0   Event E7 arrives
                                                 [IBM, 150, 22.0]                       
  4.9                          
         YAH   11500     3.0   Event E8 arrives
                                                 [YAH, 11500, 3.0]                      
  5.0                          
  5.2                          
  5.7                          Event E1 leaves the time window
                                                                    [IBM, 100, 25.0]    
  5.9                          
         YAH   10500     1.0   Event E9 arrives
                                                 [YAH, 10500, 1.0]                      
  6.0                          
  6.2                          
  6.3                          Event E2 leaves the time window
                                                                    [MSFT, 5000, 9.0]   
  7.0                          Event E3 and E4 leave the time window
                                                                    [IBM, 150, 24.0]    
                                                                    [YAH, 10000, 1.0]   
  7.2
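
The dispatch order in the table above can be reproduced with a small simulation outside the runtime. The following Python sketch is not Esper code: the names EVENTS, WINDOW_MS and stream_changes are illustrative, times are held in milliseconds, and the sketch assumes that an event leaves the window exactly 5.5 seconds after it arrives.

```python
# Illustrative simulation only; this does not use the Esper API.
WINDOW_MS = 5500  # corresponds to #time(5.5 sec)

# (arrival time in ms, symbol, volume, price), copied from the table.
EVENTS = [
    (200, "IBM", 100, 25.0),
    (800, "MSFT", 5000, 9.0),
    (1500, "IBM", 150, 24.0),
    (1500, "YAH", 10000, 1.0),
    (2100, "IBM", 155, 26.0),
    (3500, "YAH", 11000, 2.0),
    (4300, "IBM", 150, 22.0),
    (4900, "YAH", 11500, 3.0),
    (5900, "YAH", 10500, 1.0),
]


def stream_changes(events, window_ms, end_ms):
    """Return (time_ms, stream, row) tuples in dispatch order up to end_ms."""
    changes = []
    for arrival, symbol, volume, price in events:
        row = (symbol, volume, price)
        changes.append((arrival, "insert", row))
        if arrival + window_ms <= end_ms:
            changes.append((arrival + window_ms, "remove", row))
    # sorted() is stable, so rows with the same timestamp keep arrival order.
    return sorted(changes, key=lambda change: change[0])


for time_ms, stream, row in stream_changes(EVENTS, WINDOW_MS, 7200):
    print(f"{time_ms / 1000:.1f}  {stream:6}  {list(row)}")
```

Each printed line corresponds to one output row of the table, with insert stream and remove stream rows interleaved in time order.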

A.2.2. Output Rate Limiting - Default

The default (no keyword) and the ALL keyword result in the same output.

select irstream symbol, volume, price from MarketData#time(5.5 sec) 
output every 1 seconds

Using the LAST keyword in the output clause, the runtime dispatches to listeners only the last event of each insert and remove stream.

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
  1.0                          
  1.2                          
                                                 [IBM, 100, 25.0]                       
                                                 [MSFT, 5000, 9.0]                      
  1.5                          
         IBM     150    24.0   Event E3 arrives
         YAH   10000     1.0   Event E4 arrives
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
                                                 [IBM, 150, 24.0]                       
                                                 [YAH, 10000, 1.0]                      
                                                 [IBM, 155, 26.0]                       
  2.5                          
  3.0                          
  3.2                          
                                                 (empty result)     (empty result)      
  3.5                          
         YAH   11000     2.0   Event E6 arrives
  4.0                          
  4.2                          
                                                 [YAH, 11000, 2.0]                      
  4.3                          
         IBM     150    22.0   Event E7 arrives
  4.9                          
         YAH   11500     3.0   Event E8 arrives
  5.0                          
  5.2                          
                                                 [IBM, 150, 22.0]                       
                                                 [YAH, 11500, 3.0]                      
  5.7                          Event E1 leaves the time window
  5.9                          
         YAH   10500     1.0   Event E9 arrives
  6.0                          
  6.2                          
                                                 [YAH, 10500, 1.0]  [IBM, 100, 25.0]    
  6.3                          Event E2 leaves the time window
  7.0                          Event E3 and E4 leave the time window
  7.2                          
                                                                    [MSFT, 5000, 9.0]   
                                                                    [IBM, 150, 24.0]    
                                                                    [YAH, 10000, 1.0]
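
The batching in the table above, for the statement with output every 1 seconds, can be sketched in Python as follows. This is an illustrative model and not runtime code: CHANGES and batch_every are made-up names, events are referred to by their labels E1 to E9, and the sketch assumes that a change falling exactly on an interval boundary would belong to the next batch (the sample data has no such case).

```python
# Illustrative model only; this does not use the Esper API.
# (time in ms, stream, event label), in the order the rows are dispatched
# when the statement has no output clause.
CHANGES = [
    (200, "insert", "E1"), (800, "insert", "E2"),
    (1500, "insert", "E3"), (1500, "insert", "E4"),
    (2100, "insert", "E5"), (3500, "insert", "E6"),
    (4300, "insert", "E7"), (4900, "insert", "E8"),
    (5700, "remove", "E1"), (5900, "insert", "E9"),
    (6300, "remove", "E2"),
    (7000, "remove", "E3"), (7000, "remove", "E4"),
]


def batch_every(changes, interval_ms, end_ms):
    """Group changes into (tick_ms, insert_rows, remove_rows) per interval."""
    tick = changes[0][0] + interval_ms  # intervals start at the first event
    batches, index = [], 0
    while tick <= end_ms:
        inserted, removed = [], []
        # Assumption: a change at exactly the tick time goes to the next batch.
        while index < len(changes) and changes[index][0] < tick:
            _, stream, label = changes[index]
            (inserted if stream == "insert" else removed).append(label)
            index += 1
        batches.append((tick, inserted, removed))
        tick += interval_ms
    return batches


for tick, inserted, removed in batch_every(CHANGES, 1000, 7200):
    print(f"{tick / 1000:.1f}  insert={inserted or '(empty)'}  remove={removed or '(empty)'}")
```

The batch at 3.2 seconds is empty in both streams, matching the (empty result) row in the table.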

A.2.3. Output Rate Limiting - Last

select irstream symbol, volume, price from MarketData#time(5.5 sec) 
output last every 1 seconds

Using the FIRST keyword in the output clause, the runtime dispatches to listeners only the first event of each insert or remove stream, and does not output further events until the output condition is reached.

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
  1.0                          
  1.2                          
                                                 [MSFT, 5000, 9.0]                      
  1.5                          
         IBM     150    24.0   Event E3 arrives
         YAH   10000     1.0   Event E4 arrives
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
                                                 [IBM, 155, 26.0]                       
  2.5                          
  3.0                          
  3.2                          
                                                 (empty result)     (empty result)      
  3.5                          
         YAH   11000     2.0   Event E6 arrives
  4.0                          
  4.2                          
                                                 [YAH, 11000, 2.0]                      
  4.3                          
         IBM     150    22.0   Event E7 arrives
  4.9                          
         YAH   11500     3.0   Event E8 arrives
  5.0                          
  5.2                          
                                                 [YAH, 11500, 3.0]                      
  5.7                          Event E1 leaves the time window
  5.9                          
         YAH   10500     1.0   Event E9 arrives
  6.0                          
  6.2                          
                                                 [YAH, 10500, 1.0]  [IBM, 100, 25.0]    
  6.3                          Event E2 leaves the time window
  7.0                          Event E3 and E4 leave the time window
  7.2                          
                                                                    [YAH, 10000, 1.0]
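
The effect of the LAST keyword in the table above can be modeled by keeping only the final insert stream row and the final remove stream row of each 1-second batch. The Python sketch below is illustrative only: CHANGES and last_per_interval are made-up names, and events are referred to by their labels E1 to E9.

```python
# Illustrative model only; this does not use the Esper API.
# (time in ms, stream, event label) for the sample events.
CHANGES = [
    (200, "insert", "E1"), (800, "insert", "E2"),
    (1500, "insert", "E3"), (1500, "insert", "E4"),
    (2100, "insert", "E5"), (3500, "insert", "E6"),
    (4300, "insert", "E7"), (4900, "insert", "E8"),
    (5700, "remove", "E1"), (5900, "insert", "E9"),
    (6300, "remove", "E2"),
    (7000, "remove", "E3"), (7000, "remove", "E4"),
]


def last_per_interval(changes, interval_ms, end_ms):
    """Return (tick_ms, last_insert, last_remove) with empty lists for no row."""
    start = changes[0][0]
    output = []
    for tick in range(start + interval_ms, end_ms + 1, interval_ms):
        batch = [c for c in changes if tick - interval_ms <= c[0] < tick]
        inserts = [label for _, stream, label in batch if stream == "insert"]
        removes = [label for _, stream, label in batch if stream == "remove"]
        # A [-1:] slice keeps the last element, or stays empty for an empty list.
        output.append((tick, inserts[-1:], removes[-1:]))
    return output


for tick, last_insert, last_remove in last_per_interval(CHANGES, 1000, 7200):
    print(f"{tick / 1000:.1f}  insert={last_insert}  remove={last_remove}")
```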

A.2.4. Output Rate Limiting - First

select irstream symbol, volume, price from MarketData#time(5.5 sec)
output first every 1 seconds

Using the SNAPSHOT keyword in the output clause, the runtime posts data window contents when the output condition is reached.

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
                                                 [IBM, 100, 25.0]                       
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
  1.0                          
  1.2                          
  1.5                          
         IBM     150    24.0   Event E3 arrives
                                                 [IBM, 150, 24.0]                       
         YAH   10000     1.0   Event E4 arrives
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
  2.5                          
  3.0                          
  3.2                          
  3.5                          
         YAH   11000     2.0   Event E6 arrives
                                                 [YAH, 11000, 2.0]                      
  4.0                          
  4.2                          
  4.3                          
         IBM     150    22.0   Event E7 arrives
                                                 [IBM, 150, 22.0]                       
  4.9                          
         YAH   11500     3.0   Event E8 arrives
  5.0                          
  5.2                          
  5.7                          Event E1 leaves the time window
                                                                    [IBM, 100, 25.0]    
  5.9                          
         YAH   10500     1.0   Event E9 arrives
  6.0                          
  6.2                          
  6.3                          Event E2 leaves the time window
                                                                    [MSFT, 5000, 9.0]   
  7.0                          Event E3 and E4 leave the time window
  7.2
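
The rows in the table above are consistent with a simple model in which the 1-second intervals are fixed, beginning at the first event (0.2, 1.2, 2.2 and so on), and only the first change within each interval is dispatched, regardless of whether it is an insert or remove stream row. The Python sketch below encodes that reading of the table; it is an assumption drawn from the sample data and not a description of the runtime's scheduling, and the names CHANGES and first_per_interval are illustrative.

```python
# Illustrative model only; this does not use the Esper API.
# (time in ms, stream, event label) for the sample events.
CHANGES = [
    (200, "insert", "E1"), (800, "insert", "E2"),
    (1500, "insert", "E3"), (1500, "insert", "E4"),
    (2100, "insert", "E5"), (3500, "insert", "E6"),
    (4300, "insert", "E7"), (4900, "insert", "E8"),
    (5700, "remove", "E1"), (5900, "insert", "E9"),
    (6300, "remove", "E2"),
    (7000, "remove", "E3"), (7000, "remove", "E4"),
]


def first_per_interval(changes, interval_ms):
    """Keep only the first change that falls into each fixed interval."""
    start = changes[0][0]
    seen_intervals = set()
    output = []
    for time_ms, stream, label in changes:
        interval = (time_ms - start) // interval_ms
        if interval not in seen_intervals:
            seen_intervals.add(interval)
            output.append((time_ms, stream, label))
    return output


for time_ms, stream, label in first_per_interval(CHANGES, 1000):
    print(f"{time_ms / 1000:.1f}  {stream:6}  {label}")
```

Under this model E9 (arriving at 5.9) is suppressed because E1 leaving at 5.7 was already the first change of that interval, which is what the table shows.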

A.2.5. Output Rate Limiting - Snapshot

select irstream symbol, volume, price from MarketData#time(5.5 sec)
output snapshot every 1 seconds

This chapter provides sample output for statements that have aggregation functions, that do not have a group by clause, and in which all event properties are under aggregation.

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
  1.0                          
  1.2                          
                                                 [IBM, 100, 25.0]                       
                                                 [MSFT, 5000, 9.0]                      
  1.5                          
         IBM     150    24.0   Event E3 arrives
         YAH   10000     1.0   Event E4 arrives
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
                                                 [IBM, 100, 25.0]                       
                                                 [MSFT, 5000, 9.0]                      
                                                 [IBM, 150, 24.0]                       
                                                 [YAH, 10000, 1.0]                      
                                                 [IBM, 155, 26.0]                       
  2.5                          
  3.0                          
  3.2                          
                                                 [IBM, 100, 25.0]                       
                                                 [MSFT, 5000, 9.0]                      
                                                 [IBM, 150, 24.0]                       
                                                 [YAH, 10000, 1.0]                      
                                                 [IBM, 155, 26.0]                       
  3.5                          
         YAH   11000     2.0   Event E6 arrives
  4.0                          
  4.2                          
                                                 [IBM, 100, 25.0]                       
                                                 [MSFT, 5000, 9.0]                      
                                                 [IBM, 150, 24.0]                       
                                                 [YAH, 10000, 1.0]                      
                                                 [IBM, 155, 26.0]                       
                                                 [YAH, 11000, 2.0]                      
  4.3                          
         IBM     150    22.0   Event E7 arrives
  4.9                          
         YAH   11500     3.0   Event E8 arrives
  5.0                          
  5.2                          
                                                 [IBM, 100, 25.0]                       
                                                 [MSFT, 5000, 9.0]                      
                                                 [IBM, 150, 24.0]                       
                                                 [YAH, 10000, 1.0]                      
                                                 [IBM, 155, 26.0]                       
                                                 [YAH, 11000, 2.0]                      
                                                 [IBM, 150, 22.0]                       
                                                 [YAH, 11500, 3.0]                      
  5.7                          Event E1 leaves the time window
  5.9                          
         YAH   10500     1.0   Event E9 arrives
  6.0                          
  6.2                          
                                                 [MSFT, 5000, 9.0]                      
                                                 [IBM, 150, 24.0]                       
                                                 [YAH, 10000, 1.0]                      
                                                 [IBM, 155, 26.0]                       
                                                 [YAH, 11000, 2.0]                      
                                                 [IBM, 150, 22.0]                       
                                                 [YAH, 11500, 3.0]                      
                                                 [YAH, 10500, 1.0]                      
  6.3                          Event E2 leaves the time window
  7.0                          Event E3 and E4 leave the time window
  7.2                          
                                                 [IBM, 155, 26.0]                       
                                                 [YAH, 11000, 2.0]                      
                                                 [IBM, 150, 22.0]                       
                                                 [YAH, 11500, 3.0]                      
                                                 [YAH, 10500, 1.0]
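
The snapshot rows in the table above are the contents of the time window at each output time. A minimal Python sketch of that computation follows; it is illustrative only, the names ARRIVALS, WINDOW_MS and snapshots are made up, and it assumes an event is in the window from its arrival time until exactly 5.5 seconds later.

```python
# Illustrative model only; this does not use the Esper API.
WINDOW_MS = 5500  # corresponds to #time(5.5 sec)

# (arrival time in ms, event label) for the sample events.
ARRIVALS = [
    (200, "E1"), (800, "E2"), (1500, "E3"), (1500, "E4"), (2100, "E5"),
    (3500, "E6"), (4300, "E7"), (4900, "E8"), (5900, "E9"),
]


def snapshots(arrivals, window_ms, interval_ms, end_ms):
    """Return (tick_ms, window_contents) for each output time."""
    start = arrivals[0][0]
    result = []
    for tick in range(start + interval_ms, end_ms + 1, interval_ms):
        # Assumption: an event has left the window once tick reaches
        # arrival time + window length.
        content = [label for t, label in arrivals if t <= tick < t + window_ms]
        result.append((tick, content))
    return result


for tick, content in snapshots(ARRIVALS, WINDOW_MS, 1000, 7200):
    print(f"{tick / 1000:.1f}  {content}")
```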

A.3. Output for Fully-Aggregated and Un-Grouped Statements

A.3.1. No Output Rate Limiting

select irstream sum(price) from MarketData#time(5.5 sec)

Output occurs when the output condition is reached, after each 1-second time interval. For each arriving event, the insert stream carries the new aggregation value and the remove stream carries the prior aggregation value. This is useful for obtaining the change in value for each event or group. If there is a having clause, the filter expression applies to each row.

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
                                                 [25.0]             [null]              
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
                                                 [34.0]             [25.0]              
  1.0                          
  1.2                          
  1.5                          
         IBM     150    24.0   Event E3 arrives
                                                 [58.0]             [34.0]              
         YAH   10000     1.0   Event E4 arrives
                                                 [59.0]             [58.0]              
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
                                                 [85.0]             [59.0]              
  2.2                          
  2.5                          
  3.0                          
  3.2                          
  3.5                          
         YAH   11000     2.0   Event E6 arrives
                                                 [87.0]             [85.0]              
  4.0                          
  4.2                          
  4.3                          
         IBM     150    22.0   Event E7 arrives
                                                 [109.0]            [87.0]              
  4.9                          
         YAH   11500     3.0   Event E8 arrives
                                                 [112.0]            [109.0]             
  5.0                          
  5.2                          
  5.7                          Event E1 leaves the time window
                                                 [87.0]             [112.0]             
  5.9                          
         YAH   10500     1.0   Event E9 arrives
                                                 [88.0]             [87.0]              
  6.0                          
  6.2                          
  6.3                          Event E2 leaves the time window
                                                 [79.0]             [88.0]              
  7.0                          Event E3 and E4 leave the time window
                                                 [54.0]             [79.0]              
  7.2
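
The pairs of new and prior sum(price) values in the table above can be reproduced with a running total. The Python sketch below is illustrative only and uses made-up names (EVENTS, WINDOW_MS, sum_price_rows). Following the table, it treats each arrival as its own step and treats all events that leave the window at the same time as a single step, which is why E3 and E4 leaving at 7.0 yield one row.

```python
# Illustrative model only; this does not use the Esper API.
WINDOW_MS = 5500  # corresponds to #time(5.5 sec)

# (arrival time in ms, symbol, volume, price), copied from the table.
EVENTS = [
    (200, "IBM", 100, 25.0), (800, "MSFT", 5000, 9.0),
    (1500, "IBM", 150, 24.0), (1500, "YAH", 10000, 1.0),
    (2100, "IBM", 155, 26.0), (3500, "YAH", 11000, 2.0),
    (4300, "IBM", 150, 22.0), (4900, "YAH", 11500, 3.0),
    (5900, "YAH", 10500, 1.0),
]


def sum_price_rows(events, window_ms, end_ms):
    """Return (time_ms, new_sum, prior_sum) for every change of sum(price)."""
    steps = [(arrival, price) for arrival, _sym, _vol, price in events]
    leaving = {}  # expiry time -> combined (negative) price change
    for arrival, _sym, _vol, price in events:
        expiry = arrival + window_ms
        if expiry <= end_ms:
            leaving[expiry] = leaving.get(expiry, 0.0) - price
    steps += list(leaving.items())
    steps.sort(key=lambda step: step[0])  # stable sort keeps arrival order

    rows, total, prior = [], 0.0, None  # prior is None (null) before any event
    for time_ms, delta in steps:
        total += delta
        rows.append((time_ms, total, prior))
        prior = total
    return rows


for time_ms, new, old in sum_price_rows(EVENTS, WINDOW_MS, 7200):
    print(f"{time_ms / 1000:.1f}  insert=[{new}]  remove=[{'null' if old is None else old}]")
```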

A.3.2. Output Rate Limiting - Default

Here too, the default (no keyword) and the ALL keyword result in the same output.

select irstream sum(price) from MarketData#time(5.5 sec)
output every 1 seconds

With the LAST keyword, the insert stream carries one event that holds the last aggregation value, and the remove stream carries the prior aggregation value.

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
  1.0                          
  1.2                          
                                                 [25.0]             [null]              
                                                 [34.0]             [25.0]              
  1.5                          
         IBM     150    24.0   Event E3 arrives
         YAH   10000     1.0   Event E4 arrives
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
                                                 [58.0]             [34.0]              
                                                 [59.0]             [58.0]              
                                                 [85.0]             [59.0]              
  2.5                          
  3.0                          
  3.2                          
                                                 [85.0]             [85.0]              
  3.5                          
         YAH   11000     2.0   Event E6 arrives
  4.0                          
  4.2                          
                                                 [87.0]             [85.0]              
  4.3                          
         IBM     150    22.0   Event E7 arrives
  4.9                          
         YAH   11500     3.0   Event E8 arrives
  5.0                          
  5.2                          
                                                 [109.0]            [87.0]              
                                                 [112.0]            [109.0]             
  5.7                          Event E1 leaves the time window
  5.9                          
         YAH   10500     1.0   Event E9 arrives
  6.0                          
  6.2                          
                                                 [87.0]             [112.0]             
                                                 [88.0]             [87.0]              
  6.3                          Event E2 leaves the time window
  7.0                          Event E3 and E4 leave the time window
  7.2                          
                                                 [79.0]             [88.0]              
                                                 [54.0]             [79.0]
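
The table above groups the per-event sum(price) rows into 1-second batches. The Python sketch below models that grouping. It is illustrative only: SUM_ROWS holds the new and prior sums listed in the table for the statement without an output clause, batch_sums is a made-up name, and the handling of an interval without changes (the current sum in both streams) simply mirrors the row at 3.2 seconds.

```python
# Illustrative model only; this does not use the Esper API.
# (time in ms, new sum, prior sum); None stands for null.
SUM_ROWS = [
    (200, 25.0, None), (800, 34.0, 25.0),
    (1500, 58.0, 34.0), (1500, 59.0, 58.0),
    (2100, 85.0, 59.0), (3500, 87.0, 85.0),
    (4300, 109.0, 87.0), (4900, 112.0, 109.0),
    (5700, 87.0, 112.0), (5900, 88.0, 87.0),
    (6300, 79.0, 88.0), (7000, 54.0, 79.0),
]


def batch_sums(rows, interval_ms, end_ms):
    """Return (tick_ms, [(insert_value, remove_value), ...]) per interval."""
    start = rows[0][0]
    current = None
    batches = []
    for tick in range(start + interval_ms, end_ms + 1, interval_ms):
        batch = [(new, old) for t, new, old in rows if tick - interval_ms <= t < tick]
        if batch:
            current = batch[-1][0]
        else:
            # No change in this interval: as in the table, the unchanged sum
            # appears in both the insert and the remove stream.
            batch = [(current, current)]
        batches.append((tick, batch))
    return batches


for tick, batch in batch_sums(SUM_ROWS, 1000, 7200):
    print(f"{tick / 1000:.1f}  {batch}")
```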

A.3.3. Output Rate Limiting - Last

select irstream sum(price) from MarketData#time(5.5 sec) 
output last every 1 seconds

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
  1.0                          
  1.2                          
                                                 [34.0]             [null]              
  1.5                          
         IBM     150    24.0   Event E3 arrives
         YAH   10000     1.0   Event E4 arrives
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
                                                 [85.0]             [34.0]              
  2.5                          
  3.0                          
  3.2                          
                                                 [85.0]             [85.0]              
  3.5                          
         YAH   11000     2.0   Event E6 arrives
  4.0                          
  4.2                          
                                                 [87.0]             [85.0]              
  4.3                          
         IBM     150    22.0   Event E7 arrives
  4.9                          
         YAH   11500     3.0   Event E8 arrives
  5.0                          
  5.2                          
                                                 [112.0]            [87.0]              
  5.7                          Event E1 leaves the time window
  5.9                          
         YAH   10500     1.0   Event E9 arrives
  6.0                          
  6.2                          
                                                 [88.0]             [112.0]             
  6.3                          Event E2 leaves the time window
  7.0                          Event E3 and E4 leave the time window
  7.2                          
                                                 [54.0]             [88.0]
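
In the table above each output carries the sum(price) of the current window contents in the insert stream and the value from the previous output in the remove stream. A minimal Python sketch of that calculation follows; the names PRICES, WINDOW_MS and last_sums are illustrative, and the sketch assumes an event leaves the window exactly 5.5 seconds after arrival.

```python
# Illustrative model only; this does not use the Esper API.
WINDOW_MS = 5500  # corresponds to #time(5.5 sec)

# (arrival time in ms, price) for events E1 to E9.
PRICES = [
    (200, 25.0), (800, 9.0), (1500, 24.0), (1500, 1.0), (2100, 26.0),
    (3500, 2.0), (4300, 22.0), (4900, 3.0), (5900, 1.0),
]


def last_sums(prices, window_ms, interval_ms, end_ms):
    """Return (tick_ms, current_sum, previous_sum) for each output time."""
    start = prices[0][0]
    previous = None  # null before the first output
    output = []
    for tick in range(start + interval_ms, end_ms + 1, interval_ms):
        # Sum over events inside the window at this tick. The window is never
        # empty in the sample, so the empty-window case is not modeled here.
        current = sum(p for t, p in prices if t <= tick < t + window_ms)
        output.append((tick, current, previous))
        previous = current
    return output


for tick, current, previous in last_sums(PRICES, WINDOW_MS, 1000, 7200):
    print(f"{tick / 1000:.1f}  insert=[{current}]  remove=[{'null' if previous is None else previous}]")
```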

A.3.4. Output Rate Limiting - First

select irstream sum(price) from MarketData#time(5.5 sec)
output first every 1 seconds

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
                                                 [25.0]             [null]              
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
  1.0                          
  1.2                          
  1.5                          
         IBM     150    24.0   Event E3 arrives
                                                 [58.0]             [34.0]              
         YAH   10000     1.0   Event E4 arrives
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
  2.5                          
  3.0                          
  3.2                          
  3.5                          
         YAH   11000     2.0   Event E6 arrives
                                                 [87.0]             [85.0]              
  4.0                          
  4.2                          
  4.3                          
         IBM     150    22.0   Event E7 arrives
                                                 [109.0]            [87.0]              
  4.9                          
         YAH   11500     3.0   Event E8 arrives
  5.0                          
  5.2                          
  5.7                          Event E1 leaves the time window
                                                 [87.0]             [112.0]             
  5.9                          
         YAH   10500     1.0   Event E9 arrives
  6.0                          
  6.2                          
  6.3                          Event E2 leaves the time window
                                                 [79.0]             [88.0]              
  7.0                          Event E3 and E4 leave the time window
  7.2

A.3.5. Output Rate Limiting - Snapshot

select irstream sum(price) from MarketData#time(5.5 sec)
output snapshot every 1 seconds

This chapter provides sample output for statements that have aggregation functions, that do not have a group by clause, and in which there are event properties that are not under aggregation.

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
  1.0                          
  1.2                          
                                                 [34.0]                                 
  1.5                          
         IBM     150    24.0   Event E3 arrives
         YAH   10000     1.0   Event E4 arrives
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
                                                 [85.0]                                 
  2.5                          
  3.0                          
  3.2                          
                                                 [85.0]                                 
  3.5                          
         YAH   11000     2.0   Event E6 arrives
  4.0                          
  4.2                          
                                                 [87.0]                                 
  4.3                          
         IBM     150    22.0   Event E7 arrives
  4.9                          
         YAH   11500     3.0   Event E8 arrives
  5.0                          
  5.2                          
                                                 [112.0]                                
  5.7                          Event E1 leaves the time window
  5.9                          
         YAH   10500     1.0   Event E9 arrives
  6.0                          
  6.2                          
                                                 [88.0]                                 
  6.3                          Event E2 leaves the time window
  7.0                          Event E3 and E4 leave the time window
  7.2                          
                                                 [54.0]

A.4. Output for Aggregated and Un-Grouped Statements

A.4.1. No Output Rate Limiting

select irstream symbol, sum(price) from MarketData#time(5.5 sec)

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
                                                 [IBM, 25.0]                            
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
                                                 [MSFT, 34.0]                           
  1.0                          
  1.2                          
  1.5                          
         IBM     150    24.0   Event E3 arrives
                                                 [IBM, 58.0]                            
         YAH   10000     1.0   Event E4 arrives
                                                 [YAH, 59.0]                            
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
                                                 [IBM, 85.0]                            
  2.2                          
  2.5                          
  3.0                          
  3.2                          
  3.5                          
         YAH   11000     2.0   Event E6 arrives
                                                 [YAH, 87.0]                            
  4.0                          
  4.2                          
  4.3                          
         IBM     150    22.0   Event E7 arrives
                                                 [IBM, 109.0]                           
  4.9                          
         YAH   11500     3.0   Event E8 arrives
                                                 [YAH, 112.0]                           
  5.0                          
  5.2                          
  5.7                          Event E1 leaves the time window
                                                                    [IBM, 87.0]         
  5.9                          
         YAH   10500     1.0   Event E9 arrives
                                                 [YAH, 88.0]                            
  6.0                          
  6.2                          
  6.3                          Event E2 leaves the time window
                                                                    [MSFT, 79.0]        
  7.0                          Event E3 and E4 leave the time window
                                                                    [IBM, 54.0]         
                                                                    [YAH, 54.0]         
  7.2
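
The table above pairs the symbol of the arriving or leaving event with the sum(price) over the window after the change. The Python sketch below models this. It is illustrative only, with made-up names (EVENTS, WINDOW_MS, symbol_and_sum), and it follows the table in applying all removals that share a timestamp before producing the remove stream rows, so that both rows at 7.0 show 54.0.

```python
# Illustrative model only; this does not use the Esper API.
WINDOW_MS = 5500  # corresponds to #time(5.5 sec)

# (arrival time in ms, symbol, price) for events E1 to E9.
EVENTS = [
    (200, "IBM", 25.0), (800, "MSFT", 9.0), (1500, "IBM", 24.0),
    (1500, "YAH", 1.0), (2100, "IBM", 26.0), (3500, "YAH", 2.0),
    (4300, "IBM", 22.0), (4900, "YAH", 3.0), (5900, "YAH", 1.0),
]


def symbol_and_sum(events, window_ms, end_ms):
    """Return (time_ms, stream, symbol, sum_after_change) in dispatch order."""
    changes = []
    for arrival, symbol, price in events:
        changes.append((arrival, "insert", symbol, price))
        if arrival + window_ms <= end_ms:
            changes.append((arrival + window_ms, "remove", symbol, price))
    changes.sort(key=lambda change: change[0])  # stable sort

    # Combined price of all events leaving at the same time.
    leaving = {}
    for time_ms, stream, _symbol, price in changes:
        if stream == "remove":
            leaving[time_ms] = leaving.get(time_ms, 0.0) + price

    total, rows = 0.0, []
    for time_ms, stream, symbol, price in changes:
        if stream == "insert":
            total += price
        elif time_ms in leaving:
            # First removal at this time: subtract everything leaving now.
            total -= leaving.pop(time_ms)
        rows.append((time_ms, stream, symbol, total))
    return rows


for time_ms, stream, symbol, total in symbol_and_sum(EVENTS, WINDOW_MS, 7200):
    print(f"{time_ms / 1000:.1f}  {stream:6}  [{symbol}, {total}]")
```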

A.4.2. Output Rate Limiting - Default

select irstream symbol, sum(price) from MarketData#time(5.5 sec)
output every 1 seconds

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
  1.0                          
  1.2                          
                                                 [IBM, 25.0]                            
                                                 [MSFT, 34.0]                           
  1.5                          
         IBM     150    24.0   Event E3 arrives
         YAH   10000     1.0   Event E4 arrives
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
                                                 [IBM, 58.0]                            
                                                 [YAH, 59.0]                            
                                                 [IBM, 85.0]                            
  2.5                          
  3.0                          
  3.2                          
                                                 (empty result)     (empty result)      
  3.5                          
         YAH   11000     2.0   Event E6 arrives
  4.0                          
  4.2                          
                                                 [YAH, 87.0]                            
  4.3                          
         IBM     150    22.0   Event E7 arrives
  4.9                          
         YAH   11500     3.0   Event E8 arrives
  5.0                          
  5.2                          
                                                 [IBM, 109.0]                           
                                                 [YAH, 112.0]                           
  5.7                          Event E1 leaves the time window
  5.9                          
         YAH   10500     1.0   Event E9 arrives
  6.0                          
  6.2                          
                                                 [YAH, 88.0]        [IBM, 87.0]         
  6.3                          Event E2 leaves the time window
  7.0                          Event E3 and E4 leave the time window
  7.2                          
                                                                    [MSFT, 79.0]        
                                                                    [IBM, 54.0]         
                                                                    [YAH, 54.0]
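
The batching in the table above can be reproduced with a small simulation. The Python sketch below is not Esper code. It is a hypothetical model (the names change_rows and rate_limited are invented for illustration) that assumes times in milliseconds, output intervals ending at 1.2, 2.2 and so on as in the table, and that events leaving the window at the same instant are removed together before the sum is reported.

```python
from itertools import groupby

WINDOW_MS = 5500         # models #time(5.5 sec)
INTERVAL_MS = 1000       # models "output every 1 seconds"
FIRST_OUTPUT_MS = 1200   # assumption: first interval ends at 1.2, as in the table

# (arrival time in ms, symbol, price) for E1..E9, copied from the input columns
EVENTS = [
    (200, "IBM", 25.0), (800, "MSFT", 9.0), (1500, "IBM", 24.0),
    (1500, "YAH", 1.0), (2100, "IBM", 26.0), (3500, "YAH", 2.0),
    (4300, "IBM", 22.0), (4900, "YAH", 3.0), (5900, "YAH", 1.0),
]


def change_rows(events, until_ms):
    """Return (time_ms, stream, symbol, total) rows with no rate limiting."""
    timeline = [(t, "insert", s, p) for t, s, p in events]
    timeline += [(t + WINDOW_MS, "remove", s, p) for t, s, p in events]
    timeline = [x for x in timeline if x[0] <= until_ms]
    # At equal times, process expiries before arrivals (an assumption).
    timeline.sort(key=lambda x: (x[0], x[1] == "insert"))

    total, rows = 0.0, []
    for (t, stream), group in groupby(timeline, key=lambda x: (x[0], x[1])):
        group = list(group)
        if stream == "insert":
            # Each arriving event reports the sum as of its own arrival.
            for _, _, symbol, price in group:
                total += price
                rows.append((t, stream, symbol, total))
        else:
            # Events leaving together report the sum after all have left.
            total -= sum(item[3] for item in group)
            rows.extend((t, stream, item[2], total) for item in group)
    return rows


def rate_limited(rows, until_ms):
    """Group rows into one (insert_rows, remove_rows) pair per output interval."""
    batches, i = {}, 0
    for end in range(FIRST_OUTPUT_MS, until_ms + 1, INTERVAL_MS):
        inserts, removes = [], []
        while i < len(rows) and rows[i][0] <= end:
            _, stream, symbol, total = rows[i]
            (inserts if stream == "insert" else removes).append((symbol, total))
            i += 1
        batches[end] = (inserts, removes)   # two empty lists: "(empty result)"
    return batches


if __name__ == "__main__":
    for end, (ins, rem) in rate_limited(change_rows(EVENTS, 7200), 7200).items():
        print(end / 1000, ins, rem)
```

Under these assumptions, the batch for 6.2 is ([('YAH', 88.0)], [('IBM', 87.0)]), the row pair shown in the table. Keeping only the final row of each list per interval gives the rows listed for output last in A.4.3.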

A.4.3. Output Rate Limiting - Last

select irstream symbol, sum(price) from MarketData#time(5.5 sec) 
output last every 1 seconds

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
  1.0                          
  1.2                          
                                                 [MSFT, 34.0]                           
  1.5                          
         IBM     150    24.0   Event E3 arrives
         YAH   10000     1.0   Event E4 arrives
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
                                                 [IBM, 85.0]                            
  2.5                          
  3.0                          
  3.2                          
                                                 (empty result)     (empty result)      
  3.5                          
         YAH   11000     2.0   Event E6 arrives
  4.0                          
  4.2                          
                                                 [YAH, 87.0]                            
  4.3                          
         IBM     150    22.0   Event E7 arrives
  4.9                          
         YAH   11500     3.0   Event E8 arrives
  5.0                          
  5.2                          
                                                 [YAH, 112.0]                           
  5.7                          Event E1 leaves the time window
  5.9                          
         YAH   10500     1.0   Event E9 arrives
  6.0                          
  6.2                          
                                                 [YAH, 88.0]        [IBM, 87.0]         
  6.3                          Event E2 leaves the time window
  7.0                          Event E3 and E4 leave the time window
  7.2                          
                                                                    [YAH, 54.0]

A.4.4. Output Rate Limiting - First

select irstream symbol, sum(price) from MarketData#time(5.5 sec)
output first every 1 seconds

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
                                                 [IBM, 25.0]                            
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
  1.0                          
  1.2                          
  1.5                          
         IBM     150    24.0   Event E3 arrives
                                                 [IBM, 58.0]                            
         YAH   10000     1.0   Event E4 arrives
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
  2.5                          
  3.0                          
  3.2                          
  3.5                          
         YAH   11000     2.0   Event E6 arrives
                                                 [YAH, 87.0]                            
  4.0                          
  4.2                          
  4.3                          
         IBM     150    22.0   Event E7 arrives
                                                 [IBM, 109.0]                           
  4.9                          
         YAH   11500     3.0   Event E8 arrives
  5.0                          
  5.2                          
  5.7                          Event E1 leaves the time window
                                                                    [IBM, 87.0]         
  5.9                          
         YAH   10500     1.0   Event E9 arrives
  6.0                          
  6.2                          
  6.3                          Event E2 leaves the time window
                                                                    [MSFT, 79.0]        
  7.0                          Event E3 and E4 leave the time window
  7.2

A.4.5. Output Rate Limiting - Snapshot

select irstream symbol, sum(price) from MarketData#time(5.5 sec)
output snapshot every 1 seconds

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
  1.0                          
  1.2                          
                                                 [IBM, 34.0]                            
                                                 [MSFT, 34.0]                           
  1.5                          
         IBM     150    24.0   Event E3 arrives
         YAH   10000     1.0   Event E4 arrives
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
                                                 [IBM, 85.0]                            
                                                 [MSFT, 85.0]                           
                                                 [IBM, 85.0]                            
                                                 [YAH, 85.0]                            
                                                 [IBM, 85.0]                            
  2.5                          
  3.0                          
  3.2                          
                                                 [IBM, 85.0]                            
                                                 [MSFT, 85.0]                           
                                                 [IBM, 85.0]                            
                                                 [YAH, 85.0]                            
                                                 [IBM, 85.0]                            
  3.5                          
         YAH   11000     2.0   Event E6 arrives
  4.0                          
  4.2                          
                                                 [IBM, 87.0]                            
                                                 [MSFT, 87.0]                           
                                                 [IBM, 87.0]                            
                                                 [YAH, 87.0]                            
                                                 [IBM, 87.0]                            
                                                 [YAH, 87.0]                            
  4.3                          
         IBM     150    22.0   Event E7 arrives
  4.9                          
         YAH   11500     3.0   Event E8 arrives
  5.0                          
  5.2                          
                                                 [IBM, 112.0]                           
                                                 [MSFT, 112.0]                          
                                                 [IBM, 112.0]                           
                                                 [YAH, 112.0]                           
                                                 [IBM, 112.0]                           
                                                 [YAH, 112.0]                           
                                                 [IBM, 112.0]                           
                                                 [YAH, 112.0]                           
  5.7                          Event E1 leaves the time window
  5.9                          
         YAH   10500     1.0   Event E9 arrives
  6.0                          
  6.2                          
                                                 [MSFT, 88.0]                           
                                                 [IBM, 88.0]                            
                                                 [YAH, 88.0]                            
                                                 [IBM, 88.0]                            
                                                 [YAH, 88.0]                            
                                                 [IBM, 88.0]                            
                                                 [YAH, 88.0]                            
                                                 [YAH, 88.0]                            
  6.3                          Event E2 leaves the time window
  7.0                          Event E3 and E4 leave the time window
  7.2                          
                                                 [IBM, 54.0]                            
                                                 [YAH, 54.0]                            
                                                 [IBM, 54.0]                            
                                                 [YAH, 54.0]                            
                                                 [YAH, 54.0]

A.5. Output for Fully-Aggregated and Grouped Statements

A.5.1. No Output Rate Limiting

select irstream symbol, sum(price) from MarketData#time(5.5 sec) 
group by symbol
order by symbol

With output rate limiting, the default (no keyword) and the ALL keyword do not produce the same output: the default generates one output row per input event, while the ALL keyword generates one row for every group.

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
                                                 [IBM, 25.0]        [IBM, null]         
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
                                                 [MSFT, 9.0]        [MSFT, null]        
  1.0                          
  1.2                          
  1.5                          
         IBM     150    24.0   Event E3 arrives
                                                 [IBM, 49.0]        [IBM, 25.0]         
         YAH   10000     1.0   Event E4 arrives
                                                 [YAH, 1.0]         [YAH, null]         
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
                                                 [IBM, 75.0]        [IBM, 49.0]         
  2.2                          
  2.5                          
  3.0                          
  3.2                          
  3.5                          
         YAH   11000     2.0   Event E6 arrives
                                                 [YAH, 3.0]         [YAH, 1.0]          
  4.0                          
  4.2                          
  4.3                          
         IBM     150    22.0   Event E7 arrives
                                                 [IBM, 97.0]        [IBM, 75.0]         
  4.9                          
         YAH   11500     3.0   Event E8 arrives
                                                 [YAH, 6.0]         [YAH, 3.0]          
  5.0                          
  5.2                          
  5.7                          Event E1 leaves the time window
                                                 [IBM, 72.0]        [IBM, 97.0]         
  5.9                          
         YAH   10500     1.0   Event E9 arrives
                                                 [YAH, 7.0]         [YAH, 6.0]          
  6.0                          
  6.2                          
  6.3                          Event E2 leaves the time window
                                                 [MSFT, null]       [MSFT, 9.0]         
  7.0                          Event E3 and E4 leave the time window
                                                 [IBM, 48.0]        [IBM, 72.0]         
                                                 [YAH, 6.0]         [YAH, 7.0]          
  7.2
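
The insert and remove stream pairs in the table above follow a simple rule: each arrival or expiry changes one group's sum, the new sum goes to the insert stream and the previous sum goes to the remove stream. The Python sketch below models that rule. It is an illustration only, not Esper's implementation: the function name grouped_rows is hypothetical, times are assumed to be in milliseconds, None stands in for null, and each arrival and expiry is processed one at a time.

```python
from collections import defaultdict

WINDOW_MS = 5500  # models #time(5.5 sec)

# (arrival time in ms, symbol, price) for E1..E9, copied from the input columns
EVENTS = [
    (200, "IBM", 25.0), (800, "MSFT", 9.0), (1500, "IBM", 24.0),
    (1500, "YAH", 1.0), (2100, "IBM", 26.0), (3500, "YAH", 2.0),
    (4300, "IBM", 22.0), (4900, "YAH", 3.0), (5900, "YAH", 1.0),
]


def grouped_rows(events, until_ms):
    """Return (time_ms, insert_row, remove_row) for each arrival and expiry."""
    timeline = []
    for seq, (t, symbol, price) in enumerate(events):
        timeline.append((t, 1, seq, symbol, price))                   # 1 = arrival
        if t + WINDOW_MS <= until_ms:
            timeline.append((t + WINDOW_MS, 0, seq, symbol, -price))  # 0 = expiry
    timeline.sort()  # by time, then expiries before arrivals, then input order

    totals, counts, rows = defaultdict(float), defaultdict(int), []
    for t, kind, _seq, symbol, delta in timeline:
        # A group with no events in the window has a null (None) sum.
        old = totals[symbol] if counts[symbol] else None
        totals[symbol] += delta
        counts[symbol] += 1 if kind == 1 else -1
        new = totals[symbol] if counts[symbol] else None
        rows.append((t, (symbol, new), (symbol, old)))
    return rows


if __name__ == "__main__":
    for t, insert_row, remove_row in grouped_rows(EVENTS, 7200):
        print(t / 1000, insert_row, remove_row)
```

The per-group event count is what lets the sketch report null rather than 0.0 once the last MSFT event leaves the window at 6.3.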

A.5.2. Output Rate Limiting - Default

select irstream symbol, sum(price) from MarketData#time(5.5 sec) 
group by symbol 
output every 1 seconds

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
  1.0                          
  1.2                          
                                                 [IBM, 25.0]        [IBM, null]         
                                                 [MSFT, 9.0]        [MSFT, null]        
  1.5                          
         IBM     150    24.0   Event E3 arrives
         YAH   10000     1.0   Event E4 arrives
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
                                                 [IBM, 49.0]        [IBM, 25.0]         
                                                 [YAH, 1.0]         [YAH, null]         
                                                 [IBM, 75.0]        [IBM, 49.0]         
  2.5                          
  3.0                          
  3.2                          
                                                 (empty result)     (empty result)      
  3.5                          
         YAH   11000     2.0   Event E6 arrives
  4.0                          
  4.2                          
                                                 [YAH, 3.0]         [YAH, 1.0]          
  4.3                          
         IBM     150    22.0   Event E7 arrives
  4.9                          
         YAH   11500     3.0   Event E8 arrives
  5.0                          
  5.2                          
                                                 [IBM, 97.0]        [IBM, 75.0]         
                                                 [YAH, 6.0]         [YAH, 3.0]          
  5.7                          Event E1 leaves the time window
  5.9                          
         YAH   10500     1.0   Event E9 arrives
  6.0                          
  6.2                          
                                                 [IBM, 72.0]        [IBM, 97.0]         
                                                 [YAH, 7.0]         [YAH, 6.0]          
  6.3                          Event E2 leaves the time window
  7.0                          Event E3 and E4 leave the time window
  7.2                          
                                                 [MSFT, null]       [MSFT, 9.0]         
                                                 [YAH, 6.0]         [YAH, 7.0]          
                                                 [IBM, 48.0]        [IBM, 72.0]

A.5.3. Output Rate Limiting - All

select irstream symbol, sum(price) from MarketData#time(5.5 sec) 
group by symbol 
output all every 1 seconds 
order by symbol

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
  1.0                          
  1.2                          
                                                 [IBM, 25.0]        [IBM, null]         
                                                 [MSFT, 9.0]        [MSFT, null]        
  1.5                          
         IBM     150    24.0   Event E3 arrives
         YAH   10000     1.0   Event E4 arrives
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
                                                 [IBM, 75.0]        [IBM, 25.0]         
                                                 [MSFT, 9.0]        [MSFT, 9.0]         
                                                 [YAH, 1.0]         [YAH, null]         
  2.5                          
  3.0                          
  3.2                          
                                                 [IBM, 75.0]        [IBM, 75.0]         
                                                 [MSFT, 9.0]        [MSFT, 9.0]         
                                                 [YAH, 1.0]         [YAH, 1.0]          
  3.5                          
         YAH   11000     2.0   Event E6 arrives
  4.0                          
  4.2                          
                                                 [IBM, 75.0]        [IBM, 75.0]         
                                                 [MSFT, 9.0]        [MSFT, 9.0]         
                                                 [YAH, 3.0]         [YAH, 1.0]          
  4.3                          
         IBM     150    22.0   Event E7 arrives
  4.9                          
         YAH   11500     3.0   Event E8 arrives
  5.0                          
  5.2                          
                                                 [IBM, 97.0]        [IBM, 75.0]         
                                                 [MSFT, 9.0]        [MSFT, 9.0]         
                                                 [YAH, 6.0]         [YAH, 3.0]          
  5.7                          Event E1 leaves the time window
  5.9                          
         YAH   10500     1.0   Event E9 arrives
  6.0                          
  6.2                          
                                                 [IBM, 72.0]        [IBM, 97.0]         
                                                 [MSFT, 9.0]        [MSFT, 9.0]         
                                                 [YAH, 7.0]         [YAH, 6.0]          
  6.3                          Event E2 leaves the time window
  7.0                          Event E3 and E4 leave the time window
  7.2                          
                                                 [IBM, 48.0]        [IBM, 72.0]         
                                                 [MSFT, null]       [MSFT, 9.0]         
                                                 [YAH, 6.0]         [YAH, 7.0]

A.5.4. Output Rate Limiting - Last

select irstream symbol, sum(price) from MarketData#time(5.5 sec)
group by symbol 
output last every 1 seconds 
order by symbol

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
  1.0                          
  1.2                          
                                                 [IBM, 25.0]        [IBM, null]         
                                                 [MSFT, 9.0]        [MSFT, null]        
  1.5                          
         IBM     150    24.0   Event E3 arrives
         YAH   10000     1.0   Event E4 arrives
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
                                                 [IBM, 75.0]        [IBM, 25.0]         
                                                 [YAH, 1.0]         [YAH, null]         
  2.5                          
  3.0                          
  3.2                          
                                                 (empty result)     (empty result)      
  3.5                          
         YAH   11000     2.0   Event E6 arrives
  4.0                          
  4.2                          
                                                 [YAH, 3.0]         [YAH, 1.0]          
  4.3                          
         IBM     150    22.0   Event E7 arrives
  4.9                          
         YAH   11500     3.0   Event E8 arrives
  5.0                          
  5.2                          
                                                 [IBM, 97.0]        [IBM, 75.0]         
                                                 [YAH, 6.0]         [YAH, 3.0]          
  5.7                          Event E1 leaves the time window
  5.9                          
         YAH   10500     1.0   Event E9 arrives
  6.0                          
  6.2                          
                                                 [IBM, 72.0]        [IBM, 97.0]         
                                                 [YAH, 7.0]         [YAH, 6.0]          
  6.3                          Event E2 leaves the time window
  7.0                          Event E3 and E4 leave the time window
  7.2                          
                                                 [IBM, 48.0]        [IBM, 72.0]         
                                                 [MSFT, null]       [MSFT, 9.0]         
                                                 [YAH, 6.0]         [YAH, 7.0]

A.5.5. Output Rate Limiting - First

select irstream symbol, sum(price) from MarketData#time(5.5 sec)
group by symbol
output first every 1 seconds

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
                                                 [IBM, 25.0]        [IBM, null]         
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
                                                 [MSFT, 9.0]        [MSFT, null]        
  1.0                          
  1.2                          
  1.5                          
         IBM     150    24.0   Event E3 arrives
                                                 [IBM, 49.0]        [IBM, 25.0]         
         YAH   10000     1.0   Event E4 arrives
                                                 [YAH, 1.0]         [YAH, null]         
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
  2.5                          
  3.0                          
  3.2                          
  3.5                          
         YAH   11000     2.0   Event E6 arrives
                                                 [YAH, 3.0]         [YAH, 1.0]          
  4.0                          
  4.2                          
  4.3                          
         IBM     150    22.0   Event E7 arrives
                                                 [IBM, 97.0]        [IBM, 75.0]         
  4.9                          
         YAH   11500     3.0   Event E8 arrives
                                                 [YAH, 6.0]         [YAH, 3.0]          
  5.0                          
  5.2                          
  5.7                          Event E1 leaves the time window
                                                 [IBM, 72.0]        [IBM, 97.0]         
  5.9                          
         YAH   10500     1.0   Event E9 arrives
                                                 [YAH, 7.0]         [YAH, 6.0]          
  6.0                          
  6.2                          
  6.3                          Event E2 leaves the time window
                                                 [MSFT, null]       [MSFT, 9.0]         
  7.0                          Event E3 and E4 leave the time window
                                                 [IBM, 48.0]        [IBM, 72.0]         
                                                 [YAH, 6.0]         [YAH, 7.0]          
  7.2

A.5.6. Output Rate Limiting - Snapshot

select irstream symbol, sum(price) from MarketData#time(5.5 sec) 
group by symbol 
output snapshot every 1 seconds 
order by symbol

Section A.6 below provides sample output for statements that have aggregation functions and a group by clause, and in which some event properties are not under aggregation.

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
  1.0                          
  1.2                          
                                                 [IBM, 25.0]                            
                                                 [MSFT, 9.0]                            
  1.5                          
         IBM     150    24.0   Event E3 arrives
         YAH   10000     1.0   Event E4 arrives
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
                                                 [IBM, 75.0]                            
                                                 [MSFT, 9.0]                            
                                                 [YAH, 1.0]                             
  2.5                          
  3.0                          
  3.2                          
                                                 [IBM, 75.0]                            
                                                 [MSFT, 9.0]                            
                                                 [YAH, 1.0]                             
  3.5                          
         YAH   11000     2.0   Event E6 arrives
  4.0                          
  4.2                          
                                                 [IBM, 75.0]                            
                                                 [MSFT, 9.0]                            
                                                 [YAH, 3.0]                             
  4.3                          
         IBM     150    22.0   Event E7 arrives
  4.9                          
         YAH   11500     3.0   Event E8 arrives
  5.0                          
  5.2                          
                                                 [IBM, 97.0]                            
                                                 [MSFT, 9.0]                            
                                                 [YAH, 6.0]                             
  5.7                          Event E1 leaves the time window
  5.9                          
         YAH   10500     1.0   Event E9 arrives
  6.0                          
  6.2                          
                                                 [IBM, 72.0]                            
                                                 [MSFT, 9.0]                            
                                                 [YAH, 7.0]                             
  6.3                          Event E2 leaves the time window
  7.0                          Event E3 and E4 leave the time window
  7.2                          
                                                 [IBM, 48.0]                            
                                                 [YAH, 6.0]
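
Each snapshot in the table above lists the per-group sum over whatever events are in the time window at the output time, ordered by symbol. A minimal Python sketch of that calculation follows. The function name snapshot and the millisecond time units are assumptions made for illustration, and the sketch treats an event as having left the window exactly 5.5 seconds after it arrived.

```python
WINDOW_MS = 5500  # models #time(5.5 sec)

# (arrival time in ms, symbol, price) for E1..E9, copied from the input columns
EVENTS = [
    (200, "IBM", 25.0), (800, "MSFT", 9.0), (1500, "IBM", 24.0),
    (1500, "YAH", 1.0), (2100, "IBM", 26.0), (3500, "YAH", 2.0),
    (4300, "IBM", 22.0), (4900, "YAH", 3.0), (5900, "YAH", 1.0),
]


def snapshot(events, at_ms):
    """Return [(symbol, sum of price)] for events in the window at at_ms."""
    totals = {}
    for t, symbol, price in events:
        # An event counts from its arrival until it leaves WINDOW_MS later.
        if t <= at_ms < t + WINDOW_MS:
            totals[symbol] = totals.get(symbol, 0.0) + price
    return sorted(totals.items())  # models "order by symbol"


if __name__ == "__main__":
    for at_ms in range(1200, 7201, 1000):
        print(at_ms / 1000, snapshot(EVENTS, at_ms))
```

Because a snapshot reflects only current window contents, a group whose events have all left the window, such as MSFT at 7.2, no longer appears.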

A.6. Output for Aggregated and Grouped Statements

A.6.1. No Output Rate Limiting

select irstream symbol, volume, sum(price) from MarketData#time(5.5 sec) group by symbol

With output rate limiting, the default (no keyword) and the ALL keyword do not produce the same output: the default generates one output row per input event, while the ALL keyword generates one row for every group, based on the last new event for each group.

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
                                                 [IBM, 100, 25.0]                       
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
                                                 [MSFT, 5000, 9.0]                      
  1.0                          
  1.2                          
  1.5                          
         IBM     150    24.0   Event E3 arrives
                                                 [IBM, 150, 49.0]                       
         YAH   10000     1.0   Event E4 arrives
                                                 [YAH, 10000, 1.0]                      
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
                                                 [IBM, 155, 75.0]                       
  2.2                          
  2.5                          
  3.0                          
  3.2                          
  3.5                          
         YAH   11000     2.0   Event E6 arrives
                                                 [YAH, 11000, 3.0]                      
  4.0                          
  4.2                          
  4.3                          
         IBM     150    22.0   Event E7 arrives
                                                 [IBM, 150, 97.0]                       
  4.9                          
         YAH   11500     3.0   Event E8 arrives
                                                 [YAH, 11500, 6.0]                      
  5.0                          
  5.2                          
  5.7                          Event E1 leaves the time window
                                                                    [IBM, 100, 72.0]    
  5.9                          
         YAH   10500     1.0   Event E9 arrives
                                                 [YAH, 10500, 7.0]                      
  6.0                          
  6.2                          
  6.3                          Event E2 leaves the time window
                                                                    [MSFT, 5000, null]  
  7.0                          Event E3 and E4 leave the time window
                                                                    [IBM, 150, 48.0]    
                                                                    [YAH, 10000, 6.0]   
  7.2
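
The rows in the table above can be reproduced with a small simulation. The following Python sketch is a simplified model for checking the arithmetic, not the way the runtime is implemented: the names WINDOW, EVENTS and simulate are hypothetical, and it assumes that each event leaves the window exactly 5.5 seconds after it arrives and that a group with no remaining events reports null (None).

```python
from collections import defaultdict

WINDOW = 5.5  # seconds, as in MarketData#time(5.5 sec)

# (arrival time, symbol, volume, price) for events E1..E9 from the table
EVENTS = [
    (0.2, "IBM", 100, 25.0), (0.8, "MSFT", 5000, 9.0),
    (1.5, "IBM", 150, 24.0), (1.5, "YAH", 10000, 1.0),
    (2.1, "IBM", 155, 26.0), (3.5, "YAH", 11000, 2.0),
    (4.3, "IBM", 150, 22.0), (4.9, "YAH", 11500, 3.0),
    (5.9, "YAH", 10500, 1.0),
]

def simulate(events, until):
    """Return (time, stream, row) tuples for all activity up to `until`."""
    timeline = []
    for seq, (t, sym, vol, price) in enumerate(events):
        timeline.append((t, 1, seq, sym, vol, price))                     # arrival
        timeline.append((round(t + WINDOW, 1), 0, seq, sym, vol, price))  # expiry
    timeline.sort()  # by time; expiries sort before arrivals at the same time

    sums, counts, out = defaultdict(float), defaultdict(int), []
    for t, kind, _seq, sym, vol, price in timeline:
        if t > until:
            break
        if kind == 1:
            # Arriving event: add its price to the group sum, post to insert stream.
            sums[sym] += price
            counts[sym] += 1
            out.append((t, "insert", (sym, vol, sums[sym])))
        else:
            # Leaving event: subtract its price, post to remove stream.
            sums[sym] -= price
            counts[sym] -= 1
            total = sums[sym] if counts[sym] else None  # empty group -> null
            out.append((t, "remove", (sym, vol, total)))
    return out

for t, stream, row in simulate(EVENTS, 7.2):
    print(t, stream, row)
```

Each printed row carries the volume of the event that arrived or left together with the sum(price) of its group after the change, which is why the row for E1 leaving shows the new sum of 72.0.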

A.6.2. Output Rate Limiting - Default

select irstream symbol, volume, sum(price) from MarketData#time(5.5 sec) 
group by symbol 
output every 1 seconds

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
  1.0                          
  1.2                          
                                                 [IBM, 100, 25.0]                       
                                                 [MSFT, 5000, 9.0]                      
  1.5                          
         IBM     150    24.0   Event E3 arrives
         YAH   10000     1.0   Event E4 arrives
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
                                                 [IBM, 150, 49.0]                       
                                                 [YAH, 10000, 1.0]                      
                                                 [IBM, 155, 75.0]                       
  2.5                          
  3.0                          
  3.2                          
                                                 (empty result)     (empty result)      
  3.5                          
         YAH   11000     2.0   Event E6 arrives
  4.0                          
  4.2                          
                                                 [YAH, 11000, 3.0]                      
  4.3                          
         IBM     150    22.0   Event E7 arrives
  4.9                          
         YAH   11500     3.0   Event E8 arrives
  5.0                          
  5.2                          
                                                 [IBM, 150, 97.0]                       
                                                 [YAH, 11500, 6.0]                      
  5.7                          Event E1 leaves the time window
  5.9                          
         YAH   10500     1.0   Event E9 arrives
  6.0                          
  6.2                          
                                                 [YAH, 10500, 7.0]  [IBM, 100, 72.0]    
  6.3                          Event E2 leaves the time window
  7.0                          Event E3 and E4 leave the time window
  7.2                          
                                                                    [MSFT, 5000, null]  
                                                                    [IBM, 150, 48.0]    
                                                                    [YAH, 10000, 6.0]

A.6.3. Output Rate Limiting - All

select irstream symbol, volume, sum(price) from MarketData#time(5.5 sec) 
group by symbol 
output all every 1 seconds 
order by symbol

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
  1.0                          
  1.2                          
                                                 [IBM, 100, 25.0]                       
                                                 [MSFT, 5000, 9.0]                      
  1.5                          
         IBM     150    24.0   Event E3 arrives
         YAH   10000     1.0   Event E4 arrives
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
                                                 [IBM, 150, 49.0]                       
                                                 [IBM, 155, 75.0]                       
                                                 [MSFT, 5000, 9.0]                      
                                                 [YAH, 10000, 1.0]                      
  2.5                          
  3.0                          
  3.2                          
                                                 [IBM, 155, 75.0]                       
                                                 [MSFT, 5000, 9.0]                      
                                                 [YAH, 10000, 1.0]                      
  3.5                          
         YAH   11000     2.0   Event E6 arrives
  4.0                          
  4.2                          
                                                 [IBM, 155, 75.0]                       
                                                 [MSFT, 5000, 9.0]                      
                                                 [YAH, 11000, 3.0]                      
  4.3                          
         IBM     150    22.0   Event E7 arrives
  4.9                          
         YAH   11500     3.0   Event E8 arrives
  5.0                          
  5.2                          
                                                 [IBM, 150, 97.0]                       
                                                 [MSFT, 5000, 9.0]                      
                                                 [YAH, 11500, 6.0]                      
  5.7                          Event E1 leaves the time window
  5.9                          
         YAH   10500     1.0   Event E9 arrives
  6.0                          
  6.2                          
                                                 [IBM, 150, 72.0]   [IBM, 100, 72.0]    
                                                 [MSFT, 5000, 9.0]                      
                                                 [YAH, 10500, 7.0]                      
  6.3                          Event E2 leaves the time window
  7.0                          Event E3 and E4 leave the time window
  7.2                          
                                                 [IBM, 150, 48.0]   [IBM, 150, 48.0]    
                                                 [MSFT, 5000, null] [MSFT, 5000, null]  
                                                 [YAH, 10500, 6.0]  [YAH, 10000, 6.0]

A.6.4. Output Rate Limiting - Last

select irstream symbol, volume, sum(price) from MarketData#time(5.5 sec)
group by symbol 
output last every 1 seconds 
order by symbol

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
  1.0                          
  1.2                          
                                                 [IBM, 100, 25.0]                       
                                                 [MSFT, 5000, 9.0]                      
  1.5                          
         IBM     150    24.0   Event E3 arrives
         YAH   10000     1.0   Event E4 arrives
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
                                                 [IBM, 155, 75.0]                       
                                                 [YAH, 10000, 1.0]                      
  2.5                          
  3.0                          
  3.2                          
                                                 (empty result)     (empty result)      
  3.5                          
         YAH   11000     2.0   Event E6 arrives
  4.0                          
  4.2                          
                                                 [YAH, 11000, 3.0]                      
  4.3                          
         IBM     150    22.0   Event E7 arrives
  4.9                          
         YAH   11500     3.0   Event E8 arrives
  5.0                          
  5.2                          
                                                 [IBM, 150, 97.0]                       
                                                 [YAH, 11500, 6.0]                      
  5.7                          Event E1 leaves the time window
  5.9                          
         YAH   10500     1.0   Event E9 arrives
  6.0                          
  6.2                          
                                                 [YAH, 10500, 7.0]  [IBM, 100, 72.0]    
  6.3                          Event E2 leaves the time window
  7.0                          Event E3 and E4 leave the time window
  7.2                          
                                                                    [IBM, 150, 48.0]    
                                                                    [MSFT, 5000, null]  
                                                                    [YAH, 10000, 6.0]
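
The relationship between the default output (A.6.2) and the LAST output (A.6.4) can be sketched in Python as follows. This is an illustrative model only, covering the insert stream: the names ROWS, fire_time and batch are hypothetical, times are written in tenths of a second to keep the arithmetic exact, and the sketch assumes that output fires at 1.2 seconds and every second thereafter, as in the tables.

```python
# Insert-stream rows of the statement without rate limiting (A.6.1),
# keyed by arrival time in tenths of a second.
ROWS = [
    (2,  ("IBM", 100, 25.0)),
    (8,  ("MSFT", 5000, 9.0)),
    (15, ("IBM", 150, 49.0)),
    (15, ("YAH", 10000, 1.0)),
    (21, ("IBM", 155, 75.0)),
    (35, ("YAH", 11000, 3.0)),
    (43, ("IBM", 150, 97.0)),
    (49, ("YAH", 11500, 6.0)),
    (59, ("YAH", 10500, 7.0)),
]

FIRST_FIRE = 12  # first output at 1.2 sec (assumption taken from the tables)
INTERVAL = 10    # "every 1 seconds"

def fire_time(t):
    """Return the first output time (in tenths) at or after t."""
    if t <= FIRST_FIRE:
        return FIRST_FIRE
    steps = -(-(t - FIRST_FIRE) // INTERVAL)  # ceiling division
    return FIRST_FIRE + steps * INTERVAL

def batch(rows, keep_last_per_group=False):
    """Hold rows until the next output time; optionally keep one row per group."""
    batches = {}
    for t, row in rows:
        batches.setdefault(fire_time(t), []).append(row)
    if keep_last_per_group:
        for fire, held in batches.items():
            last = {}
            for row in held:
                last[row[0]] = row  # later rows overwrite earlier ones
            batches[fire] = [last[s] for s in sorted(last)]  # order by symbol
    return batches

default_output = batch(ROWS)
last_output = batch(ROWS, keep_last_per_group=True)
print(default_output[22])  # three rows, one per event
print(last_output[22])     # two rows, one per group
```

Intervals in which no event arrives, such as the one ending at 3.2 seconds, simply have no entry in the returned dictionary. The tables show them as an empty result.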

A.6.5. Output Rate Limiting - First

select irstream symbol, volume, sum(price) from MarketData#time(5.5 sec) 
group by symbol 
output first every 1 seconds

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
                                                 [IBM, 100, 25.0]                       
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
                                                 [MSFT, 5000, 9.0]                      
  1.0                          
  1.2                          
  1.5                          
         IBM     150    24.0   Event E3 arrives
                                                 [IBM, 150, 49.0]                       
         YAH   10000     1.0   Event E4 arrives
                                                 [YAH, 10000, 1.0]                      
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
  2.5                          
  3.0                          
  3.2                          
  3.5                          
         YAH   11000     2.0   Event E6 arrives
                                                 [YAH, 11000, 3.0]                      
  4.0                          
  4.2                          
  4.3                          
         IBM     150    22.0   Event E7 arrives
                                                 [IBM, 150, 97.0]                       
  4.9                          
         YAH   11500     3.0   Event E8 arrives
                                                 [YAH, 11500, 6.0]                      
  5.0                          
  5.2                          
  5.7                          Event E1 leaves the time window
                                                 [IBM, 100, 72.0]                       
  5.9                          
         YAH   10500     1.0   Event E9 arrives
                                                 [YAH, 10500, 7.0]                      
  6.0                          
  6.2                          
  6.3                          Event E2 leaves the time window
                                                 [MSFT, 5000, null]                     
  7.0                          Event E3 and E4 leave the time window
                                                 [IBM, 150, 48.0]                       
                                                 [YAH, 10000, 6.0]                      
  7.2

A.6.6. Output Rate Limiting - Snapshot

select irstream symbol, volume, sum(price) from MarketData#time(5.5 sec) 
group by symbol 
output snapshot every 1 seconds

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
  1.0                          
  1.2                          
                                                 [IBM, 100, 25.0]                       
                                                 [MSFT, 5000, 9.0]                      
  1.5                          
         IBM     150    24.0   Event E3 arrives
         YAH   10000     1.0   Event E4 arrives
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
                                                 [IBM, 100, 75.0]                       
                                                 [MSFT, 5000, 9.0]                      
                                                 [IBM, 150, 75.0]                       
                                                 [YAH, 10000, 1.0]                      
                                                 [IBM, 155, 75.0]                       
  2.5                          
  3.0                          
  3.2                          
                                                 [IBM, 100, 75.0]                       
                                                 [MSFT, 5000, 9.0]                      
                                                 [IBM, 150, 75.0]                       
                                                 [YAH, 10000, 1.0]                      
                                                 [IBM, 155, 75.0]                       
  3.5                          
         YAH   11000     2.0   Event E6 arrives
  4.0                          
  4.2                          
                                                 [IBM, 100, 75.0]                       
                                                 [MSFT, 5000, 9.0]                      
                                                 [IBM, 150, 75.0]                       
                                                 [YAH, 10000, 3.0]                      
                                                 [IBM, 155, 75.0]                       
                                                 [YAH, 11000, 3.0]                      
  4.3                          
         IBM     150    22.0   Event E7 arrives
  4.9                          
         YAH   11500     3.0   Event E8 arrives
  5.0                          
  5.2                          
                                                 [IBM, 100, 97.0]                       
                                                 [MSFT, 5000, 9.0]                      
                                                 [IBM, 150, 97.0]                       
                                                 [YAH, 10000, 6.0]                      
                                                 [IBM, 155, 97.0]                       
                                                 [YAH, 11000, 6.0]                      
                                                 [IBM, 150, 97.0]                       
                                                 [YAH, 11500, 6.0]                      
  5.7                          Event E1 leaves the time window
  5.9                          
         YAH   10500     1.0   Event E9 arrives
  6.0                          
  6.2                          
                                                 [MSFT, 5000, 9.0]                      
                                                 [IBM, 150, 72.0]                       
                                                 [YAH, 10000, 7.0]                      
                                                 [IBM, 155, 72.0]                       
                                                 [YAH, 11000, 7.0]                      
                                                 [IBM, 150, 72.0]                       
                                                 [YAH, 11500, 7.0]                      
                                                 [YAH, 10500, 7.0]                      
  6.3                          Event E2 leaves the time window
  7.0                          Event E3 and E4 leave the time window
  7.2                          
                                                 [IBM, 155, 48.0]                       
                                                 [YAH, 11000, 6.0]                      
                                                 [IBM, 150, 48.0]                       
                                                 [YAH, 11500, 6.0]                      
                                                 [YAH, 10500, 6.0]

A.7. Output for Fully-Aggregated, Grouped Statements With Rollup

This chapter provides sample output for statements that have aggregation functions, and that have a group by clause, and in which all event properties are under aggregation or appear in the group by clause, and the group by clause has a rollup, cube or grouping sets keyword(s) instructing the runtime to perform multi-level aggregation.

A.7.1. No Output Rate Limiting

select irstream symbol, sum(price)
from MarketData#time(5.5 sec)
group by rollup(symbol)

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
                                                 [IBM, 25.0]        [IBM, null]         
                                                 [null, 25.0]       [null, null]        
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
                                                 [MSFT, 9.0]        [MSFT, null]        
                                                 [null, 34.0]       [null, 25.0]        
  1.0                          
  1.2                          
  1.5                          
         IBM     150    24.0   Event E3 arrives
                                                 [IBM, 49.0]        [IBM, 25.0]         
                                                 [null, 58.0]       [null, 34.0]        
         YAH   10000     1.0   Event E4 arrives
                                                 [YAH, 1.0]         [YAH, null]         
                                                 [null, 59.0]       [null, 58.0]        
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
                                                 [IBM, 75.0]        [IBM, 49.0]         
                                                 [null, 85.0]       [null, 59.0]        
  2.2                          
  2.5                          
  3.0                          
  3.2                          
  3.5                          
         YAH   11000     2.0   Event E6 arrives
                                                 [YAH, 3.0]         [YAH, 1.0]          
                                                 [null, 87.0]       [null, 85.0]        
  4.0                          
  4.2                          
  4.3                          
         IBM     150    22.0   Event E7 arrives
                                                 [IBM, 97.0]        [IBM, 75.0]         
                                                 [null, 109.0]      [null, 87.0]        
  4.9                          
         YAH   11500     3.0   Event E8 arrives
                                                 [YAH, 6.0]         [YAH, 3.0]          
                                                 [null, 112.0]      [null, 109.0]       
  5.0                          
  5.2                          
  5.7                          Event E1 leaves the time window
                                                 [IBM, 72.0]        [IBM, 97.0]         
                                                 [null, 87.0]       [null, 112.0]       
  5.9                          
         YAH   10500     1.0   Event E9 arrives
                                                 [YAH, 7.0]         [YAH, 6.0]          
                                                 [null, 88.0]       [null, 87.0]        
  6.0                          
  6.2                          
  6.3                          Event E2 leaves the time window
                                                 [MSFT, null]       [MSFT, 9.0]         
                                                 [null, 79.0]       [null, 88.0]        
  7.0                          Event E3 and E4 leave the time window
                                                 [IBM, 48.0]        [IBM, 72.0]         
                                                 [YAH, 6.0]         [YAH, 7.0]          
                                                 [null, 54.0]       [null, 79.0]        
  7.2
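
The pairing of insert and remove rows in the table above can be illustrated with a short Python sketch. It is a simplified model under stated assumptions, not the runtime's implementation: the class RollupSum and its update method are hypothetical names, the sketch handles one event at a time (the table combines E3 and E4 leaving at 7.0 into a single set of rows), and a group or total with no events is reported as null (None).

```python
class RollupSum:
    """Tracks sum(price) per symbol plus the grand total, as rollup(symbol) does."""

    def __init__(self):
        self.sums = {}    # symbol -> running sum of price
        self.counts = {}  # symbol -> number of events currently in the window

    def _group(self, symbol):
        # A group without events yields null rather than 0.
        return self.sums[symbol] if self.counts.get(symbol, 0) else None

    def _total(self):
        live = [self.sums[s] for s, c in self.counts.items() if c]
        return sum(live) if live else None

    def update(self, symbol, price, arriving=True):
        """Apply one arriving or leaving event; return (insert_rows, remove_rows)."""
        old = [(symbol, self._group(symbol)), (None, self._total())]
        sign = 1 if arriving else -1
        self.sums[symbol] = self.sums.get(symbol, 0.0) + sign * price
        self.counts[symbol] = self.counts.get(symbol, 0) + sign
        new = [(symbol, self._group(symbol)), (None, self._total())]
        return new, old


rollup = RollupSum()
print(rollup.update("IBM", 25.0))   # E1 arrives
print(rollup.update("MSFT", 9.0))   # E2 arrives
```

The insert stream carries the new value for the symbol level and for the grand-total level (the row whose symbol is null), and the remove stream carries the values those two levels held before the event was applied.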

A.7.2. Output Rate Limiting - Default

select irstream symbol, sum(price)
from MarketData#time(5.5 sec) 
group by rollup(symbol)
output every 1 seconds

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
  1.0                          
  1.2                          
                                                 [IBM, 25.0]        [IBM, null]         
                                                 [null, 25.0]       [null, null]        
                                                 [MSFT, 9.0]        [MSFT, null]        
                                                 [null, 34.0]       [null, 25.0]        
  1.5                          
         IBM     150    24.0   Event E3 arrives
         YAH   10000     1.0   Event E4 arrives
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
                                                 [IBM, 49.0]        [IBM, 25.0]         
                                                 [null, 58.0]       [null, 34.0]        
                                                 [YAH, 1.0]         [YAH, null]         
                                                 [null, 59.0]       [null, 58.0]        
                                                 [IBM, 75.0]        [IBM, 49.0]         
                                                 [null, 85.0]       [null, 59.0]        
  2.5                          
  3.0                          
  3.2                          
                                                 (empty result)     (empty result)      
  3.5                          
         YAH   11000     2.0   Event E6 arrives
  4.0                          
  4.2                          
                                                 [YAH, 3.0]         [YAH, 1.0]          
                                                 [null, 87.0]       [null, 85.0]        
  4.3                          
         IBM     150    22.0   Event E7 arrives
  4.9                          
         YAH   11500     3.0   Event E8 arrives
  5.0                          
  5.2                          
                                                 [IBM, 97.0]        [IBM, 75.0]         
                                                 [null, 109.0]      [null, 87.0]        
                                                 [YAH, 6.0]         [YAH, 3.0]          
                                                 [null, 112.0]      [null, 109.0]       
  5.7                          Event E1 leaves the time window
  5.9                          
         YAH   10500     1.0   Event E9 arrives
  6.0                          
  6.2                          
                                                 [IBM, 72.0]        [IBM, 97.0]         
                                                 [null, 87.0]       [null, 112.0]       
                                                 [YAH, 7.0]         [YAH, 6.0]          
                                                 [null, 88.0]       [null, 87.0]        
  6.3                          Event E2 leaves the time window
  7.0                          Event E3 and E4 leave the time window
  7.2                          
                                                 [MSFT, null]       [MSFT, 9.0]         
                                                 [null, 79.0]       [null, 88.0]        
                                                 [IBM, 48.0]        [IBM, 72.0]         
                                                 [YAH, 6.0]         [YAH, 7.0]          
                                                 [null, 54.0]       [null, 79.0]

A.7.3. Output Rate Limiting - All

select irstream symbol, sum(price)
from MarketData#time(5.5 sec) 
group by rollup(symbol)
output all every 1 seconds

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
  1.0                          
  1.2                          
                                                 [IBM, 25.0]        [IBM, null]         
                                                 [MSFT, 9.0]        [MSFT, null]        
                                                 [null, 34.0]       [null, null]        
  1.5                          
         IBM     150    24.0   Event E3 arrives
         YAH   10000     1.0   Event E4 arrives
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
                                                 [IBM, 75.0]        [IBM, 25.0]         
                                                 [MSFT, 9.0]        [MSFT, 9.0]         
                                                 [YAH, 1.0]         [YAH, null]         
                                                 [null, 85.0]       [null, 34.0]        
  2.5                          
  3.0                          
  3.2                          
                                                 [IBM, 75.0]        [IBM, 75.0]         
                                                 [MSFT, 9.0]        [MSFT, 9.0]         
                                                 [YAH, 1.0]         [YAH, 1.0]          
                                                 [null, 85.0]       [null, 85.0]        
  3.5                          
         YAH   11000     2.0   Event E6 arrives
  4.0                          
  4.2                          
                                                 [IBM, 75.0]        [IBM, 75.0]         
                                                 [MSFT, 9.0]        [MSFT, 9.0]         
                                                 [YAH, 3.0]         [YAH, 1.0]          
                                                 [null, 87.0]       [null, 85.0]        
  4.3                          
         IBM     150    22.0   Event E7 arrives
  4.9                          
         YAH   11500     3.0   Event E8 arrives
  5.0                          
  5.2                          
                                                 [IBM, 97.0]        [IBM, 75.0]         
                                                 [MSFT, 9.0]        [MSFT, 9.0]         
                                                 [YAH, 6.0]         [YAH, 3.0]          
                                                 [null, 112.0]      [null, 87.0]        
  5.7                          Event E1 leaves the time window
  5.9                          
         YAH   10500     1.0   Event E9 arrives
  6.0                          
  6.2                          
                                                 [IBM, 72.0]        [IBM, 97.0]         
                                                 [MSFT, 9.0]        [MSFT, 9.0]         
                                                 [YAH, 7.0]         [YAH, 6.0]          
                                                 [null, 88.0]       [null, 112.0]       
  6.3                          Event E2 leaves the time window
  7.0                          Event E3 and E4 leave the time window
  7.2                          
                                                 [IBM, 48.0]        [IBM, 72.0]         
                                                 [MSFT, null]       [MSFT, 9.0]         
                                                 [YAH, 6.0]         [YAH, 7.0]          
                                                 [null, 54.0]       [null, 88.0]

A.7.4. Output Rate Limiting - Last

select irstream symbol, sum(price)
from MarketData#time(5.5 sec)
group by rollup(symbol)
output last every 1 seconds

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
  1.0                          
  1.2                          
                                                 [IBM, 25.0]        [IBM, null]         
                                                 [MSFT, 9.0]        [MSFT, null]        
                                                 [null, 34.0]       [null, null]        
  1.5                          
         IBM     150    24.0   Event E3 arrives
         YAH   10000     1.0   Event E4 arrives
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
                                                 [IBM, 75.0]        [IBM, 25.0]         
                                                 [YAH, 1.0]         [YAH, null]         
                                                 [null, 85.0]       [null, 34.0]        
  2.5                          
  3.0                          
  3.2                          
                                                 (empty result)     (empty result)      
  3.5                          
         YAH   11000     2.0   Event E6 arrives
  4.0                          
  4.2                          
                                                 [YAH, 3.0]         [YAH, 1.0]          
                                                 [null, 87.0]       [null, 85.0]        
  4.3                          
         IBM     150    22.0   Event E7 arrives
  4.9                          
         YAH   11500     3.0   Event E8 arrives
  5.0                          
  5.2                          
                                                 [IBM, 97.0]        [IBM, 75.0]         
                                                 [YAH, 6.0]         [YAH, 3.0]          
                                                 [null, 112.0]      [null, 87.0]        
  5.7                          Event E1 leaves the time window
  5.9                          
         YAH   10500     1.0   Event E9 arrives
  6.0                          
  6.2                          
                                                 [IBM, 72.0]        [IBM, 97.0]         
                                                 [YAH, 7.0]         [YAH, 6.0]          
                                                 [null, 88.0]       [null, 112.0]       
  6.3                          Event E2 leaves the time window
  7.0                          Event E3 and E4 leave the time window
  7.2                          
                                                 [MSFT, null]       [MSFT, 9.0]         
                                                 [IBM, 48.0]        [IBM, 72.0]         
                                                 [YAH, 6.0]         [YAH, 7.0]          
                                                 [null, 54.0]       [null, 88.0]
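
The figures in the table above can be cross-checked with a small simulation. The sketch below is plain Python, not Esper code: the event list is copied from the Input columns, the helper names rollup_sums and output_last are hypothetical, and it assumes that an event leaves the window exactly 5.5 seconds after it arrives. For each output interval it reports, per changed group, the sum at the end of the interval (insert stream) and the sum at the start of the interval (remove stream); the key None stands for the rollup total.

```python
from collections import defaultdict

WINDOW = 5.5  # seconds, as in MarketData#time(5.5 sec)

# (arrival time, symbol, price) copied from the Input columns of the table.
EVENTS = [
    (0.2, "IBM", 25.0), (0.8, "MSFT", 9.0), (1.5, "IBM", 24.0),
    (1.5, "YAH", 1.0), (2.1, "IBM", 26.0), (3.5, "YAH", 2.0),
    (4.3, "IBM", 22.0), (4.9, "YAH", 3.0), (5.9, "YAH", 1.0),
]


def rollup_sums(now):
    """Sum of price per symbol, plus the rollup total under the key None,
    counting only events that are inside the time window at time `now`."""
    sums = defaultdict(float)
    for arrived, symbol, price in EVENTS:
        if arrived <= now < arrived + WINDOW:
            sums[symbol] += price
            sums[None] += price
    return dict(sums)


def output_last(prev, now):
    """Map each group that changed in (prev, now] to (insert_sum, remove_sum).
    A missing sum is reported as None, like the null entries in the table."""
    before, after = rollup_sums(prev), rollup_sums(now)
    changed = set()
    for arrived, symbol, _ in EVENTS:
        left = arrived + WINDOW
        if prev < arrived <= now or prev < left <= now:
            changed.update({symbol, None})
    return {group: (after.get(group), before.get(group)) for group in changed}
```

Under these assumptions, output_last(6.2, 7.2) yields the four rows listed at time 7.2, and output_last(2.2, 3.2) yields an empty result, as at time 3.2.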

A.7.5. Output Rate Limiting - First

select irstream symbol, sum(price)
from MarketData#time(5.5 sec) 
group by rollup(symbol) 
output first every 1 seconds

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
                                                 [IBM, 25.0]        [IBM, null]         
                                                 [null, 25.0]       [null, null]        
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
                                                 [MSFT, 9.0]        [MSFT, null]        
  1.0                          
  1.2                          
  1.5                          
         IBM     150    24.0   Event E3 arrives
                                                 [IBM, 49.0]        [IBM, 25.0]         
                                                 [null, 58.0]       [null, 34.0]        
         YAH   10000     1.0   Event E4 arrives
                                                 [YAH, 1.0]         [YAH, null]         
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
  2.5                          
  3.0                          
  3.2                          
  3.5                          
         YAH   11000     2.0   Event E6 arrives
                                                 [YAH, 3.0]         [YAH, 1.0]          
                                                 [null, 87.0]       [null, 85.0]        
  4.0                          
  4.2                          
  4.3                          
         IBM     150    22.0   Event E7 arrives
                                                 [IBM, 97.0]        [IBM, 75.0]         
  4.9                          
         YAH   11500     3.0   Event E8 arrives
                                                 [YAH, 6.0]         [YAH, 3.0]          
                                                 [null, 112.0]      [null, 109.0]       
  5.0                          
  5.2                          
  5.7                          Event E1 leaves the time window
                                                 [IBM, 72.0]        [IBM, 97.0]         
  5.9                          
         YAH   10500     1.0   Event E9 arrives
                                                 [YAH, 7.0]         [YAH, 6.0]          
                                                 [null, 88.0]       [null, 87.0]        
  6.0                          
  6.2                          
  6.3                          Event E2 leaves the time window
                                                 [MSFT, null]       [MSFT, 9.0]         
  7.0                          Event E3 and E4 leave the time window
                                                 [IBM, 48.0]        [IBM, 72.0]         
                                                 [YAH, 6.0]         [YAH, 7.0]          
                                                 [null, 54.0]       [null, 79.0]
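
One reading that reproduces every row of the table above is that each group, including the rollup total, outputs on its first change and then stays silent for one second counted from its own last output. This is an inference from the table, not a statement about Esper internals. The Python sketch below simulates that reading; all names are hypothetical, times are in tenths of a second to avoid rounding issues, each arriving event is processed on its own, and events that leave the window at the same time are processed as one batch.

```python
from collections import defaultdict

WINDOW = 55    # time window length in tenths of a second (5.5 sec)
INTERVAL = 10  # output interval in tenths of a second (1 sec)
END = 72       # the table stops at time 7.2

# (arrival time in tenths of a second, symbol, price) from the Input columns.
EVENTS = [
    (2, "IBM", 25.0), (8, "MSFT", 9.0), (15, "IBM", 24.0),
    (15, "YAH", 1.0), (21, "IBM", 26.0), (35, "YAH", 2.0),
    (43, "IBM", 22.0), (49, "YAH", 3.0), (59, "YAH", 1.0),
]


def simulate_output_first():
    """Return rows (time, group, insert_sum, remove_sum); group None is the total."""
    # One batch per arrival; events leaving at the same time share a batch.
    batches = [(t, [(symbol, price, +1)]) for t, symbol, price in EVENTS]
    leaving = defaultdict(list)
    for t, symbol, price in EVENTS:
        leaving[t + WINDOW].append((symbol, -price, -1))
    batches += list(leaving.items())
    batches.sort(key=lambda batch: batch[0])  # stable: keeps arrival order

    sums, counts = defaultdict(float), defaultdict(int)
    next_allowed = defaultdict(int)  # earliest time each group may output again
    rows = []
    for now, changes in batches:
        if now > END:
            break
        # Affected groups: each symbol in the batch, then the rollup total.
        groups = list(dict.fromkeys(symbol for symbol, _, _ in changes)) + [None]
        before = {g: (sums[g] if counts[g] else None) for g in groups}
        for symbol, delta, count_delta in changes:
            for g in (symbol, None):
                sums[g] += delta
                counts[g] += count_delta
        for g in groups:
            if now >= next_allowed[g]:
                after = sums[g] if counts[g] else None
                rows.append((now, g, after, before[g]))
                next_allowed[g] = now + INTERVAL
    return rows


ROWS = simulate_output_first()
```

Under this reading the total row is missing at time 4.3 because the total last produced output at 3.5, and it reappears at 4.9 with the remove stream value 109.0.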

A.7.6. Output Rate Limiting - Snapshot

select irstream symbol, sum(price)
from MarketData#time(5.5 sec) 
group by rollup(symbol)
output snapshot every 1 seconds

                       Input                                  Output
                                                 Insert Stream    Remove Stream
-----------------------------------------------  ----------------------------------
 Time Symbol  Volume   Price
  0.2                          
         IBM     100    25.0   Event E1 arrives
  0.8                          
        MSFT    5000     9.0   Event E2 arrives
  1.0                          
  1.2                          
                                                 [IBM, 25.0]                            
                                                 [MSFT, 9.0]                            
                                                 [null, 34.0]                           
  1.5                          
         IBM     150    24.0   Event E3 arrives
         YAH   10000     1.0   Event E4 arrives
  2.0                          
  2.1                          
         IBM     155    26.0   Event E5 arrives
  2.2                          
                                                 [IBM, 75.0]                            
                                                 [MSFT, 9.0]                            
                                                 [YAH, 1.0]                             
                                                 [null, 85.0]                           
  2.5                          
  3.0                          
  3.2                          
                                                 [IBM, 75.0]                            
                                                 [MSFT, 9.0]                            
                                                 [YAH, 1.0]                             
                                                 [null, 85.0]                           
  3.5                          
         YAH   11000     2.0   Event E6 arrives
  4.0                          
  4.2                          
                                                 [IBM, 75.0]                            
                                                 [MSFT, 9.0]                            
                                                 [YAH, 3.0]                             
                                                 [null, 87.0]                           
  4.3                          
         IBM     150    22.0   Event E7 arrives
  4.9                          
         YAH   11500     3.0   Event E8 arrives
  5.0                          
  5.2                          
                                                 [IBM, 97.0]                            
                                                 [MSFT, 9.0]                            
                                                 [YAH, 6.0]                             
                                                 [null, 112.0]                          
  5.7                          Event E1 leaves the time window
  5.9                          
         YAH   10500     1.0   Event E9 arrives
  6.0                          
  6.2                          
                                                 [MSFT, 9.0]                            
                                                 [IBM, 72.0]                            
                                                 [YAH, 7.0]                             
                                                 [null, 88.0]                           
  6.3                          Event E2 leaves the time window
  7.0                          Event E3 and E4 leave the time window
  7.2                          
                                                 [IBM, 48.0]                            
                                                 [YAH, 6.0]                             
                                                 [null, 54.0]

Appendix B. Runtime Considerations for Output Rate Limiting

Output rate limiting provides output events to your application in regular intervals. Between intervals, the runtime may use a buffer to hold data until the output condition is reached, as described below. If your application has high-volume streams, you may need to be mindful of the memory needs for buffers especially if the output condition triggers infrequently.

The output clause with the snapshot keyword does not require a buffer for any type of statement.

The output clause with the first keyword does not require a buffer for any type of statement.

We use the term change set to describe all insert and remove stream events that have occurred since the last triggering of the output condition.
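
To make the term concrete, here is a minimal Python sketch of a buffer that holds a change set. It is an illustration only, not Esper code; the class and method names are hypothetical. It shows why memory use grows with the number of events that occur between two triggers of the output condition.

```python
class ChangeSetBuffer:
    """Collects insert and remove stream events between output triggers."""

    def __init__(self):
        self.inserted = []
        self.removed = []

    def on_insert(self, event):
        self.inserted.append(event)

    def on_remove(self, event):
        self.removed.append(event)

    def trigger(self):
        """Hand over the current change set and start a new, empty one."""
        change_set = (self.inserted, self.removed)
        self.inserted, self.removed = [], []
        return change_set


buffer = ChangeSetBuffer()
buffer.on_insert({"symbol": "IBM", "price": 25.0})
buffer.on_insert({"symbol": "MSFT", "price": 9.0})
buffer.on_remove({"symbol": "IBM", "price": 25.0})
first = buffer.trigger()
second = buffer.trigger()  # nothing happened since the previous trigger
```

Here first holds two insert stream events and one remove stream event, and second is empty because the buffer was discarded after the first trigger.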

You can override the default behavior for some types of statements by specifying a hint.

Please see Section 2.15, “Basic Aggregated Statement Types” for information on the types of statements discussed below.

B.1. For Un-Aggregated and Un-Grouped Statements

B.1.1. `Output Last`

For statements that define output last, the runtime retains only the first remove stream event and the last insert stream event, both matching the having-clause if present, and uses them to compute insert and remove stream output when the output condition triggers.
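
The retention rule above needs constant memory regardless of how many events occur in an interval. The following Python sketch illustrates it under stated assumptions: the class name OutputLastState is hypothetical, the having-clause is modeled as an optional predicate function, and the code is not taken from Esper.

```python
class OutputLastState:
    """Keeps only the last matching insert event and the first matching remove event."""

    def __init__(self, having=None):
        # Without a having-clause every event matches.
        self.having = having or (lambda event: True)
        self.last_insert = None
        self.first_remove = None

    def on_insert(self, event):
        if self.having(event):
            self.last_insert = event  # overwrite: only the last one is kept

    def on_remove(self, event):
        if self.first_remove is None and self.having(event):
            self.first_remove = event  # keep only the first one

    def trigger(self):
        """Return (insert event, remove event) and reset for the next interval."""
        result = (self.last_insert, self.first_remove)
        self.last_insert = self.first_remove = None
        return result


state = OutputLastState(having=lambda event: event["price"] > 10)
state.on_insert({"symbol": "IBM", "price": 25.0})
state.on_insert({"symbol": "MSFT", "price": 9.0})   # fails the having predicate
state.on_insert({"symbol": "IBM", "price": 24.0})
state.on_remove({"symbol": "MSFT", "price": 9.0})   # fails the having predicate
state.on_remove({"symbol": "IBM", "price": 25.0})
result = state.trigger()
```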

B.1.2. `Output All`

Without an order-by clause and without the @Hint('disable_outputlimit_opt') hint:

Upon arrival of any row, the runtime applies the having-clause and retains only matching events, or retains all events if there is no having-clause.
Upon triggering of the output condition, the runtime computes the insert and remove stream output events according to the select-clause for output.

With an order-by clause or when your EPL specifies the @Hint('disable_outputlimit_opt') hint:

The runtime retains the change set and computes output from the change set at the time the output condition triggers, after which it discards the change set.

B.2. For Fully Aggregated and Un-Grouped Statements

B.2.1. `Output Last`

Without an order-by clause and without the @Hint('disable_outputlimit_opt') hint:

Upon arrival of the first row since the last triggering of the output condition, the runtime computes the remove stream output event according to the select-clause for later output (when applicable).
Upon triggering of the output condition, the runtime computes the insert stream output event according to the select-clause for output.

With an order-by clause or when your EPL specifies the @Hint('disable_outputlimit_opt') hint:

The runtime retains the change set and computes output from the change set at the time the output condition triggers, after which it discards the change set.
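
The two "Upon ..." steps listed above can be sketched for a statement that selects only sum(price). The Python below is a simplified illustration with hypothetical names, not Esper code. It assumes the remove stream value is the aggregate as it stood before the first change in the interval, and it reports an empty aggregate as None.

```python
class SumOutputLast:
    """Output-last state for a single running sum, with no change set kept."""

    def __init__(self):
        self.total = 0.0
        self.count = 0
        self.changed = False
        self.remove_output = None

    def _current(self):
        return self.total if self.count else None

    def _apply(self, delta, count_delta):
        if not self.changed:
            # First row since the last trigger: capture the remove stream value.
            self.remove_output = self._current()
            self.changed = True
        self.total += delta
        self.count += count_delta

    def on_arrive(self, price):
        self._apply(price, +1)

    def on_leave(self, price):
        self._apply(-price, -1)

    def trigger(self):
        """Return (insert value, remove value), or None if nothing changed."""
        if not self.changed:
            return None
        output = (self._current(), self.remove_output)
        self.changed = False
        self.remove_output = None
        return output


state = SumOutputLast()
state.on_arrive(25.0)
state.on_arrive(9.0)
first = state.trigger()
for price in (24.0, 1.0, 26.0):
    state.on_arrive(price)
second = state.trigger()
third = state.trigger()  # no rows since the previous trigger
```

The state consists of a handful of fields regardless of how many rows arrive between triggers, which is the point of the default behavior.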

B.2.2. `Output All`

Without an order-by clause and without the @Hint('disable_outputlimit_opt') hint:

Upon arrival of rows, the runtime applies the having-clause and computes the insert and remove stream output event according to the select-clause for later output (when applicable).
Upon triggering of the output condition, the runtime outputs the insert and remove stream output events.

With an order-by clause or when your EPL specifies the @Hint('disable_outputlimit_opt') hint:

The runtime retains the change set and computes output from the change set at the time the output condition triggers, after which it discards the change set.

B.3. For Aggregated and Un-Grouped Statements

B.3.1. `Output Last`

Without an order-by clause and without the @Hint('disable_outputlimit_opt') hint:

Upon arrival of rows, the runtime computes the insert and remove stream output event according to the having-clause (if present) and the select-clause for later output (when applicable), retaining only the last computed insert and remove stream output event.
Upon triggering of the output condition, the runtime outputs the pre-computed last insert stream and remove stream output event.

With an order-by clause or when your EPL specifies the @Hint('disable_outputlimit_opt') hint:

The runtime retains the change set and computes output from the change set at the time the output condition triggers, after which it discards the change set.

B.3.2. `Output All`

Without an order-by clause and without the @Hint('disable_outputlimit_opt') hint:

Upon arrival of rows, the runtime computes the insert and remove stream output events according to the having-clause (if present) and the select-clause for later output, retaining only the computed insert and remove stream output events.
Upon triggering of the output condition, the runtime outputs the retained output events.

With an order-by clause or when your EPL specifies the @Hint('disable_outputlimit_opt') hint:

The runtime retains the change set and computes output from the change set at the time the output condition triggers, after which it discards the change set.

B.4. For Fully Aggregated and Grouped Statements (Includes Rollup)

B.4.1. `Output Last`

Without an order-by clause and without the @Hint('disable_outputlimit_opt') hint:

Upon arrival of the first row for a given group since the last triggering of the output condition, the runtime computes the remove stream output event for that group according to the select-clause for later output (when applicable), and also retains a single insert stream event per group.
Upon triggering of the output condition, the runtime uses the retained insert stream events per group to compute output events according to the select-clause.

With an order-by clause or when your EPL specifies the @Hint('disable_outputlimit_opt') hint:

The runtime retains the change set and computes output from the change set at the time the output condition triggers, after which it discards the change set.
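
The per-group "Upon ..." steps listed above can be illustrated with a small Python sketch for a statement that selects symbol and sum(price) grouped by symbol. The names are hypothetical, only arriving rows are handled, rollup levels are left out, and the code is a simplification rather than Esper's implementation. Memory use depends on the number of groups that changed in the interval, not on the number of rows.

```python
class GroupedSumOutputLast:
    """Per-group output-last state: one remove value and one event per group."""

    def __init__(self):
        self.sums = {}           # group -> running sum(price)
        self.remove_output = {}  # group -> sum before its first change this interval
        self.last_event = {}     # group -> the single retained insert stream event

    def on_arrive(self, event):
        group = event["symbol"]
        if group not in self.remove_output:
            # First row for this group since the last trigger.
            self.remove_output[group] = self.sums.get(group)
        self.sums[group] = self.sums.get(group, 0.0) + event["price"]
        self.last_event[group] = event

    def trigger(self):
        """Return {group: (insert sum, remove sum)} for groups that changed."""
        output = {
            group: (self.sums[group], self.remove_output[group])
            for group in self.last_event
        }
        self.remove_output = {}
        self.last_event = {}
        return output


state = GroupedSumOutputLast()
state.on_arrive({"symbol": "IBM", "price": 25.0})
state.on_arrive({"symbol": "MSFT", "price": 9.0})
first = state.trigger()
state.on_arrive({"symbol": "IBM", "price": 24.0})
state.on_arrive({"symbol": "YAH", "price": 1.0})
state.on_arrive({"symbol": "IBM", "price": 26.0})
second = state.trigger()
```

With these inputs, second contains IBM with the pair (75.0, 25.0) and YAH with (1.0, None); MSFT is absent because no MSFT row arrived in the second interval.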

B.4.2. `Output All`

Without an order-by clause and without the @Hint('disable_outputlimit_opt') hint:

The runtime retains, for each group, a row to represent the group.
Upon arrival of rows, the runtime computes the remove stream output events according to the having-clause (if present) and the select-clause for later output.
Upon triggering of the output condition, the runtime computes the insert stream output events according to the having-clause (if present) and the select-clause for output, for each group.

With an order-by clause or when your EPL specifies the @Hint('disable_outputlimit_opt') hint:

The runtime retains the change set and computes output from the change set at the time the output condition triggers, after which it discards the change set.

B.5. For Aggregated and Grouped Statements

B.5.1. `Output Last`

Without an order-by clause and without the @Hint('disable_outputlimit_opt') hint:

Upon arrival of the first row for a given group since the last triggering of the output condition, the runtime computes the insert and remove stream output event for that group according to the select-clause for later output (when applicable), and retains a last insert and remove stream event per group.
Upon triggering of the output condition, the runtime outputs the retained insert and remove stream output events per group.

With an order-by clause or when your EPL specifies the @Hint('disable_outputlimit_opt') hint:

The runtime retains the change set and computes output from the change set at the time the output condition triggers, after which it discards the change set.

B.5.2. `Output All`