Announcements 9 Apr 2019

Siren Platform 10.2 released

Author: Giovanni Tummarello
Author Giovanni Tummarello
Siren Platform 10.2 released

Siren 10.2 released — free community edition, simplified data ingestion & extraction, Neo4j support & more

We announce today the availability of Siren 10.2. We’re very excited about this release for many reasons. Let’s go in order.

Siren 10.2 addresses the following challenges with several new features:

Introducing Siren Community Edition, our free entry tier

Since the start, Siren’s industry unique relational investigation capabilities have been widely appealing to the analyst community at large, but always required loading data into Elasticsearch or DBMS in order to analyze it.

With 10.2 we’re now introducing UI assisted ways to load data (e.g. from CSV/Excel see below) effectively making much easier to use Siren also for an individual analyst or a casual project.

This is the perfect occasion for us to introduce our new entry tier level: Siren Community Edition™!

Siren Community Edition™ is fully featured, subject only to quantitative limitations, and can be easily upgraded to our Siren IT and Siren Business editions.

Siren Community Edition represents a fundamental milestone for us, for the first time making world class link analysis and our unique relational investigation capabilities available to individuals and casual projects.

Making data loading easy: no-size-limit CSV and Excel Imports

Siren 10.2 introduces big scale, UI driven CSV and Excel uploads.

The feature is provided as part of the new “Data Reflection” app dedicated to data sources and data loading. It starts with a first step which is as easy as dragging and dropping a local file.

Siren Data Reflection - CSV Import
After a quick preview, one can configure Transformations and Mappings (data types), let’s see them in some detail.

Transform your data at load time.

Did you know, Elasticsearch has very nice data transformation capabilities in the form of “ingestion processors” (almost 30 of them) which can be pipelined as required. In its current version transformations are expressed in JSON still, but in Siren the environment is assistive and offers an immediate way to test the results on different inputs. Take a look below (just chose a sample or edit your input to see the result of the pipeline).

Data transformation at load time

Also, once one is happy with a transformation pipeline, this can be saved and reused later for similarly formatted inputs.

Also, on top of the standard Elasticsearch pipeline processors, we’re adding a quite useful “Web Service Enrichment” processor. In the example below, the “Abstract” field of the currently processed documents is sent as the “text” parameter to the NLP service.

Siren Web Service Enrichment Processor

UI driven Elasticsearch mappings

The wizard features also UI driven schema review and editing (mapping to Elasticsearch data types). Why is this important? Sure one can simply index all fields as “text” but if one uses the right types then more meaningful analytics (word-clouds, proper unique counts, numerical analysis, date histograms and easy data filters) will be possible.

Again the process is quite simple, with Siren showing samples of values and also making some educated guess whenever possible.

UI driven ES mappings

Once finished the CSV will have become an index, ready to be mapped as standard in the “Data Model” section of Siren.  CSVs have no specific size restrictions, in our tests we have successfully loaded well into the multi gigabyte.

Repeated, “workflow uploads” a breeze.

This feature has been developed for a casual data upload as well as serial “workflow” uploads where the formats are known a priori. In this case, thanks to presets which can be saved, the wizard makes ingestion a true drag and drop/single button activity.

Here is how one of our customers ingests fresh data from many well-known life science services, just selecting one of the previously saved configurations.

repeated workflow uploads in Siren

Data Reflections: kept in sync materialization of remote data

Last May with Siren 10 we launched remote data source virtualization (docs) — that is the ability to see remote JDBC data sources as if they were local Elasticsearch indexes, without having to copy data.

Virtualization gives Siren the ability to see in the same UI data from several physically different systems, without the need of copying it and no delays (live queries and responses are translated on the fly). For example, in the screenshot below, Siren is showing data from nine different virtualized back end sources.

Siren Data Reflection within a dashboard

While this is obviously great, there are a few things virtualization cannot do and use cases where it is not a good idea to continuously hit on a remote data source.

Enter data source reflections

Siren 10.2 launches the concept of optional ‘Reflection’ of remote data sources and virtual indexes into the (supercharged) Elasticsearch back end.  Siren Reflections are setup within the UI and they can be scheduled to be re-executed regularly.

As opposed to virtualized indexes, reflecting data comes with the following advantages:

Reflections can be activated from the new Data Reflection side app, with a workflow coherent with that of the CSV import (allowing pipeline imports, setting of mappings, etc.). Want to know more? Check out: The alternatives to ETL into Elasticsearch: Virtualize or Reflect?

Siren Data Reflections App

Big Data CSV/PDF and PNG exports

Gigabytes of data can be exported now with a click no matter which complex or relational filter is applied. Just click the button at the top left of the record tables.

Big data exports from within the Siren UI

Likewise, we’re happy to add one click PDF and PNG exports from the dashboards via the “Export” menu.

Siren Data Exports - PDF

PDF look pretty nice, with username, timestamps and full filters on top:

Siren PDF Export Example

Enhanced data table (With Pivot table capabilities)

The analytics table component can now split by column, achieving a typical “pivot table” appearance. On top of this one can optionally activate a text search bar, scripted columns with math expressions, and several appearance configuration options.

Enhanced data table within Siren Investigate™

Correlation Explorer

We have improved our parallel line correlation explorer and included it in the core distribution. Easy to explore how aggregate metrics and value buckets correlate, both creating filters on the axis and on the scatter plots.

Improved Autorelations and relation preview icon.

Siren ability to automatically find keys to join and suggest a relational data model creation got better in 10.2 with many more options and smarter defaults recognizing email addresses, IPs, and other kinds of data automatically.

This is typically a big help to analysts building their initial model to get much faster time to value by connecting more data sets.

We have added a quick “view” button to visually verify relations within the relation editor in the data model. Just press the button.

Correlation explorer within Siren Investigate™
…and see samples of field value matched records from the 2 indexes

Correlation Explorer - Matched records

Significant terms aggregation is now available in the graph browser, as an option in the “Advanced Relations” and the results are fascinating.

In our demo data set of companies mentioned by articles (whereas articles can co-mention 2 or more companies) these are the results out of the box by activating the aggregate relation, starting from Oracle and expanding on SAP. We can see that “PeopleSoft” is strongly related to Oracle and mentioned very significantly together SAP.

Numbers on the arrows represent the number of articles that share the terms and the other number the “significance score” (also reflected on the arrow size).

Siren Graph Browser with significant term aggregation

Elasticsearch version bump.

Siren compatibility with Elasticsearch has been bumped to 6.5.4

Welcome Graph DBs: Our phase 1 support for Neo4j

We’re very excited to introduce our first connector to the graph DB World: the Neo4j ingestion connector.

With Siren 10.2, one can create a Neo4j data source and use it in the ingestion UI to bring in “slices” of Neo4j into the Siren back end.

Doing this will give you all the benefits above (textual search, ultra fast analytics dashboards, link analysis and all) and the data will be kept in sync automatically, within the “reflection” framework.

Even more interestingly, one can make use of the unique power of graph pattern detection of Neo4j by saving the results of graph queries within Siren indexes and then using them in Siren workflows.

For example, let’s imagine we want to catch “suspect reviewers” on a public product review in the Neo4j movie review test database for example “people who put reviews of movies they have been starring in”.

Via the mechanism above, Siren can immediately visualize “suspect reviewers nodes” which can then be expanded to reveal the network.

Siren Data Model™
Stay tuned for a blog post specifically on our initial Siren / Neo4j integration.

Last but not least: Getting Started tutorial and Siren Community

Siren 10.2 also comes with a very cool new tutorial and an “Easy Start” download package, which will help you to investigate your own data quickly.

Initial reactions to the tutorial have been great, users have been able to use Siren to analyze their own data easily and quickly.

The tutorial, by the way, works great on Siren Community Edition™ too, so why not try it now?

Last but not least we’re also launching the Siren Community portal.

In the community we’ll be providing open support, but also discuss Investigative Intelligence use cases and applications, provide FAQs, links to resources and more.

Feel free to stop by and looking forward to hearing from you!

What’s Cooking? Heads up on the next release

Works in progress for 10.3 and 10.4 Features include:

Conclusions

10.2 introduces a number of features and improvements that make it easier than ever to access and use Siren. With this version we’re also rolling out our community tier and our community forum making it easier to both get started on the software and get help.  We look forward to hearing from you.


Also published on Medium.

OTHER AREAS

Explore our topics

Close