Insights

Datumo offers a variety of insights to help you understand the quality of your data.

So far, Datumo supports the following insights:

  • the number of contacts in the dataset;
  • the number of attributes in the dataset;
  • the number of cells in the dataset;
  • the number of filled cells in the dataset and the percentage of filled cells;
  • the number of filled cells for each attribute in the dataset and the percentage of filled cells for each attribute;
  • the segmentation of categorical attribute values, including the count and the percentage of each value. Supported attributes (each reported only if present in the dataset) are:
    • Is human;
    • Gender;
    • Preferred Language;
    • Business Language.

Requesting the insights

To request the insights, you need a collection with contact data on Datumo. You can create a collection and upload your data by following the instructions in the Collection section.

Once you have uploaded your contact data, you can request the insights by sending a POST request to the invocation endpoint, with invocationType set to insights. See more at Invoke Datumo.

Example

In the example below, we request insights for the collection my-collection-id.

curl --location 'https://nostradamus.u-hopper.com/api/invocation' \
  --header 'Authorization: Token my-secret-token' \
  --header 'Content-Type: application/json' \
  --data '{
    "collectionId": "my-collection-id",
    "invocationType": "insights",
    "args": null
  }'
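For readers who prefer to call the endpoint from a script, the same request can be sketched in Python with the standard library. The helper name build_insights_request is hypothetical; the URL, headers, and body fields are copied from the curl example. The sketch only builds the request and does not send it, since the shape of the endpoint's response is not described on this page.

```python
import json
import urllib.request

# Endpoint taken from the curl example above.
INVOCATION_URL = "https://nostradamus.u-hopper.com/api/invocation"


def build_insights_request(collection_id: str, token: str) -> urllib.request.Request:
    """Build (but do not send) the POST request for an insights invocation.

    Hypothetical helper: it mirrors the curl example field by field.
    """
    payload = {
        "collectionId": collection_id,
        "invocationType": "insights",
        "args": None,  # serialized as JSON null, as in the curl example
    }
    return urllib.request.Request(
        INVOCATION_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Token {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_insights_request("my-collection-id", "my-secret-token")
print(request.get_method(), request.full_url)
print(request.data.decode("utf-8"))
```

Passing the request to urllib.request.urlopen would send it; replace the placeholder collection ID and token with your own values first.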

Interpreting the insights results

The results of an insights request are returned in JSON format and include the following keys:

  • contacts: the number of contacts in the dataset;
  • attributes: the number of attributes in the dataset;
  • cells: the number of cells in the dataset;
  • filledCells: a dictionary with the keys count and percentage, giving the number and the percentage of filled cells in the dataset;
  • filledCellsPerAttribute: a dictionary where each key is an attribute name and each value is a dictionary with the keys count and percentage, giving the number and the percentage of filled cells for that attribute;
  • categoricalAttributesSegmentation: a dictionary where each key is an attribute name and each value is the segmentation of that attribute. The segmentation is a dictionary whose keys are the attribute values and whose values are dictionaries with the keys count and percentage.
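As an illustration of how these keys might be consumed, the sketch below lists the attributes whose fill percentage falls under a threshold, which is one way to spot the columns that most need attention. The function name incomplete_attributes, the threshold parameter, and the abridged sample result are all hypothetical; only the key names come from the list above.

```python
def incomplete_attributes(insights: dict, threshold: float = 100.0) -> list[str]:
    """Return the attributes filled below `threshold` percent, least filled first.

    Hypothetical helper: it relies only on the filledCellsPerAttribute key.
    """
    per_attribute = insights["filledCellsPerAttribute"]
    below = [
        (stats["percentage"], name)
        for name, stats in per_attribute.items()
        if stats["percentage"] < threshold
    ]
    # Sort by percentage first, then by name to keep ties deterministic.
    return [name for _, name in sorted(below)]


# Abridged, illustrative result; a real result contains every attribute.
sample_result = {
    "filledCellsPerAttribute": {
        "ID": {"count": 2, "percentage": 100.0},
        "Surname": {"count": 1, "percentage": 50.0},
        "Name": {"count": 1, "percentage": 50.0},
    }
}

print(incomplete_attributes(sample_result))
```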

Example

You request insights for the following contacts:

ID,Name,Surname,Gender,Country,Preferred Language,Email,Company - Name
0,Silvia,Marri,female,it,it,silvi.marri@snrt.co.eu,SN RTek
1,,,female,it,it,toninal@nicojd.com,NicoJds

The output, in JSON format, will be:

{
  "contacts": 2,
  "attributes": 8,
  "cells": 16,
  "filledCells": {
    "count": 14,
    "percentage": 87.5
  },
  "filledCellsPerAttribute": {
    "ID": {
      "count": 2,
      "percentage": 100.0
    },
    "Name": {
      "count": 1,
      "percentage": 50.0
    },
    "Surname": {
      "count": 1,
      "percentage": 50.0
    },
    "Gender": {
      "count": 2,
      "percentage": 100.0
    },
    "Country": {
      "count": 2,
      "percentage": 100.0
    },
    "Preferred Language": {
      "count": 2,
      "percentage": 100.0
    },
    "Email": {
      "count": 2,
      "percentage": 100.0
    },
    "Company - Name": {
      "count": 2,
      "percentage": 100.0
    }
  },
  "categoricalAttributesSegmentation": {
    "Gender": {
      "female": {
        "count": 2,
        "percentage": 100.0
      }
    },
    "Preferred Language": {
      "it": {
        "count": 2,
        "percentage": 100.0
      }
    }
  }
}
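The counts in this output can be reproduced locally to see how they relate to the input. The sketch below is not Datumo's implementation; it assumes that a cell counts as filled when it contains anything other than whitespace, which matches the numbers above (the second contact has no Name and no Surname, so 14 of 16 cells are filled). The function name fill_statistics is hypothetical, and the segmentation of categorical attributes is left out.

```python
import csv
import io

# The two contacts from the example above.
SAMPLE_CSV = """ID,Name,Surname,Gender,Country,Preferred Language,Email,Company - Name
0,Silvia,Marri,female,it,it,silvi.marri@snrt.co.eu,SN RTek
1,,,female,it,it,toninal@nicojd.com,NicoJds
"""


def fill_statistics(csv_text: str) -> dict:
    """Recompute the count-based insights for a CSV of contacts.

    Assumption: a cell is "filled" when it is not empty or whitespace-only.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    rows = list(reader)
    attributes = list(reader.fieldnames or [])
    cells = len(rows) * len(attributes)

    per_attribute = {}
    for name in attributes:
        count = sum(1 for row in rows if (row[name] or "").strip())
        percentage = 100.0 * count / len(rows) if rows else 0.0
        per_attribute[name] = {"count": count, "percentage": percentage}

    filled = sum(stats["count"] for stats in per_attribute.values())
    return {
        "contacts": len(rows),
        "attributes": len(attributes),
        "cells": cells,
        "filledCells": {
            "count": filled,
            "percentage": 100.0 * filled / cells if cells else 0.0,
        },
        "filledCellsPerAttribute": per_attribute,
    }


stats = fill_statistics(SAMPLE_CSV)
print(stats["filledCells"])
```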