Amazon Connect Data Lake Best Practices | AWS White Paper Summary

  • Customer service is a crucial element of brand reputation and business success.
  • Contact centers are vital to enabling a two-way agent-customer interaction and essential to delivering a superior customer service experience. Conversely, a poor experience can lead to customer churn.
  • Organizations invest in omnichannel contact centers for a competitive edge in enhancing customer experience.
  • To get the most advanced analytics benefits, organizations need a robust platform and cost-effective solution to run a thriving contact center.
  • Amazon Web Services (AWS) provides customers with a comprehensive set of services and a scalable platform to ensure high availability, security, and resiliency of a data lake in the cloud.
  • This whitepaper outlines the best practices for architecting a contact center data lake with Amazon Connect.

  • The following figure shows the architecture of a traditional on-premises contact center. 1

  • The following figure shows a strategic approach to simplifying complex traditional contact center data spans across infrastructure, licensing, and maintenance environments into Amazon Connect. 2

  • A data lake is a centralized, curated, and secured repository that stores and governs all your structured and unstructured data in its native or transformed formats for analysis.

  • AWS delivers the breadth and depth of services to build a secure, scalable, comprehensive, and cost-effective data lake solution.
  • You can use the AWS services to ingest, store, find, process, and analyze data from a wide variety of sources.

Amazon Connect

  • Amazon Connect is an easy-to-use and cost-effective omnichannel cloud contact center.
  • You can get started with a fully managed cloud-based and artificial intelligence (AI) enabled contact center within minutes.
  • With the pay-as-you-go model, you pay only when the service is in use.
  • There is no infrastructure to manage or upfront costs.

  • Forrester Research Consulting conducted a Total Economic Impact (TEI) study on Amazon Connect and concluded a three-year financial impact on how Amazon Connect helps customers with significant cost savings, increased revenue, and improved agent productivity. Key findings include:

    • Reduction in cloud technology costs of $4.3 million
    • Subscription cost savings of 31%
    • Agent labor savings from reduced call volume of $4.6 million
    • Increased operating income by $2.6 million with enhanced customer experience
    • Return on investment (ROI) of 241%
  • Using Amazon Connect’s extensive set of published APIs, you can programmatically integrate with other AWS services and third-party systems, including customer relationship management (CRM) solutions and anti-fraud solutions.

  • The following figure shows a high-level Amazon Connect contact center architecture. 3

  • Amazon Connect provides a unified and seamless customer experience across multiple channels.
  • Along with voice and webchat, Amazon Connect integrates with Amazon Pinpoint and Amazon Simple Email Service (Amazon SES) to expand the contact center’s capability on text messages and email delivery.
  • Amazon Connect integrates with Apple Business Chat for Apple device users.

Data lake design principles

  • How do you collect, store, and analyze high-velocity data across various data types, including structured, unstructured, and semi-structured?
  • How do you store and share petabytes of data on-demand globally and cost-effectively?
  • How do you scale IT resources to support a high number of concurrent queries against your data and scale down automatically for cost savings?
  • How do your users view, search, and run queries on multiple data repositories today?
  • How do you derive future insights using historical data patterns and past scenarios?

Customer profiles

  • Amazon Connect Customer Profiles enables agents to deliver efficient and personalized customer service by importing customer information from various applications into a unified customer profile.
  • You can ingest customer data from homegrown or third-party applications such as Salesforce, ServiceNow, Zendesk, and Marketo into your Amazon Simple Storage Service (Amazon S3) data lake using pre-built connectors.

Contact trace record

  • It captures transactional metrics such as hold time, wait time, and agent interaction time in JavaScript Object Notation (JSON) format.
  • Amazon Connect aggregates CTR data to create metrics reporting. Data retention for CTR is 24 months upon contact initiation.
  • You can stream CTRs to Amazon Kinesis for extended retention and advanced analysis.
  • The CTR data model describes various event types available in CTRs.

Contact flow logs

  • It captures real-time events and metrics about how your customers interact with contact flows.
  • AWS CloudWatch creates a log group for each AWS Connect instance when you enable contact flow logging and include a set logging behavior block for contact flows.
  • Contact flow logs contain the contact flow ID, the customer’s contact ID, and the block’s actions.
  • Using contact flow logs, you can compare customer’s interactions with different contact flow versions or trace their interactions through each contact flow.
  • Contact flow logs help you debug and roll back contact flows to previous versions should any issues arise.

Contact Lens output files

  • Using natural language processing (NLP) and speech-to-text analytics, Contact Lens for Amazon Connect provides insights to analyze customer sentiment, identify conversations trends for product feedback, and compliance audits for standard greetings and sign-offs.
  • With advanced conversational search, you can perform a fast full-text search for relevant calls by sentiment scores and non-talk time to identify common utterances that result in positive or negative customer sentiment. Contact Lens automatically redacts sensitive personally identifiable information (PII) for data privacy.
  • Contact Lens stores metadata for call transcript, sentiment analysis, non-talk time, talk speed, interruptions, and categorization labels in Amazon S3. You can create custom visualization or machine learning (ML) models using data from Contact Lens and CTR stored in S3.

Agent events streams

  • It captures and store agent activity in S3 via Amazon Kinesis Data Streams.
  • You can create dashboards for near real-time agent reporting such as agent login, agent logout, agent connects with a contact and agent status change.
  • You can integrate agent event streams into workforce management (WFM) solutions for agent staffing management or configure alerts on specific agent activity.

Voice and chat recordings

  • Amazon Connect records a conversation only when a customer connects to an agent. When the contact disconnects, the call recordings are available in your S3 bucket, or accessible in the customer's contact trace record (CTR).
  • Amazon Connect redacts, encrypts, and stores voice and chat conversations between the agent and the contact in your S3 bucket for advanced analytics.

Third-party integration

  • When using AWS Partners or other third-party solutions with Amazon Connect, you can consolidate logs and external data sources in Amazon S3.

Data lake lifecycle

Building a data lake typically involves five stages:

  • Setting up storage
  • Moving data
  • Preparing and cataloging data
  • Configuring security policies
  • Making data available for consumption

1

  • AWS provides a comprehensive data transfer services portfolio to move your existing data into a centralized data lake.
  • Amazon Storage Gateway and AWS Direct Connect can address hybrid cloud storage needs.
  • For online data transfer, consider using AWS DataSync and Amazon Kinesis.
  • Use the AWS Snow Family for offline data transfer.

Cataloging

  • AWS Lake Formation can manage data ingestion via AWS Glue by automatically classifying data and storing definitions, schema, and metadata in a central data catalog.
  • For faster analytics, Lake Formation converts data into Apache Parquet and ORC before storing it in your S3 data lake.
  • AWS Glue DataBrew, a visual data preparation tool, allows data owners, subject matter experts, or users of all skill sets to participate in the data preparation process.

Security

  • Amazon Connect encrypts personally identifiable information (PII) contact data and customer profiles at rest using a time-limited key specific to your Amazon Connect instance.
  • All data exchanged between Amazon Connect and other AWS services, or external applications is always encrypted in transit using industry-standard transport layer security (TLS) encryption.
  • We recommend identity-based policies for most data lake environments to simplify resource access management and service permission for your data lake users.

Monitoring

  • Amazon Connect sends the instance’s usage data as Amazon CloudWatch metrics at a one-minute interval.
  • Data retention is about 2 weeks.

Analytics

  • A contact center data lake built on a descriptive, predictive, and real-time analytics portfolio helps you extract meaningful insights and respond to critical business questions.
  • For a highly scalable data warehousing solution, you can enable data streaming in Amazon Connect to stream CTRs into Amazon Redshift via Amazon Kinesis.

Machine learning

  • Building a data lake brings a new paradigm to contact center architecture, empowering your business to deliver enhanced and personalized customer service using machine learning (ML) capabilities.
  • Amazon Connect provides call attributes from telephony carriers, such as voice equipment’s geographic location to show where the call originated, phone device types such as landline or mobile, number of network segments the call traversed, and other call origination information.
  • Amazon Fraud Detector, you can create a ML model to identify potentially fraudulent activity by combining your datasets with Amazon Connect call attributes.

References

Original White Paper