More Google Analytics
Key Points
- Google Analytics go beyond simple Web page analytics and navigation
- Google Cloud Platform - GCP - Cloud server runtimes
- Google Stream Analytics - GSA - Google streaming analytics solution models
- Google Dataflow - GDF - serverless batch, stream data pipelines
- Google Dataprocs - GDP - server based Apache Spark, Hadoop pipelines
- Google BigTable - GBT - BigData store in GCP
- Google PubSub - GPS - Create, subscribe, publish event stream data to pipelines
- Google Machine Learning - GML - standard or custom machine learning algorithms applied to stream data
- Google BigQuery - GBQ - SQL like interface to NOSQL data stores
- Google Data Studio - GDS - Visual design toolset for reports, queries using GBQ, GBT etc
- Google Analytics can integrate multiple information sources
References
Reference_description_with_linked_URLs________________________ | Notes_____________________________________________________________ |
---|---|
Google Cloud Platform docs - GCP | |
https://cloud.google.com/solutions/big-data/stream-analytics/ | Google Stream Analytics Platform - GSA |
https://cloud.google.com/dataflow/ | DataFlow - GDF A serverless simplified stream and batch data processing, with equal reliability and expressiveness Cloud Dataflow seamlessly integrates with GCP services for streaming events ingestion (Cloud Pub/Sub), data warehousing (BigQuery), machine learning (Cloud Machine Learning), and more. Its Beam-based SDK also lets developers build custom extensions and even choose alternative execution engines, such as Apache Spark via Cloud Dataproc or on-premises. For Apache Kafka users, a Cloud Dataflow connector makes integration with GCP easy. |
https://cloud.google.com/dataproc/ | DataProc - GDP cost-effective way to run Apache Spark and Apache Hadoop Cloud Dataproc is a fast, easy-to-use, fully-managed cloud service for running Apache Spark and Apache Hadoop clusters in a simpler, more cost-efficient way. Operations that used to take hours or days take seconds or minutes instead, and you pay only for the resources you use (with per-second billing). Cloud Dataproc also easily integrates with other Google Cloud Platform (GCP) services, giving you a powerful and complete platform for data processing, analytics and machine learning. |
https://cloud.google.com/bigtable/ | BigTable - GBT - a 3D store for array maps - row, column, timestamp with partitioning, replication |
https://cloud.google.com/pubsub/ | PubSub - GPS - Ingest event streams from anywhere, at any scale, for simple, reliable, real-time stream analytics Cloud Pub/Sub is a simple, reliable, scalable foundation for stream analytics and event-driven computing systems. As part of Google Cloud’s stream analytics solution, the service ingests event streams and delivers them to Cloud Dataflow for processing and BigQuery for analysis as a data warehousing solution. Relying on the Cloud Pub/Sub service for delivery of event data frees you to focus on transforming your business and data systems |
BigQuery - GBQ -BigQuery is Google's fully managed, petabyte scale, low cost analytics data warehouse. BigQuery is NoOps—there is no infrastructure to manage and you don't need a database administrator—so you can focus on analyzing data to find meaningful insights, use familiar SQL, and take advantage of our pay-as-you-go model. | |
https://cloud.google.com/bigquery/docs/transfer-service-overview | BigQuery - data transfer service options ( Google or 3rd party like FiveTran ) |
https://marketingplatform.google.com/about/data-studio/ | Data Studio - GDS toolset to create reports using BigQuery |
Key Concepts
Google Cloud Analytics Platform
Google Cloud Analytics Platform for IOT integration
Potential Value Opportunities
- DMX Support Operations analytics
- Embedded analytics in DMX platform applications
- Customer self-service reporting for selected dealer, OEM admins
Potential Challenges
Candidate Solutions
Step-by-step guide for Example
sample code block