Thought Spot On AWS Quick Start Guide 4.2
User Manual: Pdf
Open the PDF directly: View PDF .
Page Count: 26
Download | |
Open PDF In Browser | View PDF |
ThoughtSpot on AWS Quick Start Guide Version 4.2 February 2017 Table of Contents Contents Chapter 1: Welcome to ThoughtSpot....................................................3 Contact ThoughtSpot......................................................................................................... 4 Chapter 2: Introduction.............................................................................. 6 About AWS.............................................................................................................................. 7 Chapter 3: Configuration........................................................................... 8 Configuration options......................................................................................................... 9 Chapter 4: Installation and setup..........................................................12 About the AMI...................................................................................................................... 13 Launch an instance............................................................................................................. 13 Chapter 5: ThoughtSpot on AWS reference..................................... 19 Network ports......................................................................................................................20 ThoughtSpot on AWS Quick Start Guide Copyright © 2017 by ThoughtSpot. All Rights Reserved. 2 Chapter 1: Welcome to ThoughtSpot Welcome to ThoughtSpot Topics: • Contact ThoughtSpot Congratulations on purchasing the ThoughtSpot instance. This guide will get you started with setting up the Amazon Web Services (AWS) virtual machine (VM) offering. We hope your experience with ThoughtSpot is excellent. Please let us know how it goes, and what we can do to make it better. ThoughtSpot on AWS Quick Start Guide Copyright © 2017 by ThoughtSpot. All Rights Reserved. 3 Welcome to ThoughtSpot Contact ThoughtSpot You can contact ThoughtSpot by phone, mail, email, or by filing a support ticket. File a support ticket If you encounter a technical issue, file a support ticket using the Support Portal ticket filing system at: http://support.thoughtspot.com/ Please provide as much detail as possible about your issue, to help us resolve it quickly. You need a Support Portal login to file a ticket. Please contact ThoughtSpot to get an account, if necessary. Address ThoughtSpot, Inc. 1 Palo Alto Square, Building 1, Suite 200 Palo Alto, CA 94306 Phone numbers Table 1: Phone numbers Phone Number Description 1-800-508-7008 ext 1 ThoughtSpot Support 1-800-508-7008 Toll free number for ThoughtSpot headquarters. ThoughtSpot on AWS Quick Start Guide Copyright © 2017 by ThoughtSpot. All Rights Reserved. 4 Welcome to ThoughtSpot Email Table 2: Email addresses Reason for contacting Email For sales inquiries. sales@thoughtspot.com For customer support and software update support@thoughtspot.com inquiries. For other inquiries. ThoughtSpot on AWS Quick Start Guide hello@thoughtspot.com Copyright © 2017 by ThoughtSpot. All Rights Reserved. 5 Introduction Chapter 2: Introduction Introduction Topics: • About AWS Before you set up your ThoughtSpot instance, here is some information about the Amazon Web Services (AWS) cloud, Amazon Elastic Compute Cloud (EC2), Amazon Elastic Block Store (EBS), and how they all work with ThoughtSpot. ThoughtSpot on AWS Quick Start Guide Copyright © 2017 by ThoughtSpot. All Rights Reserved. 6 Introduction About AWS AWS is a secure cloud services platform offered by Amazon. Using ThoughtSpot on AWS allows you to easily add instances as your dataset grows. You can do everything you'd normally want to do in a traditional database center with AWS. It features an on-demand delivery of IT resources and applications via the Internet with pay-as-you-go pricing. Amazon EC2 is based on instance types and the region in which your instances are running. When you are connected to an instance, you can use it just like you use any other server. There is no minimum fee and you only pay for what you use. Using Amazon EC2 lets you develop and deploy applications faster since there is no need to manage hardware. Therefore, it is easy to scale and manage computing capacity. As persistent block level storage volumes, Amazon EBS helps with scaling your EC2 instances. Each EBS volume is automatically replicated to protect you from component failure, and offers low-latency performance. About ThoughtSpot on AWS AWS can provide lots of memory and CPU for your ThoughtSpot instance, and it can be easily updated from development instances. Your database capacity will determine the number of instances you'll need and the instance network/storage requirements. In addition, you can go with multiple VMs based on your dataset size. The security group setting of your ThoughtSpot instance on AWS is up to you. You can find more information about which ports to open in the network ports reference. ThoughtSpot on AWS Quick Start Guide Copyright © 2017 by ThoughtSpot. All Rights Reserved. 7 Configuration Chapter 3: Configuration Configuration Topics: • Configuration options ThoughtSpot engineering has performed extensive testing of the ThoughtSpot appliance on various Amazon Elastic Compute Cloud (EC2) and Amazon Elastic Block Store (EBS) configurations for best performance, load balancing, scalability, and reliability. ThoughtSpot on AWS Quick Start Guide Copyright © 2017 by ThoughtSpot. All Rights Reserved. 8 Configuration Configuration options You can find information here on which configuration of memory, CPU, storage, and networking capacity you should be running for your instances. There are also details on how to configure your placement groups. Hardware configurations There is only one available hardware configuration for deploying ThoughtSpot on Amazon: • r4.16xlarge Below are charts depicting the specifications for the configuration for EC2 and EBS requirements. Table 3: EC2 requirements for deploying on Amazon Instance name Data capacity vCPUs DRAM r4.16xlarge Up to 250 GB 64 488 GB Table 4: EBS requirements for deploying on Amazon Instance name Data capacity Root volume Data volume (SSD or HDD) (SSD) r4.16xlarge Up to 250 GB 1 vol 200 GB 2 vols 400 GB each Attention: Both EC2 and EBS requirements must be fulfilled to deploy on Amazon. ThoughtSpot software license sizes ThoughtSpot only sells software licenses in multiples of 250 GB of data. So you can start with 250 GB, and add increments of 250 GB each time your data capacity needs increase. You can also choose to start off with more than 250 GB of data, as long as you know the best fit configuration for your data volume. ThoughtSpot on AWS Quick Start Guide Copyright © 2017 by ThoughtSpot. All Rights Reserved. 9 Configuration Lego blocks If you aren't sure what kind of configuration you need, it might help to think of the hardware configurations in terms of simple Lego blocks. The r4.16xlarge size can be seen as a 250 GB block. Note: ThoughtSpot does not support sizes other than r4.16xlarge. Since the minimum data volume offered is 250 GB, you would need one r4.16xlarge block to match the data capacity. This scales linearly. So, 500 GB would require two r4.16xlarge blocks. Placement groups A placement group is a logical grouping of instances within a single availability zone. Placement groups are recommended for applications that benefit from low network latency, high network throughput, or both. ThoughtSpot relies on high connectivity between nodes of a cluster, which is why creating a placement group is recommended. Being in same placement group will give you the best shot at the highest bandwidth across AWS EC2 instances and the lowest latencies. This will make the node-node network reach the closest AWS promised specs. Our default recommendation for a multiinstance setup requires a placement group since it works best for our application performance. Also, AWS will provide jumbo frames (9000 MTU) support in such situations, and they don't charge extra for being in the same placement group. Having said that, ThoughtSpot will still work with EC2s in the cluster across placement groups in an availability zone. Related information: EC2 instance types EC2 pricing EBS pricing ThoughtSpot on AWS Quick Start Guide Copyright © 2017 by ThoughtSpot. All Rights Reserved. 10 Configuration Placement groups ThoughtSpot on AWS Quick Start Guide Copyright © 2017 by ThoughtSpot. All Rights Reserved. 11 Installation and setup Chapter 4: Installation and setup Installation and setup Topics: • About the AMI • Launch an instance Here is an overview of the installation and setup information for getting ThoughtSpot on Amazon Web Services (AWS) Elastic Compute Cloud (EC2). After you've determined your configuration options, you must setup your virtual machines (VMs) using an Amazon Machine Image (AMI). This AMI will be shared with you by ThoughtSpot. ThoughtSpot on AWS Quick Start Guide Copyright © 2017 by ThoughtSpot. All Rights Reserved. 12 Installation and setup About the AMI An AMI is a preconfigured template that provides the information required to launch an instance. Check with your ThoughtSpot contact to learn about the latest version of the ThoughtSpot AMI. You must specify an AMI when you launch an instance. An AMI includes the following: • A template for the root volume for the instance (for example, an operating system, an appliance server, and applications). • Launch permissions that control which AWS accounts can use the AMI to launch instances. • A block device mapping that specifics the volumes to attach to the instance when it's launch. About ThoughtSpot's AMI The ThoughtSpot AMI comes provisioned with the custom ThoughtSpot image to make hosting simple. Once you've provided your AWS account ID and region where the VMs will be hosted, ThoughtSpot will share the current ThoughtSpot base AMI with you. This AMI has ThoughtSpot specific applications on an Ubuntu 12.04 base image. The EBS volumes required for ThoughtSpot install in AWS comes as part of the AMI. When you launch an EC2 instance from this image, the EBS volumes automatically get sized and provisioned. The storage attached to the base AMI is 200 GB (xvda), 2X400 GB (xvdb), and SSD gp2. It contains the max disks so that it can take care of the full load of the VM. Launch an instance Follow these steps to set up the VMs and launch ThoughtSpot. ThoughtSpot on AWS Quick Start Guide Copyright © 2017 by ThoughtSpot. All Rights Reserved. 13 Installation and setup ThoughtSpot instances on AWS need AWS EC2 instances to be provisioned in the AWS account before ThoughtSpot can be installed and launched. Please make sure you follow the guidelines below for your EC2 details: • EC2 instance type: r4.16xlarge. • Networking requirement: 10GbE network is needed between the VMs. This is the default for the chosen VM type. • Security: VMs need to be accessible from each other, which means they need to be on the same Amazon Virtual Private Cloud (VPC) and subnetwork. Additional external access may be required to bring data in/out of the VMs to your network. • Number of EC2 instances needed: Based on the datasets, the number of EC2 instances needed will vary. Also for staging larger datasets (> 50 GB per VM), there may be a need to provision additional attached EBS volumes that are SSD gp2 provisioned. To set up the VMs and launch ThoughtSpot: 1. Log in to your AWS account from the AWS Amazon sign in page. 2. Provide ThoughtSpot Support with your AWS account ID and the region where the VMs will be hosted. They will then grant you permissions and share the current ThoughtSpot base AMI with you. Tip: You can find your account ID and region on the top right corner of the AWS console. ThoughtSpot on AWS Quick Start Guide Copyright © 2017 by ThoughtSpot. All Rights Reserved. 14 Installation and setup Figure 1: AWS account ID and region 3. Navigate to the EC2 service dashboard by clicking Services, then select EC2. Figure 2: Navigate to the EC2 Dashboard 4. Make sure your selected region is correct on the top right corner of the dashboard. If not, select a different region you would like to launch your instance in. Let ThoughtSpot Support know if you change your region. ThoughtSpot on AWS Quick Start Guide Copyright © 2017 by ThoughtSpot. All Rights Reserved. 15 Installation and setup Figure 3: Select a region to launch your instance in 5. Create an instance by clicking Launch Instance. Figure 4: Launch an instance 6. Select the appropriate AMI from the AMI Selection step by clicking Select. The ThoughtSpot shared AMI can be found under the My AMIs tab. ThoughtSpot on AWS Quick Start Guide Copyright © 2017 by ThoughtSpot. All Rights Reserved. 16 Installation and setup Figure 5: Select the AMI 7. Select r4.16xlarge as the instance type. Then click Next: Configure Instance Details. 8. Configure the instances by choosing the number of EC2 instances you need based on your EC2 details. The instances need to be on the same VPC and subnetwork. Then click Next: Add Storage. 9. The default storage specified by the ThoughtSpot AMI should be populated. Optionally, you can add extra storage. Based on the dataset size requirement you might need to provision and prepare (formatting/file system placement) an extra storage of 400 GB per VM that is SSD gp2 provisioned. Click Next: Add Tags when you are done modifying the storage size. 10.Set a name for tagging your instances. Then click Next: Configure Security Group. 11. Select an existing security group to attach new security groups to such that it meets the security requirements for ThoughtSpot. Note: Security setting for ThoughtSpot: • The VMs need intragroup security, i.e. every VM in a cluster needs to be accessible from one another. For easier configuration, it is better to open all accesses from across VMs in a cluster. • Additionally, more ports need to be opened on the VM to provide data staging capabilities to your network. Check the network ports ThoughtSpot on AWS Quick Start Guide Copyright © 2017 by ThoughtSpot. All Rights Reserved. 17 Installation and setup reference to determine the minimum required ports that need to be opened for your ThoughtSpot appliance. 12.Click Review and Launch. After you have reviewed your instance launch details, click Launch. 13.Choose a key pair. A key pair consists of a public and private key used to encrypt and decrypt login information. If you don’t have a key pair, you should create one, otherwise you won’t be able to SSH into the AWS instance later on. 14.Click Launch Instances. Wait a few minutes for it to fully start up. Once it has started up, it will show up on the EC2 console. 15.Contact ThoughtSpot Support to complete your ThoughtSpot installation. ThoughtSpot on AWS Quick Start Guide Copyright © 2017 by ThoughtSpot. All Rights Reserved. 18 Chapter 5: ThoughtSpot on AWS reference ThoughtSpot on AWS reference Topics: • Network ports This section contains a reference for ThoughtSpot on Amazon Web Services (AWS). Network ports This reference lists the potential ports to open when setting up your security group. ThoughtSpot on AWS Quick Start Guide Copyright © 2017 by ThoughtSpot. All Rights Reserved. 19 ThoughtSpot on AWS reference Network ports For regular operations and for debugging, there are some ports you will need to keep open to network traffic from end users. Another, larger list of ports must be kept open for network traffic between the nodes in the cluster. Required ports for operations and debugging The following ports need to be opened up to requests from your user population. There are two main categories: operations and debugging. Table 5: Network ports to open for operations Port Protocol Service Name Direction 22 SSH bidirectional Administrators All nodes SSH Source Destination Description IP addresses Secure shell access. Also used for scp (secure copy). 80 HTTP HTTP bidirectional All users IP All nodes addresses Hypertext Transfer Protocol for website traffic. 443 HTTPS HTTPS bidirectional All users IP All nodes Secure HTTP. addresses 12345 TCP Simba bidirectional Administrators All nodes IP addresses Port used by ODBC and JDBC drivers when connecting to ThoughtSpot. Table 6: Network ports to open for debugging Port Protocol Service Name Direction 2201 HTTP bidirectional Administrator All nodes Orion master HTTP 2101 HTTP Oreo HTTP Source Destination Description IP addresses bidirectional Administrator All nodes IP addresses ThoughtSpot on AWS Quick Start Guide Copyright © Port used to debug the cluster manager. Port used to debug the node daemon. 2017 by ThoughtSpot. All Rights Reserved. 20 ThoughtSpot on AWS reference Port Protocol Service Name Direction 4001 HTTP bidirectional Administrator All nodes Falcon worker HTTP 4251 HTTP Sage master Source Destination Description IP addresses the data cache. bidirectional Administrator All nodes HTTP Port used to debug IP addresses Port used to debug the search engine. Required ports for inter-cluster operation Internally, ThoughtSpot uses static ports for communication between services in the cluster. Do not close these ports from inter-cluster network communications. In addition, a number of ports are dynamically assigned to services, which change between runs. The dynamic ports come from the range of Linux dynamically allocated ports (20K+). Table 7: Network ports to open between the nodes in the cluster Port Protocol Service Name Direction Source Dest. Description 80 TCP inbound All nodes All nodes Primary app HTTP nginx port (nginx) 443 TCP Secure nginx inbound All nodes All nodes Primary app HTTPS port (nginx) 2100 RPC Oreo RPC port bidirectional All nodes All nodes Node daemon RPC 2101 HTTP Oreo HTTP port bidirectional Admin IP All nodes Node daemon addresses HTTP and all nodes 2181 2200 RPC RPC Zookeeper servers bidirectional All nodes All nodes Zookeeper servers listen on this listen on this port for client port for client connections connections Orion master RPC bidirectional All nodes port ThoughtSpot on AWS Quick Start Guide All nodes Internal communication Copyright © 2017 by ThoughtSpot. All Rights Reserved. 21 ThoughtSpot on AWS reference Port Protocol Service Name Direction Source Dest. Description with the cluster manager 2201 HTTP Orion master bidirectional Admin IP HTTP port 2210 RPC Cluster stats All nodes Port used to addresses debug the cluster and all nodes manager bidirectional All nodes All nodes service RPC port Internal communication with the stats collector 2211 HTTP Cluster stats bidirectional Admin IP service HTTP port 2230 RPC Callosum stats All nodes Port used to addresses debug the stats and all nodes collector bidirectional All nodes All nodes collector RPC port Internal communication with the BI stats collector 2231 2240 HTTP RPC Callosum stats bidirectional Admin IP All nodes Port used to collector HTTP addresses debug the BI stats port and all nodes collector Alert manager bidirectional All nodes All nodes Port where alerting service receives alert events 2888 3888 RPC RPC Ports used by bidirectional All nodes All nodes Ports used by Zookeeper servers Zookeeper servers for communication for communication between between themselves themselves Ports used by bidirectional All nodes All nodes Ports used by Zookeeper servers Zookeeper servers for communication for communication ThoughtSpot on AWS Quick Start Guide Copyright © 2017 by ThoughtSpot. All Rights Reserved. 22 ThoughtSpot on AWS reference Port 4000 Protocol Service Name RPC Direction Source Dest. Description between between themselves themselves Falcon worker RPC bidirectional All nodes All nodes port Port used by data cache for communication between themselves 4001 HTTP Falcon worker bidirectional Admin IP HTTP port 4021 RPC Sage metadata All nodes addresses debug the data and all nodes cache bidirectional Admin IP All nodes HTTP Port where search service port addresses service contacts (exported by and all nodes metadata service Tomcat) 4201 Port used to Sage auto for metadata bidirectional Admin IP All nodes Port used to complete server addresses debug the search HTTP interface and all nodes service port 4231 HTTP Sage index server bidirectional Admin IP HTTP port 4232 RPC Sage index All nodes Port used to addresses debug the search and all nodes service bidirectional All nodes All nodes Port used server metadata for search subscriber port service internal communication 4233 RPC Sage index server bidirectional All nodes All nodes RPC port Port used for search service internal communication 4241 HTTP Sage auto bidirectional Admin IP All nodes Port used to complete server addresses debug the search HTTP port and all nodes service ThoughtSpot on AWS Quick Start Guide Copyright © 2017 by ThoughtSpot. All Rights Reserved. 23 ThoughtSpot on AWS reference Port Protocol Service Name Direction 4242 RPC bidirectional All nodes Sage auto Source Dest. Description All nodes Port used complete server for search RPC port service internal communication 4243 RPC Sage auto bidirectional All nodes All nodes Port used for complete server search internal metadata communication subscriber port 4251 RPC Sage master RPC bidirectional All nodes All nodes port Port used for search service internal communication 4405 RPC Diamond bidirectional All nodes All nodes (graphite) port Port used for communication with monitoring service 4500 RPC Trace vault service bidirectional All nodes All nodes RPC port Trace collection for ThoughtSpot services 4501 HTTP Trace vault service bidirectional Admin IP HTTP port All nodes addresses Debug trace collection and all nodes 4851 RPC Graphite manager bidirectional All nodes All nodes RPC port Communication with graphite manager 4852 HTTP Graphite manager bidirectional Admin IP HTTP port All nodes addresses Debug graphite manager and all nodes 4853 RPC Elastic search bidirectional All nodes All nodes Communication stack (ELK) with log search manager RPC port service ThoughtSpot on AWS Quick Start Guide Copyright © 2017 by ThoughtSpot. All Rights Reserved. 24 ThoughtSpot on AWS reference Port Protocol Service Name Direction 4853 HTTP bidirectional Admin IP Elastic search Source stack (ELK) addresses manager HTTP and all nodes Dest. Description All nodes Debug log search service port 5432 Postgres Postgres database bidirectional All nodes All nodes server port Communication with Postgres database 8020 RPC HDFS namenode bidirectional All nodes All nodes server RPC port Distributed file system (DFS) communication with clients 8080 HTTP Tomcat bidirectional All nodes All nodes BI engine communication with clients 8787 HTTP Periscope (UI) bidirectional All nodes All nodes service HTTP port 8888 HTTP HTTP proxy server back end bidirectional All nodes All nodes (tinyproxy) 11211 Memcached 12345 ODBC Administration UI Reverse SSH tunnel Memcached server bidirectional All nodes All nodes BI engine cache All nodes Port used for ETL port Simba server port bidirectional All nodes (extract, transform, load) 50070 HTTP HDFS namenode bidirectional All nodes All nodes server HTTP port 50075 HTTP HDFS datanode Debug DFS metadata bidirectional All nodes All nodes Debug DFS data server HTTP port Required ports for inbound and outbound cluster access ThoughtSpot uses static ports for inbound and outbound access to a cluster. ThoughtSpot on AWS Quick Start Guide Copyright © 2017 by ThoughtSpot. All Rights Reserved. 25 ThoughtSpot on AWS reference Table 8: Network ports to open for inbound access Port Protocol Service Name Direction 22 SCP SSH Source Dest. Description bidirectional ThoughtSpot All nodes Secure shell access. Support 80 HTTP HTTP bidirectional ThoughtSpot All nodes Hypertext Transfer Support Protocol for website traffic. 443 HTTPS HTTPS bidirectional ThoughtSpot All nodes Secure HTTP. Support 12345 TCP Simba bidirectional ThoughtSpot All nodes Port used by ODBC Support and JDBC drivers when connecting to ThoughtSpot. Table 9: Network ports to open for outbound access Port Protocol Service Name Direction Source Destination Description 443 HTTPS All nodes 208.83. HTTPS outbound 110.20 For transferring files to thoughtspot.egnyte.com (IP address 208.83.110.20). 25 SMTP or SMTP or outbound Secure SMTP 587 All nodes All nodes Allow outbound access and SMTP for the IP address of relay whichever email relay (provided by server is in use. This is customer) for sending alerts to ThoughtSpot Support. 389 TCP LDAP or or LDAPS outbound 636 All nodes All nodes Allow outbound access and LDAP for the IP address of the server LDAP server in use. (provided by customer) ThoughtSpot on AWS Quick Start Guide Copyright © 2017 by ThoughtSpot. All Rights Reserved. 26
Source Exif Data:
File Type : PDF File Type Extension : pdf MIME Type : application/pdf Linearized : No Page Count : 26 Profile CMM Type : lcms Profile Version : 2.1.0 Profile Class : Display Device Profile Color Space Data : RGB Profile Connection Space : XYZ Profile Date Time : 1998:02:09 06:49:00 Profile File Signature : acsp Primary Platform : Apple Computer Inc. CMM Flags : Not Embedded, Independent Device Manufacturer : IEC Device Model : sRGB Device Attributes : Reflective, Glossy, Positive, Color Rendering Intent : Perceptual Connection Space Illuminant : 0.9642 1 0.82491 Profile Creator : lcms Profile ID : 0 Profile Copyright : Copyright (c) 1998 Hewlett-Packard Company Profile Description : sRGB IEC61966-2.1 Media White Point : 0.95045 1 1.08905 Media Black Point : 0 0 0 Red Matrix Column : 0.43607 0.22249 0.01392 Green Matrix Column : 0.38515 0.71687 0.09708 Blue Matrix Column : 0.14307 0.06061 0.7141 Device Mfg Desc : IEC http://www.iec.ch Device Model Desc : IEC 61966-2.1 Default RGB colour space - sRGB Viewing Cond Desc : Reference Viewing Condition in IEC61966-2.1 Viewing Cond Illuminant : 19.6445 20.3718 16.8089 Viewing Cond Surround : 3.92889 4.07439 3.36179 Viewing Cond Illuminant Type : D50 Luminance : 76.03647 80 87.12462 Measurement Observer : CIE 1931 Measurement Backing : 0 0 0 Measurement Geometry : Unknown Measurement Flare : 0.999% Measurement Illuminant : D65 Technology : Cathode Ray Tube Display Red Tone Reproduction Curve : (Binary data 2060 bytes, use -b option to extract) Green Tone Reproduction Curve : (Binary data 2060 bytes, use -b option to extract) Blue Tone Reproduction Curve : (Binary data 2060 bytes, use -b option to extract) Format : application/pdf Language : en Date : 2017:02:27 12:56:13-08:00 Producer : Apache FOP Version 1.1 PDF Version : 1.4 Creator Tool : Apache FOP Version 1.1 Metadata Date : 2017:02:27 12:56:13-08:00 Create Date : 2017:02:27 12:56:13-08:00 Page Mode : UseOutlines Creator : Apache FOP Version 1.1EXIF Metadata provided by EXIF.tools