Posts: Aws

Placeholder image

Laszlo Puskas

Mon, Jul 16, 2018

Cloud agnostic cluster recommendations for Kubernetes

A few weeks back we released Telescopes, our Kubernetes cluster layout recommender application; the application has evolved quite a lot since then - this post tries to give an insight into the new features and the recent changes. Cloud cost management series: Overspending in the cloud Managing spot instance clusters on Kubernetes with Hollowtrees Monitor AWS spot instance terminations Diversifying AWS auto-scaling groups Draining Kubernetes nodes tl;dr: We added new features to Telescopes as support for blacklisting or whitelisting instance types Recommendation accuracies can be checked There is support for asking cloud instance types not just on CPU and memory but network performance as well The cloud Productinfo - the one place where you find all information about cloud providers supported instance types has been updated.

Read more...


Placeholder image

Marton Sereg

Thu, Jun 21, 2018

Cloud instance type and price information as a service

When we started to work on our cluster infrastructure recommender, Telescopes, we recognized that it’s not an easy thing to get instance type attributes and pricing information from the cloud providers programatically. While EC2, Google Cloud and Azure all provide some kind of APIs to query this information, in some cases these APIs are responding with partly inconsistent data or the responses are large chunk of JSON files that are very cumbersome to parse.

Read more...


Placeholder image

Flora Piszker

Wed, May 23, 2018

Create Kubernetes clusters in the cloud

Creating Kubernetes clusters in the cloud and deploy (or CI/CD) applications to them is not always trivial. While there are a few options out there, those are either cloud or distribution specific. While we were working on our open source Pipeline Platform we needed a solution which covers these (inclusive but not an exhaustive list of requirements): provisions Kubernetes clusters on all major cloud providers (via REST, UI and CLI) using a unified interface manages application lifecycle (on-demand deploy, CI/CD, dependency management, etc) preferably over a REST interface supports multi tenancy, and advanced security scenarios (app to app security with dynamic secrets, standards, multi-auth backends, and more) ability to build cross-cloud or hybrid Kubernetes environments This posts highlights the ease of creating Kubernetes clusters using the Pipeline API on the following providers:

Read more...


Placeholder image

Marton Sereg

Thu, May 3, 2018

Cloud instance type recommendation

Cloud cost management series: Overspending in the cloud Managing spot instance clusters on Kubernetes with Hollowtrees Monitor AWS spot instance terminations Diversifying AWS auto-scaling groups Draining Kubernetes nodes A few months ago we had a post about overspending in the cloud where we were discussing how difficult it is to keep track of the vast amount of instance types and pricing options of the cloud providers, especially on AWS with spot pricing.

Read more...


Placeholder image

Lajos Papp

Fri, Apr 20, 2018

Control your AWS spendings with ChatOps

While we are building our open source, cloud agnostic Heroku / Cloud Foundry like Paas, Pipeline built on top of Kubernetes we launch lots of clusters on different cloud providers. Most of these clusters are launched on spot or preemptible instances and managed by Hollowtrees, however there are many smaller development clusters, control planes, instances and proof of concepts we regularly do and they are marginally related or launched with Pipeline.

Read more...


Placeholder image

Sandor Magyari

Mon, Apr 16, 2018

Collecting Spark History Server event logs in the cloud

Apache Spark on Kubernetes series: Introduction to Spark on Kubernetes Scaling Spark made simple on Kubernetes The anatomy of Spark applications on Kubernetes Monitoring Apache Spark with Prometheus Apache Spark CI/CD workflow howto Spark History Server on Kubernetes Spark scheduling on Kubernetes demystified Spark Streaming Checkpointing on Kubernetes Deep dive into monitoring Spark and Zeppelin with Prometheus Spark Streaming Checkpointing on Kubernetes Deep dive into monitoring Spark and Zeppelin with Prometheus Apache Spark application resilience on Kubernetes

Read more...


Placeholder image

Marton Sereg

Mon, Apr 9, 2018

Draining Kubernetes nodes

Cloud cost management series: Overspending in the cloud Managing spot instance clusters on Kubernetes with Hollowtrees Monitor AWS spot instance terminations Diversifying AWS auto-scaling groups Draining Kubernetes nodes Cluster recommender Cloud instance type and price information as a service Kubernetes was designed in a way to be fault tolerant to worker node failures. If a node goes missing because of a hardware problem, a cloud infrastructure problem, or in general Kubernetes simply no longer receives heartbeat messages from that node because of any reason, the Kubernetes control plane is clever enough to handle these failures.

Read more...


Placeholder image

Marton Sereg

Mon, Feb 12, 2018

Diversifying AWS auto-scaling groups, or how to write a Hollowtrees action plugin

Cloud cost management series: Overspending in the cloud Managing spot instance clusters on Kubernetes with Hollowtrees Monitor AWS spot instance terminations Diversifying AWS auto-scaling groups Draining Kubernetes nodes Cluster recommender Cloud instance type and price information as a service You may remember the Hollowtrees project we’ve open sourced a few weeks ago - a framework to manage AWS spot instance clusters with a few batteries included: Hollowtrees is an alert-react based framework part of the Pipeline PaaS which coordinates monitoring, applies rules and dispatches action chains towards plugins using standard CNCF interfaces AWS spot instance termination Prometheus exporter AWS autoscaling group Prometheus exporter AWS Spot Instance recommender Kubernetes action plugin to execute k8s operations (e.

Read more...


Placeholder image

Marton Sereg

Mon, Feb 5, 2018

Monitor AWS spot instance terminations

Cloud cost management series: Overspending in the cloud Managing spot instance clusters on Kubernetes with Hollowtrees Monitor AWS spot instance terminations Diversifying AWS auto-scaling groups Draining Kubernetes nodes Cluster recommender Cloud instance type and price information as a service Last week we have opensourced the Hollowtrees project - a framework to manage AWS spot instance clusters with a few batteries included: Hollowtrees is an alert-react based framework part of the Pipeline PaaS which coordinates monitoring, applies rules and dispatches action chains towards plugins using standard CNCF interfaces AWS spot instance termination Prometheus exporter AWS autoscaling group Prometheus exporter AWS Spot Instance recommender Kubernetes action plugin to execute k8s operations (e.

Read more...


Placeholder image

Sandor Magyari

Thu, Jan 18, 2018

Introduction to distributed TensorFlow on Kubernetes

Last time we were discussing about how our Pipeline PaaS is deploying and provisioning an AWS EFS filesystem on Kubernetes and what are the performance benefits for Spark or TensorFlow. This post is about: Introduction to TensorFlow on Kubernetes Benefits of EFS for TensorFlow (store image data for TensorFlow jobs) Pipeline uses the kubeflow framework to deploy: A JupyterHub to create & manage interactive Jupyter notebooks A TensorFlow Training Controller that can be configured to use CPUs or GPUs A TensorFlow Serving container Note that beside the ones above Pipeline also has default Spotguides for Spark and Zeppelin as well to support your datascience experience

Read more...


Placeholder image

Sandor Magyari

Mon, Jan 15, 2018

Amazon Elastic File System on Kubernetes

At Banzai Cloud we provision different frameworks and tools like Spark, Zeppelin and most recently Tensorflow, all running on our Pipeline PaaS (built on Kubernetes). One of Pipeline’s early adopter is running a Tensorflow Training Controller using GPUs on AWS EC2 wired into our CI/CD pipeline and needed significant parallelization for reading training data. We have introduced support for Amazon Elastic File System and will make it publicly available in the forthcoming release of Pipeline.

Read more...