Tags: cluster

The What and the Why of the Cluster API

Throughout the evolution of software tools there exists a tension between generalization and partial specialization. A tool’s broader adoption is a form of natural selection, where its evolution is predicated on filling a given need, or role, better than its competition. This premise is imbued in...

6 Best Practices for Highly Available Kubernetes Clusters

A highly available microservice requires that the system gracefully handle the failures of its components. If one part of the system fails in any way, the system can recover without significant downtime. So how exactly can you achieve a highly available, highly reliable, and multi-master Kubernetes...

New TOP500 List Led by DOE Supercomputers

The latest TOP500 list of the world’s fastest supercomputers is out, a remarkable ranking that shows five Department of Energy supercomputers in the top 10, with the first two captured by Summit at Oak Ridge and Sierra at Livermore. With the number one and number two systems on the planet, the “...

Slurm Job Scheduling System

In previous articles, I examined some fundamental tools for HPC systems, including pdsh (parallel shells), Lmod environment modules, and shared storage with NFS and SSHFS. One remaining, virtually indispensable tool is a job scheduler. One of the most critical pieces of software on a shared cluster...

Connect Everything: A Look at How NATS.io can Lead to a Securely Connected World

Developing and deploying applications that communicate in distributed systems, especially in cloud computing, is complex. Messaging has evolved to address the general needs of distributed applications but hasn’t gone far enough. We need a messaging system that takes the next steps to address cloud...

Testing Kubernetes RBAC

Securing your Kubernetes cluster is one thing; keeping it secure is a continuous uphill struggle. However, with the introduction of new features to Kubernetes it is becoming much easier to do both. Kubernetes (as of version 1.6) has introduced the concept of Role-Based Access Control (RBAC), allows...

GLB: GitHub's Open Source Load Balancer

At GitHub, we serve tens of thousands of requests every second out of our network edge, operating on GitHub’s metal cloud. We’ve previously introduced GLB, our scalable load balancing solution for bare metal datacenters, which powers the majority of GitHub’s public web and git traffic, as well as...

pdsh Parallel Shell

For HPC, one of the fundamentals is being able to run a command across a number of nodes in a cluster. A parallel shell is a simple but powerful tool that allows you to do so on designated (or all) nodes in the cluster, so you do not have to log in to each node and run the same command. This single...

Implementing Advanced Scheduling Techniques with Kubernetes

One of the advantages of using an advanced container orchestration tool like Kubernetes is the flexible scheduler. This provides a wide range of options for users to specify conditions for assigning pods to particular worker nodes that satisfy a condition, and not just based on available resources...

Recommendations for High-Performance Computing on OpenStack

Over the last year, I’ve been working on use cases with high-performance computing (HPC) on OpenStack. In this post, I’ll offer some considerations about hosting high-performance and high-throughput workloads. First, let’s start with the three types of architectures that can be used when hosting...
