Azure Databricks Series - Part 4
Databricks Clusters

This is a continuation of my series of posts on Databricks where we most recently reviewed the Workspace & Notebooks. Now let’s get more familiar with the concept of clusters. Clusters Databricks breaks clusters into multiple categories: All-Purpose Clusters Job Clusters Pools Spark clusters consist of a single driver node and multiple worker nodes. The…

Azure Databricks Series - Part 3
Workspaces & Notebooks

Now that you’ve instantiated the Databricks service within Azure, let’s take a tour of the workspace & become familiar with Notebooks. Workspace The above image shows the Databricks homepage of this workspace. The left menu provides the majority of your options (outside of administration). Clicking on Workspace expands to the following: The workspace is divided…

Building a Tag Index with Jekyll and Liquid

Jekyll and Liquid provide a powerful yet easy-to-use set of tools for creating websites through both the plugins provided by Jekyll and the template markup provided by Liquid. While an integral part of Jekyll, Liquid is a stand-alone project widely implemented in other projects, including many other static site generators. The Jekyll page and post…

ML & AI for Software Developers - Part 16
Deep Learning

Deep learning is a subset of machine learning that relies primarily on neural networks. Most of what’s considered AI today is accomplished with deep learning. From recognizing objects in photos to real-time speech translation to using computers to generate art, music, poetry, and photorealistic faces, deep learning allows computers to perform feats of magic that…

Azure Databricks Series - Part 1
Intro to Azure Databricks

Many companies today have aging data architectures. As you look to modernize your traditional ETL pipeline, there is a tool you should keep in mind: Azure Databricks. During your move into Azure, there will probably be a place for Azure Databricks. In the past, general DTS/SSIS pipelines and SQL Server engines were sufficient but with…

Using GitHub Actions to Manage Certbot (Let’s Encrypt) Certificates

GitHub Actions is an excellent source for all things automation. For personal accounts, there’s a limited free offering that allows you to run automation jobs. I use GH actions to update my websites. I don’t think I would have written this if no existing solution worked with GitHub Actions. But after a search, I did…

ML & AI for Software Developers - Part 15
Building Machine-Learning Models with ML.NET

Scikit-learn is arguably the world’s most popular machine-learning framework. The efficacy of the library, the documentation that accompanies it, and the mindshare that surrounds it are the primary reasons more ML models are written in Python than any other language. But Scikit isn’t the only machine-learning framework. Others exist for other languages, and if you…

Azure Migrations - Part 4
Database Migration Assistant Assessment

Azure offers a lot of options for moving SQL databases from SQL Server to Azure. Knowing which choice is the best choice for your database can sometimes be challenging. Microsoft provides a tool, the Database Migration Assistant, that assesses your database to show you if it is compatible or not. And what changes you may…

ML & AI for Software Developers - Part 14
Operationalizing Machine-Learning Models

All of the machine-learning models presented so far in this series were written in Python. Models don’t have to be written in Python, but many are, thanks in part to the numerous world-class Python libraries that are available, including Pandas and Scikit-learn. ML models written in Python are easily consumed in Python apps. Calling them from…