Kevin Feasel – Page 356

Following on in my series, in this blog post I am going to use Dataflow Gen2 in Microsoft Fabric to load the data into a lakehouse table.

Doing this will allow me to store the data in a Delta Lake table.

In this series I am going to show you all the steps I took to achieve the successful outcome I had with my client.

Click through for links to the first two parts of the series, as well as a step-by-step guide for part 3.

Comments closed

Structured Programming in R with Logic and Flow Control

Published 2023-08-30 by Kevin Feasel

Adrian Tam continues a primer on R:

R is a procedural programming language. Therefore, it has the full set of flow control syntax like many other languages. Indeed, the flow control syntax in R is similar to Java and C. In this post, you will see some examples of using the flow control syntax in R.

Read on for examples of flow control (if/else, for, etc.) and creating functions.


Flink Streaming Use Cases for Kafka Users

Published 2023-08-30 by Kevin Feasel

Jean-Sebastien Brunner gives us some use cases:

In Part One of our “Inside Flink” blog series, we explored the critical role of stream processing and why developers are increasingly choosing Apache Flink® over other frameworks.

In this second installment, we’ll showcase how innovative teams across every industry and size are putting stream processing into practice – from streaming data pipelines to train ML models or more timely analytics to fraud detection in finance and real-time inventory management in retail. We’ll also discuss how Flink is uniquely suited to support a wide spectrum of use cases and helps teams uncover immediate insights in their data streams and react to events in real time.

This article stays more at the “art of the possible” level rather than drilling into how we can do it.


The Concept of Schema in Relational Databases

Published 2023-08-30 by Kevin Feasel

Adron Hall explains how different relational database management systems describe schemas:

From the viewpoint of someone familiar with the general idea of a schema, it can indeed seem unusual that databases like SQL Server, Oracle, MariaDB/MySQL, and PostgreSQL each interpret and implement schemas in slightly (or sometimes, vastly) different ways. While the core idea behind a schema as a structured container or namespace for database objects remains somewhat consistent, the exact nature, utility, and behavior of schemas vary across these systems.

Read on for an overview of these four products, as well as what the ANSI standard indicates.


Visualizing when Lower is Better

Published 2023-08-30 by Kevin Feasel

Alex Velez inverts a common experience:

When quickly scanning, I wonder why the direct and indirect sales teams underperformed in 2022. Mostly, they fell below the goal of 90 days, exceeding their target only three times.

Now, pausing to think more critically about the context of this scenario, I realize I’ve misread the graph—specifically the goal line. Targets and goals are often seen as minimum thresholds, not maximum limits. But in the sales industry, the goal is to close a deal as quickly as possible. In this visual, below the goal line is actually a good thing!

This graph challenges my standard construct of targets and goals, which could lead to confusion or, worse, the wrong conclusions if I’m not careful.

Read on for five alternative ways to display this graph and (hopefully) reduce confusion.


Setting Table and Matrix Column Widths in Power BI

Published 2023-08-30 by Kevin Feasel

Kurt Buhler controls the horizontal, Kurt Buhler controls the vertical:

One challenge of the table and matrix visuals in Power BI is that it’s difficult to precisely and consistently set column widths. Unlike in Excel, where you can set the row and column widths in a spreadsheet, you have no option in the visual interface to control the column width property. However, it’s still possible to control it in the report metadata, which is exposed in the officially supported Power BI Projects format (.pbip) which is in preview. Notably, however, opening and modifying report metadata from this format isn’t yet supported. Despite that fact, it still works reliably, so I thought I’d demonstrate how to do this.

There are a fair number of steps involved but it all makes sense in the end.


Shortcuts in Microsoft Fabric

Published 2023-08-30 by Kevin Feasel

Adam Saxton explains the power of shortcuts:

We feel shortcuts are one of the most powerful capabilities within OneLake in Microsoft Fabric! Adam walks through what these are and how you can use them.

Click through for a video and a couple of Microsoft Learn links on the topic of shortcuts.


Data Type Conversions and Snowflake Performance

Published 2023-08-30 by Kevin Feasel

Kevin Wilkie is implicit in this whole thing:

One of the ways we can get better at speed is to attempt several slightly different ways that can get you (hopefully) the same data. Some tables work better with one query while some work better with another query.

Let’s work through a scenario in Snowflake and we’ll see which one is faster under “normal” conditions.

Click through for a few query examples and how they end up performing.


Comparing the Microsoft Fabric Data Wrangler and Power Query Editor

Published 2023-08-30 by Kevin Feasel

Reza Rad performs a comparison:

Power Query Editor and Data Wrangler are data transformation and preparation tools in Microsoft Fabric. There are similarities between these two tools. However, there are differences, too. It is essential to know the capabilities of each tool to understand which one should be used for what purpose and scenario. In this article, this is our quest.

Reza includes a video and an article, with a summary chart at the bottom.


Versioned State Store in Kafka Streams

Published 2023-08-29 by Kevin Feasel

Victoria Xia announces new functionality in Apache Kafka 3.5:

Since the introduction of stream processing, there have been three certainties in life: death, taxes, and out-of-order data. As a stream processing library built for Apache Kafka, Kafka Streams processes data in offset order. When out-of-order data is present, offset order differs from timestamp order and care must be taken to ensure that processing results respect timestamp order where appropriate. The introduction of versioned state stores to Kafka Streams in the Apache Kafka 3.5 release is a huge milestone in this direction.

In this blog post, I’ll address the what, why, and how of versioned stores in Kafka Streams, including what they are, why you might like to use them, how to get started, and a couple of things to watch out for when upgrading.

Read on to see what this entails and how you can try it out yourself.
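
To make the offset-order versus timestamp-order distinction concrete, here is a minimal sketch in Python. It is a toy in-memory model, not the Kafka Streams API: the `ToyVersionedStore` class, its `put` and `get` methods, and the sample records are all hypothetical names made up for illustration. The idea is that a plain "last write wins" store returns whatever arrived last, while a store that keeps versions by timestamp can answer "what was the value as of time T?" even when records arrive out of order.

```python
from bisect import bisect_right
from collections import defaultdict


class ToyVersionedStore:
    """Toy in-memory model of a versioned key-value store (NOT the Kafka Streams API)."""

    def __init__(self):
        # key -> list of (timestamp, value) pairs, kept sorted by timestamp
        self._versions = defaultdict(list)

    def put(self, key, value, timestamp):
        versions = self._versions[key]
        # Insert by timestamp so that late-arriving records land in the right slot.
        index = bisect_right([ts for ts, _ in versions], timestamp)
        versions.insert(index, (timestamp, value))

    def get(self, key, as_of):
        """Return the value that was current for `key` at time `as_of`, or None."""
        versions = self._versions.get(key, [])
        index = bisect_right([ts for ts, _ in versions], as_of)
        return versions[index - 1][1] if index > 0 else None


# Hypothetical records as (key, value, timestamp), listed in arrival (offset) order.
# The third record is out of order: it arrives last but has an earlier timestamp.
records = [("price", 10, 100), ("price", 30, 300), ("price", 20, 200)]

store = ToyVersionedStore()
latest_by_offset = {}
for key, value, timestamp in records:
    store.put(key, value, timestamp)
    latest_by_offset[key] = value  # naive store: the last record by offset wins
```

With these sample records, `latest_by_offset["price"]` holds the late-arriving value, while `store.get("price", as_of=300)` returns the value with the newest timestamp. A real versioned state store also deals with history retention and serialization, which this sketch ignores.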



Author: Kevin Feasel

Storing Log Analytics Data in the Microsoft Fabric Lakehouse
