I was recently re-watching my Power BI Mixtape: Vol. 1 presentation and recalled a question from the audience as I was going through one of my Power Query demos. They suggested that I could update my solution to split a column by a line feed special character – and they were correct! I have outlined my previous approach to the demo in this post here:
As a quick recap, in this example I connected to an online data source and my goal was to have a simple table with three columns:
- FSA
- City
- Location within the city
The source data was in a matrix format, with data repeated across rows and columns in a table. Within each cell, there were three “levels” of information that correspond to the three columns I want in my clean table.
data:image/s3,"s3://crabby-images/90b73/90b7344173ecf094eef5f47d549bb1b4e2a63e0f" alt=""
Starting from the matrix, with repeated cells across columns and rows, I first consolidated all of the information I needed into a single column. Then I removed the other columns. This left me with a single column of data – each row containing three lines of information that I wanted to split into columns.
data:image/s3,"s3://crabby-images/60ee1/60ee1595599c2e43091c623875002be2d6a6b3fa" alt=""
At this point, in my previous solution, I removed the line feeds so that all of the information in each cell was on a single line. I then had to rely on a few different patterns to correctly split my data into three columns.
The solution worked, but it was a bit complicated and there were a few edge cases to account for.
data:image/s3,"s3://crabby-images/e2461/e24612d0b1069a8c542ac734163883b2808f7df3" alt=""
To simplify my transformation, rather than using the Text.Clean
function to remove the line feeds within each cell and THEN starting to try and split the strings, I leveraged these line feeds to split the column.
I started by splitting the source column by the first line feed "#(lf)"
character. This gave me my first clean column for the forward sortation area.
data:image/s3,"s3://crabby-images/fcaba/fcabaa4a5c86ac9d3716101f9aff4ae1470c4176" alt=""
I repeated this process to finalize the two additional columns that I needed: the city and a description.
data:image/s3,"s3://crabby-images/92a13/92a139a3e73e51d6589e28bbd128e04d0a0906ee" alt=""
Finally, I used the Text.Clean
function to remove any remaining line feeds.
data:image/s3,"s3://crabby-images/ae3c2/ae3c2077e37821694da28eb3ad70398491234ee3" alt=""
This got me to the same tidy data source for my report, but with transformation steps that were easier to read, audit, and maintain.