
After we can get the raw data, the next step an analyst do is usually to check for some statistics of it. Using SQL we can do that in straightforward manner as follows.
Continue reading “SQL 102”
After we can get the raw data, the next step an analyst do is usually to check for some statistics of it. Using SQL we can do that in straightforward manner as follows.
Continue reading “SQL 102”
The most efficient way to get our data is by having a direct access to the database. And fortunately, the most analyzed data is still in structured format so far. This post will assume that we have a database and a client app installed in our laptop. We can choose to install an open source database like Postgre or MySQL. As client app, we can use DBeaver or the equivalent. After connecting the client app to rhe database, we can start to roll. There are three commands as baseline pattern for a query. Keep reading to know more!
Continue reading “SQL 101”
This role deals a lot with handling data requests and past performance analysis. The first required understanding on how the data is generated and where to find them. Still straightforward if we deal with 10 or 20 tables. But what if the tables reaches 100, 200 or more? The second is even more complex. After we get the data, how we can be confident with the data at our disposal? Is it clean enough, as in no testing data or outliers inside? To overcome these challenges, these are three most important skill for data analysts.
Continue reading “Must Have Skills for Data Analyst”
With a wide and variety of work, to understand data and provide business with recommendations, we will need a strong team with multiple roles interact with each other. This team also needs to live in two distinctive and different worlds. One is emerged from business point of view, where what is important is either increasing company’s revenue, or reducing the cost. The other one is from engineering where we need to understand where to get the data, how to clean, explore, train and implement our scripts such that we can make the code work and alert us with minimum supervision. In general, we need a team which covers three roles. There are new roles emerged, of course. But the root came from the original three. Read on to read what are those three.
Continue reading “Data Related Roles”
This kind of data is a bit abstract to imagine. If we don’t come from computer science background, let’s imagine that this type is somewhat like a table in Excel file, but the cell can have another table inside. Keep reading for further explanation.
Continue reading “Semi Structured Data”
Have you ever wondered how Youtube stores our videos? Not only they need to play it well, but they also need to search it as fast as we type our keywords. This new kind of data does not fit well with the existing approach of structured data. We need a new way to store and retrieve this kind of data.
Continue reading “Unstructured Data”
This is the most common data classification we can find. Been here very long and it will be here to stay amid incoming new waves of technology in data processing. Reason is simple. It is reliable, mature, fast and scalable for transactional data. For more details about this data type, keep reading!
Continue reading “Structured Data”
With so many data around us, sometimes we wonder if there are some way to group them based on some characteristics. Fortunately, the answer is yes. We can have some patterns to identify what kind of data we see around us. Basically, all data can be classified into three categories.
Continue reading “What kind of data available out there?”
In simple terms, we can say data is one or more attributes to measure something. Example could be our body. We have weight, height, nose length, gender, hair color, skin color etc to describe our physical appearance.
Continue reading “So, What is Data?”
After knowing that consumer sector is the largest driver of economy in my previous post, the next step is to pick which company to go. We will focus on publicly listed company since they have to provide their financial report periodically to Indonesian Stock Exchange (IDX). The full list is available in here and it is updated daily if there are new additions or removals. We only need to select Consumer Goods as the sector for now. There are three parts of financial report namely Asset Liabilities, Profit Loss and Cash Flow. This post will focus on Asset Liabilities first. The other will come after. Read on for more details on this.
Continue reading “Understanding Financial Report – Part I”