DataLemur, free tier, MEDIUM
Nothing special here again: you can use a CTE or a subselect; the main target of this question was to get you to use the RANK() or ROW_NUMBER() window function.
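For reference, a minimal sketch of that pattern (the table and column names here are made up, not the actual DataLemur schema):
with ranked as (
  select
    user_id,
    spend,
    -- ROW_NUMBER() breaks ties arbitrarily; RANK() would give tied rows the same number
    row_number() over (order by spend desc) as rn
  from transactions
)
select user_id, spend
from ranked
where rn <= 3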
DataLemur, free tier
This pretty much goes over the basics:
- use of aggregates
- use of HAVING
- use of multiple ORDER BY conditions
- use of LIMIT
Not tricky
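For example, something in this shape ticks all four boxes (candy_sales and its columns are hypothetical names, not the real DataLemur table):
select category
  , sum(units_sold) as total_units
from candy_sales
group by category
having sum(units_sold) > 100  -- filter on the aggregate
order by total_units desc, category asc
limit 5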
- the spike in the winter is bigger than last year's
- even the summer case counts are higher this year
First you need a server to use Management Studio with...
Install the SQL Server 2019 version.
Now you can log in with the first connection string.
Copy the .mdf file into the folder that this prompts for and select it.
And this is good to go.
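If you would rather attach the .mdf with T-SQL instead of the GUI dialog, a minimal sketch (the database name and file path are placeholders for your own setup):
-- attach an existing .mdf as a new database; adjust the name and path to your copy
-- FOR ATTACH_REBUILD_LOG recreates the log file in case you only copied the .mdf
CREATE DATABASE MySampleDb
ON (FILENAME = 'C:\SQLData\MySampleDb.mdf')
FOR ATTACH_REBUILD_LOG;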
with data_21 as (
select * from (
select country_region, date as date_21, confirmed as confirmed_21, deaths as deaths_21
,LAG(confirmed, 1) OVER (PARTITION BY country_region ORDER BY date ASC) AS prev_day_21
,confirmed - LAG(confirmed, 1) OVER (PARTITION BY country_region ORDER BY date ASC) as new_cases_21
from `bigquery-public-data.covid19_jhu_csse.summary`
WHERE upper(country_region) = 'HUNGARY' AND date between '2020-01-01'
AND (select max(date) from `bigquery-public-data.covid19_jhu_csse.summary`)
)),
MONTHLY_NEW_CASES AS (
select
left(cast(date_21 as string),7) as yearmonth
,sum(new_cases_21) as res
from data_21
group by 1
order by 1 asc
),
TOTAL_CASES_MONTHS AS (
SELECT
left(cast(date_21 as string),7) as yearmonth
,MAX(confirmed_21) as res
from data_21
group by 1
order ...
import pandas as pd
# pet names with some junk in them: a trailing space and a "@" instead of "á"
pets = pd.Series(['Cirmi ', "Bund@s", "Gyereide"])
pets
# replace the "@" with the intended accented character
pets = pets.str.replace("@", "á")
pets
# the lengths still include the trailing whitespace
pets.str.len()
# strip the whitespace and check again
pets = pets.str.strip()
pets.str.len()
Website is: https://api.chucknorris.io/
Alright, let's see what we can do:
This is a web source, no authentication needed, so:
Just connect aaand..
Convert the records to columns in a table and keep only the value column, which contains the joke.
Now for the categories:
Let's make a summarized table:
Now we have a separate Excel file with 2 values.
Let's make our summarized table use this profit value.
with data_21 as (
select * from (
select country_region, date as date_21, confirmed as confirmed_21, deaths as deaths_21
,LAG(confirmed, 1) OVER (PARTITION BY country_region ORDER BY date ASC) AS prev_day_21
,confirmed - LAG(confirmed, 1) OVER (PARTITION BY country_region ORDER BY date ASC) as new_cases_21
from `bigquery-public-data.covid19_jhu_csse.summary`
WHERE upper(country_region) = 'HUNGARY' AND date between '2022-01-01'
AND (select max(date) from `bigquery-public-data.covid19_jhu_csse.summary`)
))
select
*
from data_21
Step 1: Turn on Performance Analyzer and select the 'Analyze this visual' option.
After that, Performance Analyzer adds some lines:
Click 'Copy query' and paste it anywhere:
And you can see the distinct values in it. Let's add something unique, like the OrderID:
These are auto-generated in the background... and we have reached the end of this idea.
So the basics, nothing hard here:
Let's see the metadata first:
SELECT column_name, data_type, description
FROM `bigquery-public-data`.austin_311.INFORMATION_SCHEMA.COLUMN_FIELD_PATHS
WHERE table_name = "311_service_requests"
Let's see how many years this database encompasses:
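The query itself isn't preserved here; a sketch of what it could look like, assuming the table has a created_date timestamp column (worth checking against the metadata above):
SELECT
  MIN(EXTRACT(YEAR FROM created_date)) AS first_year,
  MAX(EXTRACT(YEAR FROM created_date)) AS last_year
FROM `bigquery-public-data.austin_311.311_service_requests`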
Looks like a lot of time.
As the dataset seems to have changed or disappeared from the public database, this is the end of this analysis.
Same as always, the weekends get added to the next Monday's results.
This causes spikes; however, we can see that these spikes are fortunately going down too.
Let's remove these spike days.
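The exact query isn't shown here, but a minimal sketch of the idea on top of the usual new-cases subquery (in BigQuery EXTRACT(DAYOFWEEK ...) returns 1 for Sunday, so 2 is Monday):
select *
from (
  select date
  ,confirmed - LAG(confirmed, 1) OVER (PARTITION BY country_region ORDER BY date ASC) as new_cases
  from `bigquery-public-data.covid19_jhu_csse.summary`
  WHERE upper(country_region) = 'HUNGARY'
)
WHERE new_cases > 0
-- drop the Mondays that carry the weekend backlog
AND EXTRACT(DAYOFWEEK FROM date) != 2
ORDER BY date asc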
It doesn't really change much. Looks like they test more at the beginning of the week, and by the end it drops off.
with data_21 as (
select * from (
select country_region, date as date_21, confirmed as confirmed_21, deaths as deaths_21
,LAG(confirmed, 1) OVER (PARTITION BY country_region ORDER BY date ASC) AS prev_day_21
,confirmed - LAG(confirmed, 1) OVER (PARTITION BY country_region ORDER BY date ASC) as new_cases_21
from `bigquery-public-data.covid19_jhu_csse.summary`
WHERE upper(country_region) = 'HUNGARY' AND date between '2021-10-01'
AND (select max(date) from ...
For this to work, we need a Calendar table, also known as a date table.
Why isn't one of our existing date columns good enough? Because:
- we need a table which contains all dates, without skipping non-business days, holidays, etc., so all 365 days of the year
- every date must occur exactly once (while in our tables multiple transactions can happen on a day, or even none at all)
So let's create this table; the easiest way is through the DAX CALENDARAUTO function. This will cover the date range that exists in our dataset.
The main date column has to be set to DATE format, NOT DATETIME. Make sure you have it in DATE format through any means necessary.
When this is done, you need to create the relationship between this table and your tables. This is done in a relatively easy way as ...
Filter Sum of profit AFTER A SPECIFIC DATE with only GUI options:
Profit is set to whole number just for readability.
--------------------------------------------------
Filter with parameter and DAX formulas:
1. Add parameter
2. Convert the parameter to a query so it can be referenced in the formula:
3. Create the DAX formula
4. Check if the results are the same
The Excel check returns the same, so it works.
----------------------------------------------------------------------
Use filters to select specific data:
Some other touches:
Replace_from_other_column = Table.ReplaceValue(#"Changed Type",null, each [Profit] , Replacer.ReplaceValue,{"Technical_test_col"})
Filtering rows:
Grouping:
#"Grouped Rows" = Table.Group(#"Changed Type", {"State"}, {{"c_Pofit_by_state", each List.Sum([Profit]), type nullable number}})
Note: this is pretty slow when doing it from the editor, gui pivot is much faster.
What is going on?
Table:
The days with abnormally high case counts were usually Mondays; there are no Saturdays or Sundays in the report at all.
This means Mondays contain three days' worth of new cases.
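The table itself isn't reproduced here, but tagging each spike day with its weekday name is easy (a quick sketch reusing the same source table):
select *
,FORMAT_DATE('%A', date) as day_name  -- e.g. 'Monday'
from (
  select date
  ,confirmed - LAG(confirmed, 1) OVER (PARTITION BY country_region ORDER BY date ASC) as new_cases
  from `bigquery-public-data.covid19_jhu_csse.summary`
  WHERE upper(country_region) = 'HUNGARY'
)
WHERE new_cases > 0
ORDER BY new_cases desc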
select *
from (
select country_region, date, confirmed, deaths
,LAG(confirmed, 1) OVER (PARTITION BY country_region ORDER BY date ASC) AS prev_day
,confirmed - LAG(confirmed, 1) OVER (PARTITION BY country_region ORDER BY date ASC) as new_cases
from `bigquery-public-data.covid19_jhu_csse.summary`
WHERE upper(country_region) = 'HUNGARY' AND
date between '2021-09-21' AND (select max(date) from
`bigquery-public-data.covid19_jhu_csse.summary`)
)
WHERE new_cases > 0
ORDER BY 2 asc
select *
from (
select country_region, date, confirmed, deaths, recovered, active
,LAG(confirmed, 1) OVER (PARTITION BY country_region ORDER BY date ASC) AS prev_day
,confirmed - LAG(confirmed, 1) OVER (PARTITION BY country_region ORDER BY date ASC) as new_cases
from `bigquery-public-data.covid19_jhu_csse.summary`
WHERE upper(country_region) = 'HUNGARY' AND
date between '2021-07-30' AND (select max(date) from
`bigquery-public-data.covid19_jhu_csse.summary`)
)
WHERE new_cases >0
ORDER BY 2 asc
More testing maybe? From single digits to quadruple digits in 2 months :(
select country_region, date, confirmed, deaths, recovered, active
,LAG(confirmed, 1) OVER (PARTITION BY country_region ORDER BY date ASC) AS prev_day
,confirmed - LAG(confirmed, 1) OVER (PARTITION BY country_region ORDER BY date ASC) as new_cases
from `bigquery-public-data.covid19_jhu_csse.summary`
WHERE upper(country_region) = 'ISRAEL' AND
date between '2021-06-01' AND (select max(date) from
`bigquery-public-data.covid19_jhu_csse.summary`)
But why would you do this convoluted solution?
Which companies are most involved?
with prep as (
select 'John Doe' as name union all select 'Jane Doe' union all select 'ABC DEF'
)
select *
-- the non-capturing group skips the first {n} whitespace-delimited words,
-- then the capturing group grabs the next word
,Regexp_extract(name,r'^(?:[^\s]*\s){0}([^\s]*)\s?') as Word0
,Regexp_extract(name,r'^(?:[^\s]*\s){1}([^\s]*)\s?') as Word1
from prep
If some names have more parts, the ones with fewer parts don't throw errors, they just come back empty:
Great, I love it.
Coding challenges source: StrataScratch
From:
https://www.kaggle.com/spscientist/students-performance-in-exams?select=StudentsPerformance.csv
with check_countries as (
SELECT count(distinct country_region)
FROM `bigquery-public-data.covid19_jhu_csse.summary` LIMIT 1000
),
Hu_filter as (
select 'HU' as country, * from `bigquery-public-data.covid19_jhu_csse.summary`
where upper(country_region) = 'HUNGARY'
),
filtered as (
select date, confirmed, deaths, recovered, active
, LAG(confirmed, 1) OVER (PARTITION BY country ORDER BY date ASC) AS prev_day
from Hu_filter
WHERE date between '2021-03-01' and (select max(date) from `bigquery-public-data.covid19_jhu_csse.summary`)
)
select *,
confirmed - prev_day as new_cases
--,round(1-(deaths/confirmed),4) as survival_rate
from filtered
where confirmed - prev_day is not null
order by date asc
with generate_months as (
SELECT * FROM
UNNEST(GENERATE_DATE_ARRAY('2020-01-01', '2021-12-1', INTERVAL 1 MONTH)) AS dates
)
select 1 as key, left(cast(dates as string),7) as yearmonth
from generate_months
with ranked as (
select *
,dense_rank() over (order by score desc) as rn
from scores
)
select score, rn as 'Rank'
from ranked
order by score desc
with combined as (
select E.*, D.name as dep_name
from employee E
left join department D
on E.departmentid = d.id
),
ranked as (
select *
,dense_rank() over (partition by dep_name order by salary desc) as rn
from combined
)
select
dep_name as department,
name as Employee,
salary as Salary
from ranked where rn <= 3
Let's take this example table first:
This results in a simple 8 rows:
When I use the same table 3 times in the FROM clause:
512 rows... But why? 8 * 8 * 8 = 512.
So you need to narrow that combination down with the commented part:
Now we have the same table, 3 times, without the weird combinations.
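A small sketch of the idea (the orders table and its id column are hypothetical stand-ins for whatever 8-row table is used above):
-- without any condition this is a pure cartesian product: 8 * 8 * 8 = 512 rows
select a.id
from orders a, orders b, orders c
-- the "commented part": only keep combinations where all three copies point at the same row
where a.id = b.id
  and b.id = c.id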
This is a partitioned table... let's see what values we can partition by then:
Test:
How many companies:
SIC codes:
Getting all data:
So nothing was viewed specifically
Browser versions:
Get the value of the text in the form, TRY IT OUT:
with check_countries as (
SELECT count(distinct country_region)
FROM `bigquery-public-data.covid19_jhu_csse.summary` LIMIT 1000
),
India_filter as (
select
country_region,
date,
sum(confirmed) as confirmed,
sum(deaths) as deaths,
sum(recovered) as recovered,
sum(active) as active
FROM `bigquery-public-data.covid19_jhu_csse.summary`
where upper(country_region) = 'INDIA'
GROUP BY 1,2
),
filtered as (
select date, confirmed, deaths, recovered, active
, LAG(confirmed, 1) OVER (PARTITION BY country_region ORDER BY date ASC) AS prev_day
from India_filter
WHERE date between '2021-02-01' and (select max(date) from `bigquery-public-data.covid19_jhu_csse.summary`)
),
usual_tbl as (
select *,
confirmed - prev_day as new_cases
--,round(1-(deaths/confirmed),4) as survival_rate
from ...
For Tag Manager, or just as a basic JS/HTML exercise:
<script>
function GiveBackName() {
let inputvalue = document.getElementById("fname").value
alert(inputvalue)
console.log(inputvalue)
// for TagManager you just need:
// return document.getElementById("fname").value
}
</script>
SELECT
ReportsTo
,count(id) as Members
,round(avg(age)) as `Average Age`
FROM maintable_R70YE
where ReportsTo is not null
group by ReportsTo
order by ReportsTo asc
And the results are:
Good news: it seems to still be dropping more and more.
SELECT
device.browser,
SUM ( totals.transactions ) AS total_transactions
FROM `bigquery-public-data.google_analytics_sample.ga_sessions_*`
WHERE
_TABLE_SUFFIX BETWEEN '20170701' AND '20170731'
GROUP BY device.browser
HAVING SUM ( totals.transactions ) > 0
ORDER BY
total_transactions DESC
More tables, like with Google Analytics:
Get latest (freshest date/table)
It is so META good.
SELECT max(table_id) as latest_available_day_tbl
FROM `bigquery-public-data.google_analytics_sample.__TABLES_SUMMARY__`
Using STRING_AGG(CONCAT( colname , ''))
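What it was used for isn't shown here, but the pattern itself is simple; a sketch reusing the JHU table from the other examples, collapsing all dates of a country into one comma-separated string:
select country_region
-- CONCAT(..., '') just turns the column into a string, mirroring the note above
,STRING_AGG(CONCAT(cast(date as string), ''), ', ' ORDER BY date) as all_dates
from `bigquery-public-data.covid19_jhu_csse.summary`
where upper(country_region) = 'HUNGARY'
group by country_region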
with check_countries as (
SELECT count(distinct country_region)
FROM `bigquery-public-data.covid19_jhu_csse.summary` LIMIT 1000
),
Hu_filter as (
select 'HU' as country, * from `bigquery-public-data.covid19_jhu_csse.summary`
where upper(country_region) = 'HUNGARY'
),
filtered as (
select date, confirmed, deaths, recovered, active
, LAG(confirmed, 1) OVER (PARTITION BY country ORDER BY date ASC) AS prev_day
from Hu_filter
WHERE date between '2020-03-01' and (select max(date) from `bigquery-public-data.covid19_jhu_csse.summary`)
)
select *,
confirmed - prev_day as new_cases
--,round(1-(deaths/confirmed),4) as survival_rate
from filtered
order by date asc
with check_countries as (
SELECT count(distinct country_region)
FROM `bigquery-public-data.covid19_jhu_csse.summary` LIMIT 1000
),
Ro_filter as (
select 'RO' as country, * from `bigquery-public-data.covid19_jhu_csse.summary`
where upper(country_region) = 'ROMANIA'
),
filtered as (
select date, confirmed, deaths, recovered, active
, LAG(confirmed, 1) OVER (PARTITION BY country ORDER BY date ASC) AS prev_day
from Ro_filter
WHERE date between '2021-01-01' and (select max(date) from `bigquery-public-data.covid19_jhu_csse.summary`)
)
select *,
confirmed - prev_day as new_cases
--,round(1-(deaths/confirmed),4) as survival_rate
from filtered
order by date asc
with prep as (
SELECT *
, LAG(confirmed_cases, 1)
OVER (PARTITION BY state_name ORDER BY date ASC) AS prev_day
FROM `bigquery-public-data.covid19_nyt.us_states`
where upper(state_name) = 'TEXAS'
and date >= '2021-03-01'
order by date asc
)
select date
, confirmed_cases - prev_day as new_cases --since the prev.day
from prep
order by date asc