Extract top blocked domain from pi-hole using bash So, I wanted to look around and see what are my top blocked domains on my local network. As far as I know, web interface of pi-hole is offering to just show you top N blocked domains. I wanted them all. So, once I figured out it is all in log files in `/var/log/`, it all boiled down to figure out how to detect blocked domains and then it was just one simple script: ``` cat pihole.log | grep "0.0.0.0$" | sed -e "s/.*\s\(.*\)\sis 0.0.0.0/\1/g" | sort | uniq -c | sort -nr ``` Output of the script for me looks like: 790 mobile.pipe.aria.microsoft.com 116 winatp-gw-eus.microsoft.com 52 us-v20.events.data.microsoft.com 50 tracker.grepler.com 50 ads.viber.com 47 v10c.events.data.microsoft.com 46 settings-win.data.microsoft.com 40 tracker.trackerfix.com 16 nexusrules.officeapps.live.com 6 browser.pipe.aria.microsoft.com 4 watson.telemetry.microsoft.com 3 v10.events.data.microsoft.com 2 nexus.officeapps.live.com 2 az416426.vo.msecnd.net 1 vortex.data.microsoft.com Yes, there are lot of Windows machines on my network:) But since I use ad-tracker, there you cannot see lot of ads blocked. Here is another example from another of mine pihole servers. This one is: * outside of my network, * has lot more of lists obtained from this [awesome aggregator](https://firebog.net/), * family members are not the only users here, and * is used in conjuction with openVPN, so I am covered on my mobile too So, here it is: 2516 mobile.pipe.aria.microsoft.com 54 app-measurement.com 31 reports.crashlytics.com 8 www.googletagmanager.com 8 www.google-analytics.com 4 www.googletagservices.com 4 static.chartbeat.com 4 js-agent.newrelic.com 4 googleads.g.doubleclick.net 4 dev.visualwebsiteoptimizer.com 3 sb.scorecardresearch.com 3 cdn.optimizely.com 2 static.doubleclick.net 2 settings.crashlytics.com 2 secure-us.imrworldwide.com 2 s.webtrends.com 2 realtime.services.disqus.com 2 pagead2.googlesyndication.com 2 logs-01.loggly.com 2 load.sumome.com 2 d1z2jf7jlzjs58.cloudfront.net 2 cdn.segment.io 2 c1.rfihub.net 2 c.amazon-adsystem.com 2 ads.mopub.com 1 www.zergnet.com 1 video.adaptv.advertising.com 1 stats.wp.com 1 ssl.google-analytics.com 1 secure.quantserve.com 1 s.skimresources.com 1 s.adroll.com 1 referrer.disqus.com 1 platform.tumblr.com 1 pixel.wp.com 1 p.typekit.net 1 nexusrules.officeapps.live.com 1 mads.amazon-adsystem.com 1 live.sekindo.com 1 hello.myfonts.net 1 experience.contextly.com 1 events.redditmedia.com 1 engine.adzerk.net 1 device-metrics-us.amazon.com 1 d3ezl4ajpp2zy8.cloudfront.net 1 cx.atdmt.com 1 collector-medium.lightstep.com 1 cdn.simpleanalytics.io 1 cdn.onesignal.com 1 api.branch.io 1 ads.adthrive.com 1 ad-delivery.net Stay safe online, friends;) tags: bash,bash-magic,pihole,pi-hole