// IT Infrastructure Engineering Blog

Real-world guides for
engineers who run
production systems

Deep tutorials and sharp quick-tips on Linux, Cloud, Kubernetes, Docker, and Networking — written by someone who actually does this for a living.

4+ Flagship Articles · 100% Production-Tested · 3 Affiliate Partners
// Featured Post
// Recommended — Affiliate Partners

Hosting, SSL & Domain Registration

Tools I recommend to every engineer deploying projects. Clicking through supports this blog at zero extra cost to you.

Web Hosting
Shared hosting with Namecheap!
🔒 Save 40–70% on SSL Certificates

DV, OV, EV & Wildcard certs. Secure every domain, API endpoint, and service you run.

Get SSL Certificate →
* Affiliate links — we may earn a small commission at no extra cost to you.
Linux & Shell
🐧
How to Harden a Linux Server (Ubuntu 22.04): Complete Checklist
SSH lockdown, UFW, fail2ban, sysctl hardening, auditd, and a full automated hardening script.
Deep Dive · 20 min
⚙️
Writing Bulletproof Shell Scripts: Error Handling Patterns
set -euo pipefail, traps, signal handling, and logging patterns that make automation scripts production-safe.
Deep Dive · 12 min
🔍
Linux Performance Troubleshooting: CPU, RAM, Disk, Network
A systematic approach to diagnosing slow servers using top, vmstat, iostat, and ss.
Deep Dive · 18 min
Cloud — AWS / GCP / Azure
☁️
Cut Your AWS Bill by 40%: Spot Instances, Savings Plans & Rightsizing
A systematic approach to AWS cost optimization across EC2, S3, RDS, and data transfer — with real CLI commands.
Deep Dive · 22 min
📊
GCP Cloud Monitoring: Set Up Alerting Policies in Under an Hour
Uptime checks, VM CPU alerts, and Pub/Sub notification channels — configured via console with screenshots.
Quick Tips · 7 min
🔐
AWS IAM: The Least Privilege Model with Real Policy Examples
Stop using AdministratorAccess. Build scoped IAM policies, roles, and permission boundaries.
Deep Dive · 16 min
Kubernetes & Docker
☸️
Kubernetes Cluster Hardening: 12 Security Controls Every Engineer Must Implement
RBAC, network policies, pod security, secrets encryption, and audit logging with real YAML you can copy directly.
Deep Dive · 14 min
🐳
Docker Networking Deep Dive: Bridge, Host, Overlay & Macvlan
Every Docker network driver explained with architecture diagrams, real commands, and production design patterns.
Deep Dive · 18 min
🔐
RBAC in Kubernetes: Build a Least-Privilege Access Model From Scratch
ServiceAccounts, Roles, ClusterRoles, RoleBindings — explained with diagrams and working YAML manifests.
Deep Dive · 19 min
// Why theDataShark

Written by an engineer,
for engineers

Every command, script, and technique has been validated on live infrastructure — not just a tutorial server.

🛠️
Production-Ready
Every guide is validated on real production systems before it's published here.
Learn Fast
Dense, structured content. No padding — get what you need and apply it the same day.
🔄
Always Current
Articles are updated when tools change. The cheat sheets and commands stay accurate.
📥
Free & Paid Resources
Most content is free. Premium eBooks and courses go even deeper for serious engineers.
// What Readers Say

Real Feedback

★★★★★

"The Linux hardening guide alone saved me hours. I went from a bare VPS to a properly secured server in one afternoon."

RK
Rahul K.
Systems Engineer, Bengaluru
★★★★★

"The AWS cost guide helped us cut our monthly bill by ₹40,000. The rightsizing section alone paid for itself immediately."

AP
Anjali P.
Cloud Infra Analyst, Pune
★★★★★

"The Docker networking post is the clearest explanation of overlay vs macvlan I've ever read. Bookmarked permanently."

SM
Suresh M.
DevOps Engineer, Hyderabad
// Free Resource

Get the Free Linux Quick-Start Guide

50 essential commands for IT infrastructure engineers. No spam — just useful content, occasionally.


No spam. Unsubscribe anytime. Used by 500+ engineers.

// Digital Products

eBooks & Courses

Everything written from real production experience — not copy-pasted from docs. Buy once, keep forever.

🐧
eBook · PDF
Linux Mastery for IT Infrastructure Engineers
From shell basics to production automation. Filesystem, process management, networking, scripting, permissions — with a 50-command cheat sheet.
₹499 / one-time
NEW
☁️
eBook · PDF
Cloud Monitoring & Alerting from Scratch
Prometheus, Grafana, CloudWatch alerts, PagerDuty integration, and runbook design for AWS and GCP environments.
₹699 / one-time
PDF
Video Course
Shell Scripting & Automation Bootcamp
Build real automation scripts from scratch — disk alerts, log rotation, health checks, deployment scripts, all production-tested.
₹999 / one-time
COURSE
🌐
Bundle · Best Value
Complete IT Infra Bundle
All three guides + course + future updates. Linux eBook + Cloud Monitoring + Shell Scripting + bonus cheat sheets included.
₹1,799 / bundle
SAVE 17%
// Recommended — Hosting & Security

Host Your Projects & Secure Them

Everything you need to go from a local project to a live, secured deployment.

Web Hosting
Namecheap
🔒 Save 40–70% on SSL Certs

DV, OV, EV & Wildcard certs for every domain and subdomain you own.

Get SSL Certificate →
* Affiliate links — we earn a small commission at no extra cost to you.
Home / Linux Server Hardening
Linux Security January 6, 2025 · 20 min read

How to Harden a Linux Server (Ubuntu 22.04): The Complete Security Checklist

A production-tested, step-by-step guide to locking down a fresh Ubuntu 22.04 server. SSH lockdown, UFW firewall rules, fail2ban brute-force protection, automatic security updates, sysctl kernel hardening, audit logging, and a fully automated script you can run in under 10 minutes.

Why Server Hardening Cannot Be Optional

In 2024, automated bots scan the entire IPv4 address space continuously. A freshly provisioned cloud VPS with SSH exposed on port 22 will see login attempts within minutes of boot. Within 24 hours, thousands of brute-force attempts are normal.

Default Ubuntu installations are not insecure by design — but they are permissive by default. Every unnecessary open port, every weak configuration, every unused service is an attack surface. This guide covers every critical hardening step with real commands and explanations for each decision.

⚠ Work on a staging server first. SSH hardening steps, if done incorrectly, can lock you out permanently. Always keep a console session open while making SSH changes.

Step 1 — Initial System Update & User Setup

bash
# Update all packages and install security essentials
sudo apt update && sudo apt upgrade -y
sudo apt install -y ufw fail2ban unattended-upgrades auditd \
  audispd-plugins libpam-pwquality curl vim

# Create dedicated non-root admin user
sudo adduser sysadmin
sudo usermod -aG sudo sysadmin

# Verify sudo access
su - sysadmin
sudo whoami   # should return: root

Step 2 — SSH Hardening

SSH is the primary attack vector. Set up key authentication first, then lock down /etc/ssh/sshd_config.

bash — local machine
# Generate an Ed25519 key pair on your LOCAL machine
ssh-keygen -t ed25519 -C "admin@thedatashark.com" -f ~/.ssh/shark_ed25519

# Copy public key to server
ssh-copy-id -i ~/.ssh/shark_ed25519.pub sysadmin@YOUR_SERVER_IP

# Test BEFORE changing any SSH settings
ssh -i ~/.ssh/shark_ed25519 sysadmin@YOUR_SERVER_IP
sshd_config
# /etc/ssh/sshd_config.d/99-hardening.conf
Port                    2222
PermitRootLogin         no
PasswordAuthentication  no
PubkeyAuthentication    yes
AuthenticationMethods   publickey
AllowUsers              sysadmin
MaxAuthTries            3
ClientAliveInterval     300
ClientAliveCountMax     2
X11Forwarding           no
AllowTcpForwarding      no
LogLevel                VERBOSE
Ciphers   chacha20-poly1305@openssh.com,aes256-gcm@openssh.com
MACs      hmac-sha2-512-etm@openssh.com,hmac-sha2-256-etm@openssh.com
bash
# Validate config before restarting
sudo sshd -t

# Restart (keep current session open!)
sudo systemctl restart sshd

# Test new connection from a SECOND terminal before closing current one
ssh -i ~/.ssh/shark_ed25519 -p 2222 sysadmin@YOUR_SERVER_IP
⚠ Critical: Open a second terminal and test the new connection before closing your existing session. If it fails, you still have the old session to fix it.

Step 3 — Firewall Configuration with UFW

Default deny everything, then explicitly allow only what you need.

bash
# Set default policies
sudo ufw default deny incoming
sudo ufw default allow outgoing

# Allow SSH on the new port FIRST
sudo ufw allow 2222/tcp comment 'SSH hardened port'
sudo ufw allow 80/tcp   comment 'HTTP'
sudo ufw allow 443/tcp  comment 'HTTPS'

# Enable rate limiting on SSH
sudo ufw limit 2222/tcp

# Enable — do not run this before allowing SSH!
sudo ufw enable
sudo ufw status verbose

Step 4 — Brute-Force Protection with fail2ban

ini — /etc/fail2ban/jail.d/ssh-hardened.conf
[sshd]
enabled  = true
port     = 2222
maxretry = 3
bantime  = 86400   # 24-hour ban
findtime = 600
backend  = systemd
bash
sudo systemctl restart fail2ban && sudo systemctl enable fail2ban
sudo fail2ban-client status sshd

# View current bans
sudo fail2ban-client get sshd banip

# Unban yourself if locked out
sudo fail2ban-client set sshd unbanip YOUR_IP
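The jail settings interact: a ban fires only when maxretry failures land inside a single findtime window. A toy sketch of that logic (hypothetical timestamps — not fail2ban's actual implementation):

```shell
# Toy model of the fail2ban window — not the real implementation.
findtime=600                 # seconds
t1=100; t2=250; t3=650       # hypothetical failure timestamps (s)
banned=no
# 3rd failure minus 1st failure must fit inside the window
if [ $(( t3 - t1 )) -le "$findtime" ]; then
  banned=yes                 # 650 - 100 = 550s ≤ 600s → ban
fi
echo "banned=$banned"
```

With findtime=600 and maxretry=3, three failures spread over 550 seconds trigger the 24-hour ban; the same three failures spread over 15 minutes would not.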

Step 5 — Automatic Security Updates

conf — /etc/apt/apt.conf.d/20auto-upgrades
APT::Periodic::Update-Package-Lists "1";
APT::Periodic::Download-Upgradeable-Packages "1";
APT::Periodic::AutocleanInterval "7";
APT::Periodic::Unattended-Upgrade "1";

Step 6 — Kernel Parameter Hardening (sysctl)

conf — /etc/sysctl.d/99-hardening.conf
# Network hardening
net.ipv4.ip_forward=0
net.ipv4.conf.all.send_redirects=0
net.ipv4.conf.all.accept_redirects=0
net.ipv4.tcp_syncookies=1
net.ipv4.icmp_echo_ignore_broadcasts=1
net.ipv4.conf.all.rp_filter=1

# Kernel hardening
kernel.dmesg_restrict=1
kernel.kptr_restrict=2
kernel.yama.ptrace_scope=1
kernel.sysrq=0

# Filesystem hardening
fs.protected_hardlinks=1
fs.protected_symlinks=1
fs.suid_dumpable=0
bash
sudo sysctl -p /etc/sysctl.d/99-hardening.conf

Step 7 — Audit Logging with auditd

conf — /etc/audit/rules.d/hardening.rules
# Monitor authentication files
-w /etc/passwd  -p wa -k identity
-w /etc/shadow  -p wa -k identity
-w /etc/sudoers -p wa -k sudoers

# Monitor SSH config changes
-w /etc/ssh/sshd_config -p wa -k ssh_config

# Log all sudo commands
-a always,exit -F arch=b64 -S execve -F euid=0 -F auid>=1000 -F auid!=-1 -k root_commands

# Make audit rules immutable (requires reboot to change)
-e 2
bash
sudo augenrules --load
sudo auditctl -s
sudo ausearch -k identity --start today

Step 8 — Disable Unnecessary Services

bash
# List all enabled services
sudo systemctl list-unit-files --state=enabled --type=service

# Disable services not needed on a server
sudo systemctl disable --now avahi-daemon   # mDNS
sudo systemctl disable --now cups           # Printing
sudo systemctl disable --now bluetooth      # Bluetooth
sudo systemctl disable --now whoopsie       # Crash reporting

# Check what's listening on network ports
sudo ss -tulnp

Bonus — Full Automated Hardening Script

✓ How to use: Save as harden.sh, review and edit the VARIABLES section at the top, then run sudo bash harden.sh. Every action is logged to /var/log/harden-DATE.log.
bash — harden.sh
#!/bin/bash
# harden.sh — Ubuntu 22.04 Server Hardening Script
# theDataShark.com — Review VARIABLES before running!
set -euo pipefail

ADMIN_USER="sysadmin"
SSH_PORT="2222"
LOG_FILE="/var/log/harden-$(date +%Y%m%d-%H%M%S).log"
exec > >(tee -a "$LOG_FILE") 2>&1

log() { echo "[$(date '+%Y-%m-%d %H:%M:%S')] $*"; }
log "=== theDataShark Hardening Script ==="

## 1. Update
apt update -q && apt upgrade -y -q
apt install -y -q ufw fail2ban unattended-upgrades auditd

## 2. UFW
ufw --force reset
ufw default deny incoming; ufw default allow outgoing
ufw allow "${SSH_PORT}/tcp" comment "SSH"
ufw allow 80/tcp; ufw allow 443/tcp
ufw limit "${SSH_PORT}/tcp"
ufw --force enable
log "UFW enabled."

## 3. SSH
cp /etc/ssh/sshd_config "/etc/ssh/sshd_config.bak.$(date +%Y%m%d)"
cat > /etc/ssh/sshd_config.d/99-hardening.conf <<EOF
Port                    ${SSH_PORT}
PermitRootLogin         no
PasswordAuthentication  no
PubkeyAuthentication    yes
AllowUsers              ${ADMIN_USER}
MaxAuthTries            3
EOF
sshd -t && systemctl restart sshd
log "SSH hardened on port ${SSH_PORT}."

## 4. fail2ban
cat > /etc/fail2ban/jail.d/ssh-hardened.conf <<EOF
[sshd]
enabled  = true
port     = ${SSH_PORT}
maxretry = 3
bantime  = 86400
findtime = 600
backend  = systemd
EOF
systemctl enable --now fail2ban
log "fail2ban active."

## 5. sysctl
cat > /etc/sysctl.d/99-hardening.conf <<'EOF'
net.ipv4.tcp_syncookies=1
net.ipv4.conf.all.rp_filter=1
kernel.dmesg_restrict=1
kernel.kptr_restrict=2
fs.protected_symlinks=1
EOF
sysctl -p /etc/sysctl.d/99-hardening.conf

## 6. Auto updates
cat > /etc/apt/apt.conf.d/20auto-upgrades <<'EOF'
APT::Periodic::Update-Package-Lists "1";
APT::Periodic::Unattended-Upgrade "1";
EOF

log "=== Hardening complete. Log: ${LOG_FILE} ==="

Quick Reference Checklist

// Ubuntu 22.04 Server Hardening Checklist
System fully updated — apt update && apt upgrade
Non-root admin user created with sudo access
SSH key pair generated and copied to server
SSH port changed from 22 to custom port
Root SSH login disabled — PermitRootLogin no
Password authentication disabled
UFW enabled with default deny incoming
UFW rate limiting on SSH port enabled
fail2ban installed, configured, and active
Unattended security upgrades enabled
sysctl hardening parameters applied
auditd installed and audit rules loaded
Unnecessary services disabled

Wrapping Up

Server hardening is not a one-time task. Run sudo lynis audit system periodically to get a scored security report. Aim for 70+ on a standard web server. The steps in this guide stop the vast majority of automated attacks before they even get started.

→ Want this as a downloadable PDF checklist? The Complete IT Infra Bundle includes this hardening guide, the Kubernetes security checklist, and a Bash scripting workbook in clean PDF format.
Home / AWS Cost Optimization
AWS Cost Optimization January 14, 2025 · 22 min read

Cut Your AWS Bill by 40%: Spot Instances, Savings Plans, Rightsizing & More

AWS bills spiral quickly if you're not actively managing spend. This guide walks through a systematic approach to eliminating waste across EC2, S3, RDS, and data transfer — with real CLI commands, cost analysis techniques, and proven strategies that don't compromise reliability.

// Typical Savings by Strategy
What these strategies save — real-world estimates:
  • Spot Instances — 60–90%
  • Savings Plans — 30–72%
  • EC2 Rightsizing — 20–40%
  • S3 Tiering — 50–70%
  • Stop Non-Prod — 65–70%
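The "Stop Non-Prod" figure is just calendar arithmetic — a dev box that runs only business hours on weekdays is off for roughly two-thirds of the week:

```shell
# A dev instance running 11h/day, Mon–Fri, vs. 24/7 (168h/week)
hours_on=$(( 11 * 5 ))            # 55 billed hours per week
savings=$(awk -v on="$hours_on" \
  'BEGIN { printf "%.1f", (1 - on / 168) * 100 }')
echo "Stopping nights and weekends saves ${savings}% of on-demand cost"
```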

The Right Mindset for AWS Cost Optimization

AWS cost optimization is not about being cheap. It's about paying for what you actually use, at the right commitment level, with the right pricing model. The difference between a $5,000/month bill and a $3,000/month bill for the same infrastructure is almost entirely down to matching your purchasing model to your actual usage patterns.

→ Before you start: You'll need AWS CLI configured (aws configure) and IAM permissions for Cost Explorer, EC2, S3, and RDS. Most commands use the AWS CLI — install and configure it first.

Step 1 — Get Visibility: Cost Explorer & Budget Alerts

bash — AWS CLI
# Cost breakdown by service for last month
aws ce get-cost-and-usage \
  --time-period Start=$(date -d "last month" +%Y-%m-01),End=$(date +%Y-%m-01) \
  --granularity MONTHLY \
  --metrics "BlendedCost" \
  --group-by Type=DIMENSION,Key=SERVICE \
  --query 'ResultsByTime[].Groups[].{Service:Keys[0],Cost:Metrics.BlendedCost.Amount}' \
  --output table

# Create a $500/month budget with 80% alert
aws budgets create-budget \
  --account-id $(aws sts get-caller-identity --query Account --output text) \
  --budget '{"BudgetName":"MonthlySpend","BudgetLimit":{"Amount":"500","Unit":"USD"},"TimeUnit":"MONTHLY","BudgetType":"COST"}' \
  --notifications-with-subscribers '[{"Notification":{"NotificationType":"ACTUAL","ComparisonOperator":"GREATER_THAN","Threshold":80},"Subscribers":[{"SubscriptionType":"EMAIL","Address":"you@thedatashark.com"}]}]'
✓ Pro Tip: Tag everything in AWS with at minimum Environment (prod/staging/dev) and Project. Cost allocation tags let you break down spend by project in Cost Explorer — without them, you're flying blind.

Step 2 — EC2 Rightsizing with Compute Optimizer

bash
# Opt in to Compute Optimizer (free)
aws compute-optimizer update-enrollment-status --status Active

# Get rightsizing recommendations
aws compute-optimizer get-ec2-instance-recommendations \
  --query 'instanceRecommendations[].{
    Instance:instanceArn,
    Current:currentInstanceType,
    Finding:finding,
    Recommended:recommendationOptions[0].instanceType,
    MonthlySavings:recommendationOptions[0].estimatedMonthlySavings.value
  }' \
  --output table

# Check average CPU utilisation over 30 days
aws cloudwatch get-metric-statistics \
  --namespace AWS/EC2 \
  --metric-name CPUUtilization \
  --dimensions Name=InstanceId,Value=i-0123456789abcdef0 \
  --start-time $(date -d "30 days ago" --utc +%FT%TZ) \
  --end-time $(date --utc +%FT%TZ) \
  --period 86400 --statistics Average Maximum \
  --query 'sort_by(Datapoints,&Timestamp)[].{Date:Timestamp,Avg:Average,Max:Maximum}' \
  --output table
Average CPU % | Recommendation | Typical Saving
< 5% | Downsize 2 sizes or switch to t-series burstable | 40–60%
5–20% | Downsize 1 instance size | 20–40%
20–60% | Current size appropriate — focus on pricing model | 0% from resize
> 80% | Consider upsize or horizontal scaling | —

Step 3 — Spot Instances: 60–90% Cheaper

Spot Instances use spare EC2 capacity at up to 90% discount. AWS can reclaim them with two minutes' notice. Ideal for: batch processing, CI/CD build agents, stateless web tiers, and dev/test environments.

bash
# Check current Spot vs On-Demand pricing
aws ec2 describe-spot-price-history \
  --instance-types m5.xlarge \
  --product-descriptions "Linux/UNIX" \
  --start-time $(date --utc +%FT%TZ) \
  --query 'SpotPriceHistory[].{AZ:AvailabilityZone,Price:SpotPrice}' \
  --output table

# Launch Spot with interruption handling
aws ec2 run-instances \
  --image-id ami-0abcdef1234567890 \
  --instance-type m5.xlarge \
  --instance-market-options '{"MarketType":"spot","SpotOptions":{"SpotInstanceType":"one-time","InstanceInterruptionBehavior":"terminate"}}' \
  --tag-specifications 'ResourceType=instance,Tags=[{Key=Name,Value=spot-worker}]'
✓ Best practice: Use EC2 Auto Scaling with mixed instances policy rather than individual Spot requests. Configure at least 3–5 instance types across 2–3 AZs to maximise availability and minimise interruptions.

Step 4 — Savings Plans & Reserved Instances

Plan Type | Flexibility | Max Discount | Best For
Compute Savings Plan | Any EC2/Lambda/Fargate | 66% | Most workloads
EC2 Instance Savings Plan | Instance family in 1 region | 72% | Stable EC2
Standard Reserved Instance | Exact type/region | 72% | Known stable workloads
Convertible RI | Can change family | 54% | Moderate flexibility
✓ Purchase strategy: Start with a 1-year Compute Savings Plan at no upfront. Cover only your baseline — keep 20–30% on-demand or Spot for variable load. You can always buy more later.
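Following that strategy, the commitment size falls out of your usage floor. A sketch with hypothetical numbers — commit against the minimum hourly compute spend observed over the last 30 days, never the average:

```shell
# Hypothetical figures — read your own floor from Cost Explorer
min_hourly=3.20     # lowest $/hr compute spend seen in 30 days
coverage=0.80       # commit to ~80% of that floor
commit=$(awk -v m="$min_hourly" -v c="$coverage" \
  'BEGIN { printf "%.2f", m * c }')
echo "Buy a Savings Plan with a \$${commit}/hr commitment"
```

Anything above the committed floor still runs on-demand or Spot, so a temporary drop in usage never leaves you paying for idle commitment.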
// Recommended — Domain & Hosting

Deploy Your Cloud Projects with a Real Domain

Every production deployment needs a domain and proper SSL. These are the services we use.

Dynadot
Namecheap
🔒 Save 40–70% on SSL Certs

Secure every endpoint behind your load balancer. DV, Wildcard, OV.

Get SSL →
* Affiliate links — small commission at no extra cost to you.

Step 5 — S3 Cost Optimization: Storage Classes & Lifecycle Policies

json — lifecycle-policy.json
{
  "Rules": [{
    "ID": "auto-tiering",
    "Status": "Enabled",
    "Filter": { "Prefix": "" },
    "Transitions": [
      { "Days": 30,  "StorageClass": "STANDARD_IA" },
      { "Days": 90,  "StorageClass": "INTELLIGENT_TIERING" },
      { "Days": 365, "StorageClass": "GLACIER_IR" }
    ],
    "NoncurrentVersionExpiration": { "NoncurrentDays": 90 },
    "AbortIncompleteMultipartUpload": { "DaysAfterInitiation": 7 }
  }]
}
bash
aws s3api put-bucket-lifecycle-configuration \
  --bucket your-bucket-name \
  --lifecycle-configuration file://lifecycle-policy.json
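To see why the lifecycle policy matters, compare per-tier monthly cost for the same terabyte. The $/GB-month figures below are assumed us-east-1 list prices at the time of writing — verify against current S3 pricing:

```shell
# Assumed $/GB-month: Standard 0.023, Standard-IA 0.0125,
# Glacier Instant Retrieval 0.004 (check current pricing)
gb=1024   # 1 TB
std=$(awk -v g="$gb" 'BEGIN { printf "%.2f", g * 0.023 }')
ia=$(awk  -v g="$gb" 'BEGIN { printf "%.2f", g * 0.0125 }')
gir=$(awk -v g="$gb" 'BEGIN { printf "%.2f", g * 0.004 }')
printf 'STANDARD: $%s  STANDARD_IA: $%s  GLACIER_IR: $%s\n' \
  "$std" "$ia" "$gir"
```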

Step 6 — RDS Cost Cuts

  • Use Aurora Serverless v2 for dev/staging — scales to 0.5 ACUs when idle
  • Stop non-production instances outside working hours using EventBridge + Lambda
  • Purchase RDS Reserved Instances for 24/7 production databases — 30–60% discount
  • Migrate gp2 EBS to gp3 — same performance, 20% cheaper, takes 2 CLI commands
  • Enable storage auto-scaling with a max limit rather than provisioning the maximum upfront

Step 7 — Data Transfer: The Hidden Cost Nobody Talks About

Transferring data into AWS is free. Transferring data out to the internet costs $0.09/GB. Cross-region is $0.02/GB. Key strategies:

  • Use CloudFront — egress pricing is cheaper, and it caches content globally
  • Keep traffic in the same AZ — cross-AZ costs $0.01/GB each way
  • Use VPC Endpoints for S3 and DynamoDB — eliminates NAT Gateway processing costs
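Egress cost scales linearly, so a quick back-of-envelope check is worth running before architecting around it (rates from the paragraph above):

```shell
gb_out=500          # GB served to the internet per month
cost=$(awk -v g="$gb_out" 'BEGIN { printf "%.2f", g * 0.09 }')
echo "~\$${cost}/month in internet egress at \$0.09/GB"
```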

Step 8 — Automate Stop-Start Schedules

bash — EventBridge stop/start rules
# Stop dev instances at 8pm IST (14:30 UTC) Mon-Fri
aws events put-rule \
  --name "StopDevInstances" \
  --schedule-expression "cron(30 14 ? * MON-FRI *)" \
  --state ENABLED

# Start dev instances at 9am IST (03:30 UTC) Mon-Fri
aws events put-rule \
  --name "StartDevInstances" \
  --schedule-expression "cron(30 3 ? * MON-FRI *)" \
  --state ENABLED

# Quick manual stop of all dev instances
aws ec2 describe-instances \
  --filters "Name=tag:Environment,Values=dev" "Name=instance-state-name,Values=running" \
  --query 'Reservations[].Instances[].InstanceId' --output text | \
  xargs aws ec2 stop-instances --instance-ids

AWS Cost Optimization Checklist

Action | Service | Typical Impact | Effort
Enable Cost Explorer + budget alerts | All | Visibility | Low
Tag all resources | All | Visibility | Low
Enable Compute Optimizer | EC2 | Sizing data | Low
Downsize instances <10% CPU avg | EC2 | 20–60% | Medium
Move batch/CI to Spot | EC2 | 60–90% | Medium
Buy 1-year Compute Savings Plan | EC2/Lambda | 30–66% | Low
Add S3 lifecycle policies | S3 | 20–70% | Low
Add VPC Endpoints for S3/DynamoDB | Data transfer | 10–30% | Low
Stop non-prod nights/weekends | EC2/RDS | 65–70% | Medium
Migrate gp2 EBS → gp3 | EBS | 20% | Low
Delete unattached EBS + old snapshots | EBS | Variable | Low

Wrapping Up

A methodical sweep through these strategies routinely delivers 30–50% savings with no degradation in performance or reliability. Start with Cost Explorer and Compute Optimizer — both free. Let the data tell you where money is going before committing to purchasing changes.

→ Need more? The Cloud Monitoring & Alerting eBook includes a dedicated chapter on AWS cost alerting, Savings Plans analysis, and automated cost anomaly detection.
Home / Docker Networking Deep Dive
Docker Networking January 22, 2025 · 18 min read

Docker Networking Deep Dive: Bridge, Host, Overlay, Macvlan & Custom Networks Explained

Most engineers use Docker networks without understanding how they work — and that leads to debugging sessions that take hours. This guide covers every Docker network driver, explains container DNS resolution, and walks through production network design patterns with working examples.

How Docker Networking Actually Works

Docker networking is built on three Linux kernel primitives: network namespaces, virtual Ethernet pairs (veth), and Linux bridges. When you start a container, Docker creates a new network namespace for it. This gives the container its own isolated network stack — its own routing table, interfaces, and iptables rules.

// Docker Default Bridge Architecture
  HOST
  ┌──────────────────────────────────────────────────┐
  │  ┌─────────────┐    ┌─────────────┐              │
  │  │ Container A │    │ Container B │              │
  │  │ eth0        │    │ eth0        │              │
  │  │ 172.17.0.2  │    │ 172.17.0.3  │              │
  │  └──────┬──────┘    └──────┬──────┘              │
  │         │ veth pair        │ veth pair           │
  │  ┌──────┴──────────────────┴──────┐              │
  │  │       docker0 bridge           │              │
  │  │       172.17.0.1/16            │              │
  │  └───────────────┬────────────────┘              │
  │                  │ iptables NAT / MASQUERADE     │
  │  ┌───────────────┴────────────────┐              │
  │  │    eth0 (host NIC)  192.168.x  │              │
  │  └────────────────────────────────┘              │
  └──────────────────────────────────────────────────┘
bash
# See all Docker networks
docker network ls

# Inspect bridge network details
docker network inspect bridge

# See Docker iptables NAT rules
sudo iptables -t nat -L DOCKER -n --line-numbers

Bridge Network — The Default

The bridge driver is Docker's default. Containers on the default bridge can communicate by IP but not by container name — this is the most commonly misunderstood limitation.

bash
docker run -d --name web nginx:alpine
docker run -d --name db  -e POSTGRES_PASSWORD=secret postgres:15-alpine

# This FAILS on the default bridge — no DNS by name
docker exec web ping -c2 db
# ping: bad address 'db'

# IP works — but hardcoding IPs in production is wrong
docker inspect db --format '{{.NetworkSettings.IPAddress}}'
⚠ Don't use the default bridge in production. No DNS by name, no network isolation between unrelated containers. Always use custom named networks.

Custom Bridge Networks — The Right Way

Custom bridge networks provide automatic DNS resolution by container name, network-level isolation, and dynamic connect/disconnect.

bash
# Create a custom bridge network
docker network create \
  --driver bridge \
  --subnet 192.168.10.0/24 \
  --gateway 192.168.10.1 \
  app-network

# Start containers on the custom network
docker run -d --name web --network app-network nginx:alpine
docker run -d --name db  --network app-network \
  -e POSTGRES_PASSWORD=secret postgres:15-alpine

# DNS by name works automatically
docker exec web ping -c2 db
# PING db (192.168.10.3) — success!

# Connect an existing container to a second network
docker network connect monitoring-net web
// Recommended — Deploy Your Docker Apps

Get a Domain + SSL for Your Dockerized Projects

Once your containers are running, you need a domain and HTTPS in front of your Nginx reverse proxy.

Dynadot
Namecheap
🔒 Save 40–70% on SSL Certs

Add TLS termination to your Nginx proxy. Wildcard certs cover all subdomains.

Get SSL →
* Affiliate links — small commission at no extra cost to you.

Host Network — Maximum Performance

Removes all network isolation. The container shares the host's network stack — no veth, no bridge, no NAT overhead. Use for: Prometheus node_exporter, Datadog agent, high-throughput brokers.

bash
# No -p port mapping needed — binds directly to host port 80
docker run -d --network host --name nginx-host nginx:alpine
curl http://localhost:80   # connects directly — zero NAT overhead
Aspect | Bridge | Host
Network isolation | ✓ Full isolation | ✗ No isolation
Port mapping required | Yes (-p 80:80) | No — direct
Performance overhead | Small (NAT) | None
Use case | Most apps | Monitoring agents, high-throughput

Overlay Network — Multi-Host Communication

Allows containers on different Docker hosts to communicate as if on the same network. Uses VXLAN encapsulation. Requires Docker Swarm mode.

// Overlay Network Architecture
  Host 1 (192.168.1.10)        Host 2 (192.168.1.11)
  ┌─────────────────────┐      ┌─────────────────────┐
  │  ┌───────────────┐  │      │  ┌───────────────┐  │
  │  │  Container A  │  │ VXLAN│  │  Container B  │  │
  │  │  10.0.0.3     │  │◄────►│  │  10.0.0.4     │  │
  │  └───────┬───────┘  │ UDP  │  └───────┬───────┘  │
  │  [overlay bridge]   │ 4789 │  [overlay bridge]   │
  └─────────────────────┘      └─────────────────────┘
bash — Swarm required
docker swarm init --advertise-addr 192.168.1.10

docker network create \
  --driver overlay \
  --subnet 10.0.0.0/24 \
  --attachable \
  app-overlay

docker service create --name web --network app-overlay --replicas 3 -p 80:80 nginx:alpine

Macvlan & IPvlan — Direct L2 Network Access

Macvlan assigns a real MAC address to each container, making it visible on your physical network. Routers see containers as standalone devices. Ideal for network appliances, legacy apps, Pi-hole DNS.

bash
# Enable promiscuous mode on host NIC
sudo ip link set eth0 promisc on

# Create macvlan network
docker network create \
  --driver macvlan \
  --subnet 192.168.1.0/24 \
  --gateway 192.168.1.1 \
  --ip-range 192.168.1.192/27 \
  --opt parent=eth0 \
  macvlan-net

# Container gets a real IP on your LAN
docker run -d --name pihole --network macvlan-net \
  --ip 192.168.1.200 pihole/pihole:latest

# Reachable from any device on your network
ping 192.168.1.200

None Network — Full Isolation

bash
# Container gets only loopback — no network access
docker run --network none --rm alpine ip addr show
# Only lo (127.0.0.1) — completely isolated

Container DNS Resolution Explained

DNS in custom networks is handled by Docker's embedded DNS server at 127.0.0.11. Every container on a custom network uses this as its resolver. It resolves container names, service names, and network aliases — falling back to the host's DNS for external queries.

bash
# Check container's DNS config
docker exec web cat /etc/resolv.conf
# nameserver 127.0.0.11

# Add network aliases — multiple names for one container
docker run -d --name primary-db \
  --network app-network \
  --network-alias database \
  --network-alias db \
  postgres:15-alpine

# DNS round-robin — run 3 containers with same alias
for i in 1 2 3; do
  docker run -d --name "api-$i" --network app-network --network-alias api nginx:alpine
done
docker run --rm --network app-network alpine nslookup api
# Returns all 3 IPs — basic load balancing via DNS

Docker Compose Networking Patterns

yaml — Multi-tier isolation pattern
services:
  # Public-facing proxy — on public-net only
  nginx:
    image: nginx:alpine
    ports: ["80:80", "443:443"]
    networks: [public-net]

  # App bridges both networks
  app:
    image: myapp:latest
    networks: [public-net, private-net]
    environment:
      - DB_HOST=postgres

  # Database — private-net only, NOT reachable from nginx
  postgres:
    image: postgres:15-alpine
    networks: [private-net]

  redis:
    image: redis:7-alpine
    networks: [private-net]

networks:
  public-net:
    driver: bridge
  private-net:
    driver: bridge
    internal: true  # No outbound internet
✓ The internal: true flag prevents outbound internet access from containers on that network. Databases should almost never have direct internet access.

Network Troubleshooting Reference

bash — Troubleshooting Toolkit
## 1. Which networks is the container on?
docker inspect CONTAINER --format '{{json .NetworkSettings.Networks}}' | python3 -m json.tool

## 2. Check IP, gateway, DNS
docker exec CONTAINER ip addr show
docker exec CONTAINER ip route show
docker exec CONTAINER cat /etc/resolv.conf

## 3. Test DNS resolution
docker exec CONTAINER nslookup OTHER_CONTAINER
docker exec CONTAINER nslookup google.com

## 4. Test TCP connectivity
docker exec CONTAINER nc -zv OTHER_CONTAINER 5432

## 5. Run a debug container (netshoot has everything)
docker run --rm -it \
  --network container:PROBLEM_CONTAINER \
  nicolaka/netshoot bash
# netshoot includes: tcpdump, nmap, curl, dig, ss, iperf3 and more

## 6. Capture traffic between containers
docker run --rm -it \
  --network container:CONTAINER_NAME \
  --cap-add NET_ADMIN \
  nicolaka/netshoot \
  tcpdump -i eth0 -nn
Symptom | Likely Cause | Check
Can't resolve names | On default bridge, not custom | cat /etc/resolv.conf — must show 127.0.0.11
Name resolves, connection refused | Service not listening or wrong port | ss -tulnp inside container
No internet from container | iptables MASQUERADE missing or internal: true | iptables -t nat -L POSTROUTING
Port not reachable from host | Port not published with -p | docker port CONTAINER
Containers on different networks can't talk | Expected — networks are isolated | Connect both to a shared network

Production Network Design Patterns

Pattern 1 — Frontend/Backend segmentation: Only the reverse proxy connects to both public and private networks. Databases are completely hidden from the internet. This is the Compose example above — use it as your default.

Pattern 2 — Per-application isolated networks: Each app gets its own network. A compromised container in App A cannot reach App B's database. Create one bridge network per application on multi-tenant servers.

Pattern 3 — Shared monitoring network: Connect Prometheus to all application networks so it can scrape metrics from every container — without those containers having access to each other.

bash — Per-app isolation
docker network create app1-net
docker network create app2-net

# App1 and App2 are completely isolated
docker run -d --name app1-web --network app1-net nginx:alpine
docker run -d --name app2-web --network app2-net nginx:alpine

# Connect Prometheus to both (monitoring access without cross-app access)
docker network connect app1-net prometheus
docker network connect app2-net prometheus

Wrapping Up

Docker networking decision tree: single host + isolation → custom bridge. Single host + performance → host. Multi-host → overlay. Physical network access → macvlan. No network → none. Always use custom bridge networks, always set internal: true for database networks, and keep nicolaka/netshoot handy for debugging.
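The decision tree reads naturally as a lookup table; here is a throwaway shell sketch (the pick_driver helper and its requirement keywords are invented purely for illustration, not a Docker command):

```shell
# Hypothetical helper that encodes the decision tree above;
# the requirement keywords are made up for illustration.
pick_driver() {
  case "$1" in
    isolation)    echo "custom bridge" ;;   # single host + isolation
    performance)  echo "host" ;;            # single host + raw throughput
    multi-host)   echo "overlay" ;;         # containers across hosts
    physical-l2)  echo "macvlan" ;;         # needs a real MAC on the LAN
    air-gapped)   echo "none" ;;            # no network at all
    *)            echo "custom bridge" ;;   # sane default
  esac
}

pick_driver multi-host    # prints "overlay"
```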

→ Going deeper? The Kubernetes Cluster Hardening guide covers NetworkPolicies — the production-grade version of Docker network isolation for orchestrated workloads.
Kubernetes Security · December 18, 2024 · 14 min read

Kubernetes Cluster Hardening: 12 Security Controls Every Engineer Must Implement

Your cluster is running — but is it secure? Walk through RBAC, network policies, pod security standards, secrets encryption at rest, and audit logging with real YAML manifests you can apply directly to production.

Most Kubernetes clusters are deployed with security as an afterthought. The defaults are permissive, tokens get mounted everywhere, and network traffic flows freely between pods. In production, this is a disaster waiting to happen.

⚠ Before you start: Apply these changes to a staging cluster first. PodSecurity enforcement and NetworkPolicies can break existing workloads if not rolled out carefully.

1. Enable and Enforce RBAC

bash
# Find all subjects with cluster-admin
kubectl get clusterrolebindings \
  -o jsonpath='{range .items[*]}{.metadata.name}{"\t"}{.roleRef.name}{"\t"}{range .subjects[*]}{.kind}/{.name}{" "}{end}{"\n"}{end}' \
  | grep cluster-admin
yaml — deploy-role.yaml
apiVersion: rbac.authorization.k8s.io/v1
kind: Role
metadata:
  namespace: production
  name: deploy-manager
rules:
- apiGroups: ["apps"]
  resources: ["deployments", "replicasets"]
  verbs: ["get", "list", "watch", "update", "patch"]
✓ Pro tip: Use kubectl auth can-i --list --as=system:serviceaccount:NAMESPACE:SA to audit exactly what a service account can do before deploying.
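A Role grants nothing until it is bound to a subject. A minimal RoleBinding sketch for the Role above (the deploy-bot ServiceAccount name is a placeholder, not from the original):

```yaml
apiVersion: rbac.authorization.k8s.io/v1
kind: RoleBinding
metadata:
  name: deploy-manager-binding
  namespace: production
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: Role
  name: deploy-manager
subjects:
- kind: ServiceAccount
  name: deploy-bot        # placeholder: your CI/CD service account
  namespace: production
```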

2. Apply Pod Security Standards

bash
# Enforce restricted policy on production namespace
kubectl label namespace production \
  pod-security.kubernetes.io/enforce=restricted \
  pod-security.kubernetes.io/enforce-version=latest \
  pod-security.kubernetes.io/warn=restricted \
  pod-security.kubernetes.io/audit=restricted
yaml — restricted pod spec
securityContext:
  runAsNonRoot: true
  runAsUser: 1000
  readOnlyRootFilesystem: true
  allowPrivilegeEscalation: false
  seccompProfile:
    type: RuntimeDefault
  capabilities:
    drop: ["ALL"]
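Note that readOnlyRootFilesystem and capabilities are container-level fields only, so this block belongs inside each container's securityContext rather than the pod-level one. A minimal placement sketch (pod name and image are placeholders; the image must be able to run as UID 1000):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: restricted-demo                    # placeholder name
  namespace: production
spec:
  containers:
  - name: app
    image: registry.example.com/app:1.0    # placeholder; must support non-root
    securityContext:                       # container-level: all fields valid here
      runAsNonRoot: true
      runAsUser: 1000
      readOnlyRootFilesystem: true
      allowPrivilegeEscalation: false
      seccompProfile:
        type: RuntimeDefault
      capabilities:
        drop: ["ALL"]
```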

3. Implement Network Policies

By default, all pods can talk to all other pods. Start with a default-deny-all policy, then selectively open required traffic.

yaml — default-deny-all.yaml
apiVersion: networking.k8s.io/v1
kind: NetworkPolicy
metadata:
  name: default-deny-all
  namespace: production
spec:
  podSelector: {}
  policyTypes: [Ingress, Egress]
---
# Allow frontend → backend traffic only
apiVersion: networking.k8s.io/v1
kind: NetworkPolicy
metadata:
  name: allow-frontend-to-backend
  namespace: production
spec:
  podSelector:
    matchLabels: { app: backend }
  policyTypes: [Ingress]
  ingress:
  - from:
    - podSelector:
        matchLabels: { app: frontend }
    ports: [{ protocol: TCP, port: 8080 }]
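One caveat: once default-deny covers Egress, pods can no longer reach cluster DNS, so service names stop resolving. A companion allow-DNS policy is usually required (the k8s-app: kube-dns label matches standard CoreDNS deployments, and the kubernetes.io/metadata.name namespace label exists on Kubernetes 1.21+; verify both in your cluster):

```yaml
apiVersion: networking.k8s.io/v1
kind: NetworkPolicy
metadata:
  name: allow-dns-egress
  namespace: production
spec:
  podSelector: {}
  policyTypes: [Egress]
  egress:
  - to:
    - namespaceSelector:
        matchLabels: { kubernetes.io/metadata.name: kube-system }
      podSelector:
        matchLabels: { k8s-app: kube-dns }
    ports:
    - { protocol: UDP, port: 53 }
    - { protocol: TCP, port: 53 }
```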

4. Encrypt Secrets at Rest

yaml — encryption-config.yaml
apiVersion: apiserver.config.k8s.io/v1
kind: EncryptionConfiguration
resources:
- resources: ["secrets"]
  providers:
  - aescbc:
      keys:
      - name: key1
        secret: <base64-encoded-32-byte-key>
  - identity: {}
bash
# Generate a 32-byte encryption key
head -c 32 /dev/urandom | base64

# Re-encrypt all existing secrets after enabling
kubectl get secrets --all-namespaces -o json | kubectl replace -f -
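Key generation and config templating can be combined into one provisioning sketch (the output path is an example; in practice, point the API server's --encryption-provider-config flag at the file's final location):

```shell
# Generate a 32-byte key; base64 of 32 bytes is always 44 characters
KEY=$(head -c 32 /dev/urandom | base64)
echo "${#KEY}"    # prints 44

# Write the encryption config with the key spliced in (example path)
cat > /tmp/encryption-config.yaml <<EOF
apiVersion: apiserver.config.k8s.io/v1
kind: EncryptionConfiguration
resources:
- resources: ["secrets"]
  providers:
  - aescbc:
      keys:
      - name: key1
        secret: ${KEY}
  - identity: {}
EOF
```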

5. Enable Audit Logging

yaml — audit-policy.yaml
apiVersion: audit.k8s.io/v1
kind: Policy
rules:
- level: Metadata
  resources: [{ group: "", resources: ["secrets"] }]
- level: RequestResponse
  resources: [{ group: "", resources: ["pods/exec", "pods/portforward"] }]
- level: None
  users: ["system:kube-proxy"]
  verbs: ["watch"]
# Catch-all: log request metadata for everything else
- level: Metadata
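The policy file only takes effect once kube-apiserver is restarted with audit flags pointing at it. A typical flag set (file paths are examples; on kubeadm clusters these go in the API server's static pod manifest):

```bash
--audit-policy-file=/etc/kubernetes/audit-policy.yaml
--audit-log-path=/var/log/kubernetes/audit.log
--audit-log-maxage=30       # days to retain log files
--audit-log-maxbackup=10    # rotated files to keep
--audit-log-maxsize=100     # max size in MB before rotation
```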

6–12. Remaining Controls

| # | Control | Key Action |
|---|---|---|
| 6 | Restrict Container Images | Use OPA Gatekeeper / Kyverno to allowlist trusted registries |
| 7 | Harden the API Server | Disable anonymous auth, set --authorization-mode=RBAC,Node |
| 8 | Secure etcd | TLS client certs, restrict access to API server only |
| 9 | Set Resource Limits | CPU/memory requests and limits on every container |
| 10 | Disable Default SA Token | Set automountServiceAccountToken: false on deployments |
| 11 | Use Admission Controllers | Enable OPA/Gatekeeper for policy-as-code enforcement |
| 12 | Scan Images for CVEs | Integrate Trivy into CI pipeline, block on critical vulns |
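Control 10 is a one-line change. A sketch at the ServiceAccount level (the account name is a placeholder):

```yaml
apiVersion: v1
kind: ServiceAccount
metadata:
  name: app-sa               # placeholder name
  namespace: production
automountServiceAccountToken: false
```

The same field can also be set per-pod under spec.automountServiceAccountToken in a deployment's pod template, which overrides the ServiceAccount setting.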

Wrapping Up

Security hardening is an ongoing practice. Start with RBAC, network policies, and pod security standards — they have the highest impact. Automate scanning in CI and review audit logs weekly.

→ Want the checklist as a PDF? The Complete IT Infra Bundle includes a Kubernetes Security Hardening checklist with all 12 controls, YAML manifests, and a runbook template.
// About theDataShark

Built by an Engineer,
for Engineers

No agency. No content farm. A working IT infrastructure engineer sharing what actually works in production.

DS

theDataShark

IT Infrastructure Engineer · India

I'm an IT infrastructure engineer with hands-on experience managing Linux servers, cloud environments (AWS & GCP), Kubernetes clusters, and CI/CD pipelines in production. I started theDataShark because I was tired of tutorials written by people who had clearly never deployed anything to a real server.

Every article on this site is tested on live infrastructure before it's published. The commands work. The scripts run. The configs are real.

What You'll Find Here

🐧

Linux & Shell

Server hardening, shell scripting, system performance, and the commands you'll use every day.

☁️

Cloud (AWS/GCP)

Cost optimization, monitoring, IAM, and infrastructure patterns that scale without breaking the bank.

☸️

Kubernetes

Security hardening, resource management, RBAC, and production-grade cluster configurations.

🐳

Docker & Networking

Container networking deep dives, multi-stage builds, Compose patterns, and image optimization.

The Content Philosophy

  • No fluff. If a section doesn't make you a better engineer, it gets cut.
  • Working code only. Every command, script, and config is tested before it appears here.
  • Honest recommendations. Affiliate links are clearly labeled. Only tools actually used are recommended.

Courses & eBooks

For engineers who want to go deeper, I publish premium eBooks and courses — dense, practical, no-filler guides. Browse the courses →

Get in Touch

Topic suggestion, error in a guide, or want to discuss a collaboration? Send a message →

// Disclosure

This site contains affiliate links for Namecheap, The SSL Store, and Dynadot. We earn a small commission if you purchase through these links — at no extra cost to you. Full disclosure →

// Contact

Get in Touch

Article suggestions, error reports, partnerships, or purchase support — all welcome.

✉️

General Enquiries

Article suggestions, feedback, or anything else.

hello@thedatashark.com
🤝

Partnerships & Sponsorships

Interested in sponsoring a post or newsletter edition?

partnerships@thedatashark.com
🐛

Found an Error?

If a command or config is wrong or outdated — I'll fix it and credit you.

corrections@thedatashark.com
📦

Purchase Support

Issues with a download or course access after payment.

support@thedatashark.com

Send a Message

Email directly: hello@thedatashark.com

// FAQ

Frequently Asked Questions

Everything about the blog, courses, purchases, and affiliate links.

About the Blog

Who writes the content?
All content is written by a single working IT infrastructure engineer based in India. No guest posts, no outsourced writers. Every article is written, tested, and published by the same person who runs the site.

Is every command really tested?
Yes — every command, script, YAML manifest, and configuration block is validated on live infrastructure before publication. If something breaks or becomes outdated, the article gets updated. Corrections are welcome at corrections@thedatashark.com.

How often is new content published?
New articles are published 1–2 times per week. Quality is prioritised over volume — a well-tested 20-minute deep dive is more valuable than five shallow posts. Subscribe to the newsletter to get notified of new content.

Can I suggest topics or report errors?
Absolutely. Topic suggestions and error reports are very welcome. Use the contact page or email corrections@thedatashark.com directly. If your error report leads to a correction, you'll be credited in the article.

Courses & eBooks

How do I receive my eBook after purchase?
After payment is confirmed, you'll receive an instant download link by email. Files are DRM-free PDFs — no special software needed, works on any device. Save them to your phone, tablet, laptop, or cloud storage.

Do I need prior experience?
The eBooks are written for working IT professionals with basic terminal exposure. If you know how to SSH into a server and run basic commands, you're ready. You don't need prior scripting experience — guides build concepts progressively.

Do purchases include future updates?
Yes. All purchases include lifetime updates. When a guide is revised or expanded, existing customers receive the updated version at no extra charge.

What is the refund policy?
7-day no-questions-asked refund if you're not satisfied. Email support@thedatashark.com with your order details and the refund will be processed within 48 hours.

What payment methods are accepted?
UPI, credit/debit cards, and net banking (via Razorpay) for Indian customers. International credit/debit cards via Gumroad. theDataShark never stores payment information.

Newsletter

What does the newsletter include?
New article notifications (1–2 per week), occasional quick-tip emails, and early access to new courses and guides. No promotional spam, no third-party promotions, no selling your email.

How do I unsubscribe?
Every email contains a one-click unsubscribe link at the bottom. You can also email hello@thedatashark.com and your address will be removed within 24 hours. The newsletter runs on EmailOctopus and is GDPR-compliant.

Affiliates

Which affiliate programs does this site participate in?
Currently: Namecheap (web hosting), The SSL Store (SSL certificates), and Dynadot (domain registration). All three are genuinely used and recommended. Affiliate links are always clearly labeled.

Do affiliate links cost me anything extra?
No — you pay exactly the same price whether you use an affiliate link or go directly to the vendor. The commission comes from their marketing budget. Using affiliate links is a free way to support this blog.
// Legal

Privacy Policy

How theDataShark collects, uses, and protects your information.

Last updated: January 2025

This Privacy Policy describes how theDataShark ("we", "us"), operating at thedatashark.com, collects and uses personal information when you visit our website or purchase our products.

Information We Collect

Information you provide directly

  • Email address — when you subscribe to the newsletter via EmailOctopus
  • Name and email — when you use the contact form
  • Payment information — processed by Gumroad or Razorpay; we never see or store your card details

Information collected automatically

  • Analytics data — page views, session duration, traffic sources via Google Analytics 4 (aggregated and anonymised)
  • Cookies — Google Analytics uses cookies to distinguish visitors; affiliate tracking links use cookies to track referrals
  • Server logs — standard logs including IP address, browser type, and pages visited, retained by Namecheap's hosting infrastructure

How We Use Your Information

  • To send the newsletter you subscribed to
  • To respond to contact form messages
  • To process and deliver product purchases
  • To understand site traffic and improve content
  • To track affiliate referrals for commission purposes

Third-Party Services

  • EmailOctopus — newsletter delivery and subscriber management
  • Google Analytics 4 — aggregated traffic analytics
  • Gumroad and Razorpay — payment processing
  • Namecheap — web hosting and server logs

Your Rights

You have the right to access, correct, or delete your personal data at any time. Email hello@thedatashark.com. Newsletter subscribers can unsubscribe at any time via the link in any email.

Children's Privacy

This website is not directed at children under 13. We do not knowingly collect personal information from children.

Changes to This Policy

Changes will be posted on this page with an updated date. Continued use of the site after changes constitutes acceptance of the updated policy.

Contact

hello@thedatashark.com

// Legal

Affiliate Disclosure

Full transparency about how this site earns from affiliate partnerships.

Last updated: January 2025

theDataShark participates in affiliate marketing programs. When you click certain links and make a purchase, we may earn a commission — at no additional cost to you.

Our Affiliate Partners

Namecheap (Web Hosting)

We participate in the Namecheap affiliate program. Namecheap is recommended because it's the hosting provider used to run this website — reliable shared hosting at competitive prices, suitable for beginners and small projects.

The SSL Store (SSL Certificates)

We participate in The SSL Store affiliate program via Commission Junction. SSL certificates are necessary for any production web deployment, and The SSL Store offers significant discounts (40–70%) off retail pricing.

Dynadot (Domain Registration)

We participate in the Dynadot affiliate program. Dynadot is recommended for its competitive pricing, free WHOIS privacy protection, and clean DNS management interface.

How It Works

  • When you click an affiliate link, a cookie is stored in your browser for a tracking period (typically 30–90 days)
  • If you make a purchase within that window, we receive a commission from the vendor
  • You pay the same price you'd pay going directly to the vendor
  • Affiliate links are identified on this site with a label or the "▸" symbol

Editorial Independence

Affiliate relationships do not influence editorial content. We only recommend services we have personally used and believe provide genuine value. We do not accept payment for editorial coverage or article placement.

FTC Compliance

This disclosure complies with the U.S. Federal Trade Commission's guidelines on endorsements (16 CFR Part 255) and equivalent requirements in other jurisdictions.

Questions

Email hello@thedatashark.com