People have always valued privacy. Developments of the past decades (the internet, social networks, targeted advertising) turned data into an asset, and the AI wave multiplies what can be inferred from crumbs. As a predominantly mobile engineer, I've worked on apps that manage email inboxes, upload photos and videos to the cloud, and provide encrypted messaging. Phones and apps are integral to people's lives. Some users keep everything on their phones; others are more restrictive. Privacy shouldn't rely only on user awareness: developers should provide the first line of defence and the tools that protect a user's right to privacy. From an engineering perspective, privacy is a design constraint. Use a simple lifecycle (collect → store → process → send → delete) and treat each stage as a surface with budgets (fields, TTLs, queue sizes) and controls (consent, encryption, redaction, access). Even if you already deal with most of these pieces daily, I want to share my mental model: how I frame decisions with checklists, plus a few concrete examples from practice.
1. Privacy, precisely
What is privacy exactly? There are dozens of definitions, and I won't pick a single one; most of us have a natural feel for it. One note that will help as we go: privacy is not security. We won't dive into specific encryption algorithms or attacks; assume the mathematical and technical foundations work. The question is when and why to use them. My oversimplified view of privacy on mobile: limit why we process data, how much we move, and how long we keep it. The lifecycle below isn't a strict timeline; a feature can hit "send" more than once. The point is to treat each stage as a privacy surface. We'll use that model throughout: threats first, then what counts as personal data, lawful grounds, minimisation/retention, and telemetry boundaries.
1.1 Start with a threat model
Threats decide where we spend effort. Imagine we're building a secure chat client. Attackers aren't only people trying to break the cryptographic protocol. Most incidents are boring: a debug log prints a phone number; the app-switcher snapshot shows a private chat; a crash report scoops up payload fragments; a WebView cache outlives a user's logout. Add two more realities: insiders with too much access to analytics dashboards, and lawful requests where scope and auditability matter. We also consider external factors such as device theft, rooted/jailbroken phones, and network attacks, though some are hard to prevent entirely. Day to day, the biggest wins come from removing accidental leak paths and proving we can answer narrowly when required. In the table below, I highlight which parts of the lifecycle are more prone to a given threat (read it as "watch closely during reviews").
| Threats / surfaces | Collect | Store | Process | Send | Delete |
|---|---|---|---|---|---|
| Accidental leak | ●● | ●●● | ●● | ●● | ●●● |
| Insider overreach | ● | ●● | ●●● | ●●● | ● |
| Attacker (device/net) | ● | ●●● | ●● | ●●● | ● |
| Lawful request | ● | ● | ● | ●●● | ●●● |
1.2 What counts as personal data (so we speak the same language)
This is a regulated space, and I'm not giving legal advice. Fortunately, even though regulators differ (UK/EU GDPR, India's DPDP, US state laws), a safe engineering stance is consistent:
- Personal data: anything that relates to an identifiable person, directly or indirectly. On mobile, assume most telemetry qualifies.
- Sensitive data: some fields (e.g. health, precise location, biometrics used for unique identification) need extra care.
- Pseudonymised: data is still personal if we can re-link it. In a chat client, a mapping table or login token keeps it in scope.
- Anonymous: rare in practice. Don't label analytics "anonymous" unless you genuinely can't re-identify them.
[Figure: "Is this personal?" quick decision tree]
When I say "treat as personal", include it in consent, minimisation, retention and deletion decisions.
1.3 Lawful grounds to process
Data flows from users through your systems and back again. Regardless of complexity, every data flow needs a reason the law accepts. In apps, three show up again and again:
- Contract: necessary to deliver the service the user asked for. In a chat client: sending messages, push delivery, device tokens for contactability. In a photo app: geolocation to tag photos.
- Consent: a clear, affirmative choice the user can withdraw as easily as they gave it. In a chat client: optional diagnostics, marketing, experimental models that aren't essential. In a photo app: allowing on-device training of vision models.
- Legitimate interests: a documented balance test (purpose, necessity, impact). In a chat client: anti-abuse signals, coarse reliability telemetry.
You don't need to be a lawyer to use these well (though carefully listing all attributes can make you feel like one), but you do need to write them down per data flow. It keeps debates concrete.
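One way to write them down is next to the code they govern, so reviews see the ground change together with the flow. A minimal sketch in Kotlin; the enum, fields and example flows are illustrative, not a legal framework:

```kotlin
// Hypothetical registry: one entry per data flow, reviewed like any other code.
enum class Ground { CONTRACT, CONSENT, LEGITIMATE_INTERESTS }

data class DataFlow(
    val name: String,
    val ground: Ground,
    val purpose: String,
    val liaReference: String? = null, // link the balance test when LI is claimed
)

val dataFlows = listOf(
    DataFlow("message_delivery", Ground.CONTRACT, "deliver messages the user sends"),
    DataFlow("optional_diagnostics", Ground.CONSENT, "debug crashes; off until opt-in"),
    DataFlow(
        "anti_abuse_signals", Ground.LEGITIMATE_INTERESTS,
        "detect spam floods", liaReference = "LIA-123",
    ),
)
```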
1.4 Minimisation and retention in practice
Here we move closer to execution, specifically to how this shows up in code. In your app, minimisation might mean:
- Contact discovery without uploading full address books (hashes or private-set intersection; see the sketch after this list).
- Event schemas that don't include message content or full phone numbers; store a reference instead.
- On-device ML for spam detection, sending only model updates or coarse aggregates if you need fleet-level learning.
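A sketch of the hashing variant, assuming a server-held pepper; contactToken and its parameters are mine, and the caveat in the comments matters more than the code:

```kotlin
import java.security.MessageDigest

// Illustrative only: a peppered hash reduces casual exposure but is NOT
// anonymisation - the phone-number space is small enough to brute-force,
// so the output stays in scope for consent, retention and deletion.
// Private-set intersection is the stronger option.
fun contactToken(e164Number: String, pepper: ByteArray): String {
    val digest = MessageDigest.getInstance("SHA-256")
    digest.update(pepper)                          // held server-side, rotated
    digest.update(e164Number.encodeToByteArray())  // normalised E.164 form
    return digest.digest().joinToString("") { "%02x".format(it) }
}
```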
Retention time-boxes everything:
- Messages cached on device until delivered or a short TTL; after that, wiped.
- Encrypted offline queue with a maximum age and size; old items are dropped rather than shipped late.
- Crash/diagnostic data expire locally; server retention mirrors the client.
- Logout and account deletion wipe Keychain/Keystore entries, databases, caches, and WebView storage, not just app tables.
One of the most important aspects of retention is deletion: easy to overlook, which is why it needs an owner. Ship a small deletion orchestrator that wipes local state and coordinates server calls. It clears Keychain/Keystore, the encrypted DB, caches and WebView storage, and drops undelivered telemetry past TTL. Features don't hand-roll erasure; they call the orchestrator. After the server ack, the wipe finishes within seconds and the app broadcasts a "data erased" signal so modules purge in-memory state too.
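A minimal sketch of such an orchestrator in Kotlin. Every interface here (SecureStore, MessageDb, Cache, TelemetryQueue, ModuleBus) is a hypothetical seam you would back with Keychain/Keystore, your encrypted database, and so on:

```kotlin
// Hypothetical seams; each wraps a real storage or transport subsystem.
interface SecureStore { fun wipeAll() }
interface MessageDb { fun dropPersonalData() }
interface Cache { fun clear() }
interface TelemetryQueue { fun freeze(); fun dropAll() }
fun interface ModuleBus { fun broadcast(event: String) }

class DeletionOrchestrator(
    private val secureStore: SecureStore,
    private val db: MessageDb,
    private val caches: List<Cache>,
    private val queue: TelemetryQueue,
    private val bus: ModuleBus,
) {
    // Order matters: freeze outbound traffic before wiping, broadcast last
    // so modules purge in-memory state only after durable state is gone.
    fun eraseAll() {
        queue.freeze()
        secureStore.wipeAll()
        db.dropPersonalData()
        caches.forEach(Cache::clear)
        queue.dropAll()
        bus.broadcast("data_erased")
    }
}
```

The design choice worth copying is the single entry point: features depend on eraseAll(), so a cache added next quarter has one obvious place to register.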
1.5 Telemetry boundaries (logs are data)
Telemetry is where strong designs die by a thousand cuts. From small start-ups to giants, teams are data-driven, and decisions ride on logs and analytical events. If your app brokers encrypted conversations between two people, a single "helpful" log line can link the parties and make traffic analysis trivial. Treat dynamic values as private by default. Redact at the call-site. Keep sampling and kill-switches on device so you can turn noisy families off without a new build. Give telemetry the same deletion guarantees as product data: queues age out; debug captures don't survive a day; test-only trust stores never ship in Release.
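A minimal sketch of redact-at-the-call-site in Kotlin; the Priv wrapper and log helper are illustrative, not a known logging API:

```kotlin
// Dynamic values are private by default: the type system forces the caller
// to choose a rendering, and the safe choice is the redacted one.
sealed interface Priv {
    data class Redacted(val label: String) : Priv { override fun toString() = "<$label>" }
    data class Public(val value: String) : Priv { override fun toString() = value }
}

fun phone(raw: String): Priv = Priv.Redacted("phone#${raw.hashCode()}") // masked at call-site

fun log(template: String, vararg values: Priv) =
    println(values.fold(template) { acc, v -> acc.replaceFirst("{}", v.toString()) })

fun main() {
    log("delivery failed for {}", phone("+447700900123"))
    // prints "delivery failed for <phone#...>" - the raw number never reaches the sink
}
```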
2. Why it matters
I get the urge to be first to market: ship fast, sort the details later. With that mindset, privacy can feel like a nice-to-have. But privacy isn't just ethics; it's how we keep shipping without surprises. We'll stay with the chat client. We use a robust cryptographic protocol for end-to-end messages. That's necessary, but not sufficient. If the client leaks around the edges, the protocol won't save us.
Two common failure modes:
- Listing vs reality. The store page says we don't collect contact details, yet a diagnostics SDK includes emails in crash payloads. That's review ping-pong at best; a block at worst.
- Consent drift. Analytics start sending before the user decides. Our standards say "ask first"; the build says otherwise. Strong cryptography, weak discipline.
Platform rules set the floor
We don't build in a vacuum. We ship on iOS and Android, with their policies, forms and runtime checks. Treat these as engineering constraints.
- On iOS, the "App Privacy" section must match what the binary does, first-party code and SDKs alike. If they diverge, submission stalls.
- On Android, "Data safety" and the privacy policy must accurately describe collection, sharing and protections. If the app behaves differently to the form, you'll feel it at submission time.
Design with that in mind: if the chat client logs a push_token, it's personal data and must be declared. If contact discovery uploads hashed numbers, it's still collection. If diagnostics are optional, consent lives in the UI before the first send.
Third-party SDK intake. SDKs go through the same constraints as our code. Be at least twice as cautious with third parties: vulnerabilities in your code are usually unintentional; with an SDK they can be both accidental and deliberate. Each SDK gets a one-page intake card: purpose + lawful ground, exact fields collected, retention on their side, consent gate and kill-switch, endpoints/pinning, and whether store disclosures change. Wrap the SDK behind your interface and route its sends through the same interceptors and encrypted queue. Until consent, the wrapper is a no-op.
It pays back in delivery speed
We shouldn't block creativity or delivery speed with red tape. The trick is to enrich workflows so checks are cheap. A small Privacy Block in the PR sets the purpose, grounds, budgets and deletion path. SDKs go through a repeatable intake. Telemetry defaults are safe by design. The store forms are easy because they reflect the same source of truth. Fewer "what is this event?" threads. Fewer last-minute fixes.
Privacy SLOs help. Track a few budgets like performance: queue TTL ≤ 7 days; max queue size ≤ 500; diagnostics default ≤ 1%; kill-switch propagation < 10 minutes. Secrets only in Keychain/Keystore. No Release build contains test trust roots. Tune numbers to your product and enforce them in CI/canary.
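One way to keep those budgets honest is to pin them as constants asserted in CI, so drift shows up in review. A sketch with illustrative names, assuming JUnit 4:

```kotlin
import org.junit.Assert.assertTrue
import org.junit.Test

// Hypothetical budget object: changing a number means changing this file,
// which makes privacy SLO drift visible in code review.
object PrivacyBudgets {
    const val QUEUE_TTL_DAYS = 7
    const val QUEUE_MAX_ITEMS = 500
    const val DIAGNOSTICS_DEFAULT_SAMPLING = 0.01
}

class PrivacyBudgetTest {
    @Test
    fun `budgets stay within agreed SLOs`() {
        assertTrue(PrivacyBudgets.QUEUE_TTL_DAYS <= 7)
        assertTrue(PrivacyBudgets.QUEUE_MAX_ITEMS <= 500)
        assertTrue(PrivacyBudgets.DIAGNOSTICS_DEFAULT_SAMPLING <= 0.01)
    }
}
```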
3. Mobile specifics
Client-side privacy patterns look similar across user-facing apps, but modern phones don't trail desktops; they often exceed them in what they can collect. A phone lives in your pocket, not on a desk. It carries sensors (location, camera, mic, Bluetooth), share sheets and pasteboard, and background tasks that run when the screen is off. That creates more ways to collect, more places to stash data, and more chances to leak.
3.1 On-device data: pick the right home
Most data starts life on the device. Treat it like hazardous material: move less of it, and keep it where escape is least likely.
If we go back to our chat client, that translates into:
- Secrets and credentials (identity keys, session keys, refresh tokens) live in Keychain/Keystore with the tightest accessibility you can tolerate. Where hardware-bound keys and biometry are available, require them for user-initiated actions (e.g. revealing a recovery phrase). Keep secrets out of shared app-group containers; share handles, not keys.
- Business records (message metadata, device records, delivery receipts) belong in an encrypted database (Core Data/Room + encryption, or an encrypted file store). You get indexing and sync-friendly queries without turning the keychain into a database.
- Derived/temporary (thumbnails, model features, prefetch caches) go to ephemeral caches with TTLs. If you can regenerate it, don't back it up.
Backups and snapshots. Exclude caches and generated artefacts from backup. On Android, use backup rules; on iOS, set the do-not-backup flag for throwaway files. Obscure sensitive screens in the app switcher (Android: FLAG_SECURE; iOS: blur/hide the background snapshot). Avoid pasteboard leaks on chat screens.
Work profiles and Direct Boot. On Android, secrets should live in credential-encrypted storage, not device-encrypted space available before first unlock. In work-profile deployments, respect cross-profile boundaries; don't leak personal identifiers into the work side.
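To make "tightest accessibility" from the secrets bullet concrete, here is a sketch for Android Keystore. The key alias is a placeholder, the auth parameters need API 30+, and hardware backing depends on the device:

```kotlin
import android.security.keystore.KeyGenParameterSpec
import android.security.keystore.KeyProperties
import javax.crypto.KeyGenerator

// Generates an AES key inside AndroidKeyStore: non-exportable, unusable
// before the device is unlocked, and gated behind strong biometrics for
// 30 seconds after each authentication.
fun createWrappingKey() {
    val spec = KeyGenParameterSpec.Builder(
        "chat.identity.wrapping", // placeholder alias
        KeyProperties.PURPOSE_ENCRYPT or KeyProperties.PURPOSE_DECRYPT,
    )
        .setBlockModes(KeyProperties.BLOCK_MODE_GCM)
        .setEncryptionPaddings(KeyProperties.ENCRYPTION_PADDING_NONE)
        .setUnlockedDeviceRequired(true)
        .setUserAuthenticationRequired(true)
        .setUserAuthenticationParameters(30, KeyProperties.AUTH_BIOMETRIC_STRONG)
        .build()

    KeyGenerator.getInstance(KeyProperties.KEY_ALGORITHM_AES, "AndroidKeyStore")
        .apply { init(spec) }
        .generateKey() // key material never leaves the keystore
}
```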
3.2 Logging & telemetry: don't let observability undo privacy
- Redact at source. Treat dynamic values as private by default. If you must print identifiers, mask or hash at the call-site. Structured events with typed fields beat ad-hoc strings.
- Never join both ends. Don't record sender_id and recipient_id together in one event. When pairing is genuinely required, use synthetic correlation that rotates often.
- Budgets live on device. Sampling rates, queue TTLs and a global kill-switch should be switchable without a new build (see the sketch after this list). Noisy families die fast.
- Crash reports & traces obey the same rules. Scrub payloads at SDK boundaries; redact request/response bodies before persistence; keep local captures short-lived.
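A sketch of the budgets bullet; BudgetConfig, the family pattern syntax, and the default rate are assumptions for illustration:

```kotlin
import kotlin.random.Random

// On-device telemetry budgets: sampling and kill-switches read from a locally
// cached config, so a noisy event family can be silenced without a new build.
data class BudgetConfig(
    val killSwitches: Set<String>,          // e.g. "events.delivery.*"
    val samplingRates: Map<String, Double>, // per event name, 0.0..1.0
    val defaultRate: Double = 0.01,
)

class TelemetryGate(initial: BudgetConfig) {
    @Volatile private var config: BudgetConfig = initial

    fun update(fresh: BudgetConfig) { config = fresh } // pushed via remote config

    fun shouldSend(event: String): Boolean {
        val cfg = config
        if (cfg.killSwitches.any { event.inFamily(it) }) return false
        return Random.nextDouble() < (cfg.samplingRates[event] ?: cfg.defaultRate)
    }
}

private fun String.inFamily(pattern: String): Boolean =
    if (pattern.endsWith(".*")) startsWith(pattern.dropLast(1)) else this == pattern
```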
3.3 Network boundaries
When people think "network", they jump to security: HTTPS, certificates, proxies. Through this prism, the network is the moment data leaves the device.
- Interceptors first. Everything outbound flows through interceptors that enforce redaction, apply idempotency keys, and write to an encrypted offline queue (see the sketch after this list). That queue has hard limits: age, size, and retry budget with back-off and jitter. If the budget is exhausted, drop rather than ship stale data.
- Pin to an identity you can rotate. Use TLS properly and pin to a stable identity (SPKI or an intermediate), not a single leaf cert. Ship overlapping pinsets so you can add the new one, then retire the old one later. Never rotate in a single step.
- mTLS sparingly. Mutual TLS helps when you must strongly bind a device to a private API, but it complicates rotation and recovery. Use it when the benefit is clear; otherwise rely on least-privilege tokens with strong pinning.
- Debug without foot-guns. Proxies and custom trust stores stay Debug-only. Ship toggles that enable extra logs or local capture only on non-production builds. Never add test roots to Release.
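As an example of the first bullet, a sketch assuming OkHttp; the redaction hook, header name, and key provider are illustrative seams, not a prescribed API:

```kotlin
import okhttp3.Interceptor
import okhttp3.Request
import okhttp3.Response

// Every outbound request passes the same belt: redact, stamp an idempotency
// key so retries are safe, then proceed.
class PrivacyBeltInterceptor(
    private val redact: (Request) -> Request,        // hypothetical redaction hook
    private val idempotencyKey: (Request) -> String, // stable per logical operation
) : Interceptor {
    override fun intercept(chain: Interceptor.Chain): Response {
        val original = chain.request()
        val gated = redact(original)
            .newBuilder()
            .header("Idempotency-Key", idempotencyKey(original))
            .build()
        return chain.proceed(gated)
    }
}
```

Install it once on the shared OkHttpClient (addInterceptor), so feature code has no unbelted path to the network.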
3.4 ML on device: keep learning local
ML in apps isn't new, and with the AI boom we'll see even more of it. I'm especially interested in on-device models; they have real potential to be your private helper, compared to large server models that also feed on inputs. Examples I've shipped or reviewed: spam scoring, media quality adaptation, keyboard suggestions.
- Inference first. Run models on device (see the sketch after this list). Inputs never leave; outputs are small, derived signals (e.g. "spam probability 0.91", not message text). Cache features with short TTLs.
- Updates are code. Treat model files like binaries: sign, verify, stage, activate, and keep a rollback. Canary new versions to a small percentage before full rollout.
- If you must learn across devices. Prefer federated approaches where updates (not raw data) are aggregated. Add guard-rails: minimum cohort sizes, noise where appropriate, and strict privacy budgets for any evaluation telemetry. Never ship gradients or metrics that can be tied back to message content or a single user.
- Model drift without hoovering data. Track coarse, non-identifying signals: version adoption, on-device confidence histograms, rate of "disagree" events (user marks "not spam"). Enough to detect regressions without collecting inputs.
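A sketch of the inference-first point; SpamModel stands in for whatever on-device runtime you use (TFLite, Core ML), and the feature extraction is deliberately toy:

```kotlin
// Only a coarse, derived signal crosses the boundary; the text stays local.
fun interface SpamModel { fun score(features: FloatArray): Float }

data class SpamSignal(val probability: Float, val modelVersion: Int)

class SpamScorer(private val model: SpamModel, private val version: Int) {
    fun classify(messageText: String): SpamSignal {
        val features = extractFeatures(messageText) // derived locally, short-lived
        return SpamSignal(model.score(features), version)
        // messageText is never logged, cached, or sent
    }

    private fun extractFeatures(text: String): FloatArray =
        floatArrayOf(text.length.toFloat(), text.count { it.isDigit() }.toFloat())
}
```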
4. Checklists & templates (copy-paste ready)
Short and opinionated.
4.1 Design doc snippet
## Privacy summary (for this design)
**Threats in scope:** [accidental logs, snapshots, cache residue, lawful request scope]
**Grounds:** [Contract for delivery; Legitimate interests for anti-abuse (LIA-xxx); Consent for diagnostics]
**Minimisation:** [fields reduced; no message content in telemetry; phone numbers not stored, only salted hashes]
**Retention:** [queue TTL 7d; diagnostics 30d; local caches TTL 24h; server mirrors client]
**Send gating:** [all sends through interceptors → redaction → encrypted queue → retry/backoff]
**Deletion owner:** [Deletion orchestrator clears keychain/keystore, DB, caches, WebView; drops stale queue items]
4.2 SDK intake card
# SDK Intake
**Name & version:**
**Owner:**
**Purpose:**
**Grounds:** [Contract / Consent / Legitimate interests (LIA link)]
## Data contract
| Field | PII class | Purpose | Retention (SDK side) | Notes |
|------|-----------|---------|----------------------|-------|
**Runtime controls:** consent gate? per-country toggles? kill-switch path?
**Routing:** domains; proxy? pinning policy? uses our interceptors? [Yes/No]
**Access:** secrets? [None] tokens? [short-lived only]
**Store impact:** privacy details / Data safety update required? [Yes/No]
**Test plan:** block until consent; verify no sends with Debug off; verify kill-switch
4.3 Telemetry event spec (schema template)
event: delivery_result
version: 3
purpose: reliability_metrics
grounds: legitimate_interests  # LIA-123
sampling: 0.01
retention_days: 30
fields:
  - name: sender_id
    type: string
    pii: pseudonymous
    join_with: forbidden  # never with recipient_id
  - name: result
    type: enum { success, timeout, error }
    pii: none
  - name: latency_ms
    type: int
    pii: none
kill_switch: events.delivery.*
notes: "No content, no phone numbers, no emails"
Lint rules (house style): dynamic values default to private; redaction at call-site. No event may include both sides of a conversation. Every event needs sampling, retention_days, and a kill_switch pattern.
4.4 "Which store when?" cheat sheet
| Data shape | Where it lives | Why |
|---|---|---|
| Secrets/credentials (keys, tokens) | Keychain/Keystore (hardware-bound if available) | Non-exportable, access control, right unlock semantics |
| Records (metadata, receipts) | Encrypted DB (Core Data/Room or encrypted file store) | Queryable, sync-friendly, rotation without key loss |
| Derived/temporary (thumbnails, features) | Ephemeral cache with TTL, not backed up | Safe to drop, cheap to rebuild |
Backups/snapshots: exclude caches from backup; secure the app-switcher snapshot; avoid pasteboard leaks.
4.5 Network belt (interceptors → queue → transport)
- Interceptors apply redaction, idempotency keys, and write to an encrypted queue.
- Queue has TTL and max items; stale items drop instead of "eventually sending" (see the sketch after this list).
- Transport enforces TLS, pinning to SPKI/intermediate, and retry with back-off + jitter.
- Debug roots/proxies never ship in Release.
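A sketch of the queue's drop policy (in-memory for brevity; a production queue would be encrypted at rest and persisted):

```kotlin
import java.time.Duration
import java.time.Instant

// Hypothetical offline queue: hard caps on age and size, enforced at enqueue
// and drain time. Anything past TTL is dropped, never shipped late.
data class QueueItem(val payload: ByteArray, val enqueuedAt: Instant)

class OfflineQueue(
    private val maxAge: Duration = Duration.ofDays(7),
    private val maxItems: Int = 500,
) {
    private val items = ArrayDeque<QueueItem>()

    fun enqueue(item: QueueItem) {
        if (items.size >= maxItems) items.removeFirst() // oldest dropped first
        items.addLast(item)
    }

    fun drain(now: Instant = Instant.now()): List<QueueItem> {
        val fresh = items.filter { Duration.between(it.enqueuedAt, now) <= maxAge }
        items.clear()
        return fresh // stale items are silently discarded
    }
}
```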
Pin rotation playbook (one paragraph): ship A+B → confirm adoption → ship B+C → remove A → later remove B.
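With OkHttp, that playbook maps onto overlapping pinsets; a sketch, with placeholder host and pin hashes:

```kotlin
import okhttp3.CertificatePinner
import okhttp3.OkHttpClient

// Release N ships pins A+B; once adoption is confirmed, release N+1 ships
// B+C and a later release drops B. No single-step swaps.
val pinner = CertificatePinner.Builder()
    .add("api.example.com", "sha256/AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA=") // pin A: current intermediate
    .add("api.example.com", "sha256/BBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBB=") // pin B: next intermediate
    .build()

val pinnedClient: OkHttpClient = OkHttpClient.Builder()
    .certificatePinner(pinner)
    .build()
```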
4.6 Deletion orchestrator (client wipe)
**Triggers:** logout, account deletion, DSAR erase
**Steps:**
1) Cancel background tasks; freeze queues.
2) Wipe keychain/keystore entries (access group aware).
3) Drop encrypted DB / tables that hold personal data.
4) Clear caches (files, image stores); secure snapshot artefacts.
5) Clear WebView storage and cookies.
6) Drop queued telemetry older than TTL.
7) Broadcast "data_erased" to modules; clear in-memory caches.
8) Verify empty via quick audit hooks; resume with a clean state.
4.7 Release checklist (privacy)
- [ ] App Store privacy details reflect reality (incl. SDKs).
- [ ] Play "Data safety" + privacy policy updated from the data dictionary.
- [ ] No debug trust roots or proxy bypass in Release.
- [ ] Pin rotation overlap active; rollback plan tested.
- [ ] Kill-switch toggle observed in prod canary within <10 min.
- [ ] Deletion orchestrator e2e tested (local wipe + server).
- [ ] Snapshot test passes; logs show only redacted data.
- [ ] SDK intake cards reviewed; toggles/consent verified.
4.8 DSAR (access/erasure) quick script (client responsibilities)
**Access:** fetch userâvisible artefacts (profile, devices, settings). Do not resurrect deleted caches while viewing exports.
**Erasure:** run deletion orchestrator; confirm server ack; show status to the user; ensure queues don't send post-erasure.
**Edge cases:** offline device at deletion time; work profile; app re-install. Document behaviour for each.
4.9 "Ten smells" that trigger a privacy review
- New SDK added without an intake card.
- Leaf-cert pinning and a calendar reminder to rotate.
- Event schema missing retention_days.
- Event joins both ends of a private interaction.
- Secrets appear in an app-group container or DB.
- "Temporary" files that end up in backups.
- Debug trust roots in a Release artefact.
- Consent buried in Settings.
- Crash reports contain request/response bodies.
- Deletion means "delete server row" only.
Collect less, move less, keep less, and delete on time. Guard rails in the client beat reminders in docs. Push every change through the same belt: consent, redaction, budgets, storage choice, network gate, deletion. If that sounds boring, good: boring privacy is exactly what we want.