You took over an existing SQL Server
The server works, but the team is still unsure which jobs matter, how restores are tested, and which old settings are safe to leave alone.
Services / SQL Server health audit
I review SQL Server environments before an upgrade, handover, audit, or production issue exposes problems nobody has checked.
The audit checks backups, restores, SQL Agent jobs, monitoring, configuration, tempdb, security, HA or DR, and recent performance symptoms.
Input
Version, topology, instance list, recent issues, and whatever details already exist.
Review
Backups, restores, SQL Agent, monitoring, configuration, tempdb, security, HA, and recent SQL Server symptoms.
Output
Prioritized findings, supporting details, what to fix first, and a cleaner split between internal work and follow-up help.
Fit
The audit is for environments that are quiet enough to review, but unclear enough that the next incident, upgrade, audit, or handover would be harder than it should be.
If one narrow symptom is already the whole problem, use a narrower service. If several parts of the environment need review, start here.
The server works, but the team is still unsure which jobs matter, how restores are tested, and which old settings are safe to leave alone.
Use the audit before an upgrade, migration, handover, client review, or external audit, when the team needs a clearer view of the SQL Server setup.
Backups run, SQL Agent jobs finish, and monitoring exists, but nobody can explain the restore path or the first response during an incident.
Maintenance quality, monitoring gaps, restore testing, access, and old configuration choices are starting to affect the same SQL Server.
Review
The review follows the environment, not a generic scorecard. The goal is to find the risks that would matter during pressure or change.
Map
Instance count, critical databases, dependencies, owners, support contacts, and the parts of the setup nobody has checked recently.
Recovery
Backup coverage, retention, restore tests, recovery timing, integrity checks, and whether the restore plan is tested or mostly assumed.
Maintenance
Job purpose, schedules, failure handling, integrity checks, index and statistics routines, cleanup, and maintenance windows that no longer fit.
Monitoring
Whether monitoring explains incidents, whether alerts reach the right people, and whether the team has useful history instead of noise.
Platform
Memory, MAXDOP, tempdb layout, file growth, storage pressure, log behavior, and default settings that may no longer fit the workload.
Security
Sysadmin members, service accounts, SQL Agent proxies, linked servers, risky enabled features, patch level, and production access habits.
Resilience
Always On, clustering, failover behavior, replica gaps, jobs and logins after failover, runbooks, and claims that still need testing.
Pressure
Waits, blocking, deadlocks, tempdb pressure, and workload timing as health checks, without turning the audit into a narrow tuning job.
Technical checks
A health audit should show what has actually been checked. The exact list depends on the environment, but these are the usual SQL Server areas I expect to review.
Output
You get a short findings summary, the details behind the important points, and an order of work the company can actually use.
The output explains what matters operationally instead of handing over a generic best-practice score.
Important findings point back to the job history, configuration, monitoring data, restore testing, or SQL Server gap behind them.
Immediate, scheduled, follow-on, and watch items are separated so the team does not turn the audit into another parked list.
The review makes it clearer what the internal team can handle and where deeper outside help is worth using.
Process
The first message only needs enough context to decide whether this is the right shape of work.
Scope depends on instance count, access, urgency, and how much useful detail already exists.
Step
Send the SQL Server version, topology, rough urgency, known concerns, and any details already available. It does not need to be pretty.
Step
I check the areas that matter for the environment: recovery, maintenance, monitoring, responsibility, configuration, security, HA, and pressure symptoms.
Step
The discussion separates real risk from cosmetic mess. Some old ugly things can wait. Some quiet assumptions should not.
Step
You get the findings, the details behind them, and the order of work: what to fix now, schedule later, investigate next, or simply watch.
Proof and reading
Related
A practical checklist for ownership, backups, maintenance, monitoring, configuration, and what to fix first.
Open checklist
Related
What to check when an existing SQL Server environment needs a broader review.
Open guide
Related
How to review SQL Agent jobs, CHECKDB, backups, cleanup, index work, and schedule overlap.
Open guide
Fit check
Start with the performance review or send the urgent production problem directly.
See performance review
Use recovery readiness when the main question is backup, restore, failover, runbooks, or recovery timing.
See recovery readiness
Use upgrade support when the core work is version change, compatibility, test run, rollback, and validation.
See upgrade support
Use the consulting parent page when the shape is still unclear or crosses several service types.
See SQL Server consulting
Before you send it
Send the rough situation, not a polished project brief. Version, topology, urgency, and the main concern are enough to start.
If there are existing job outputs, restore notes, monitoring screenshots, error logs, or incident notes, include them. Evidence beats guessing.
The first useful decision is whether the audit is the right shape of work or whether the problem is already narrower.
I will look at the context and come back with the sane next step. Sometimes that is a full audit. Sometimes it is a narrower review.
FAQ
The audit reviews the environment map, responsibility, backups, restore tests, maintenance jobs, monitoring, configuration, tempdb, security and admin access, HA or DR assumptions where relevant, and the SQL Server symptoms that show where the setup needs attention.
Yes, but I use audit language because the useful output is not just a checklist. The point is to reduce uncertainty, explain the findings, and give the company a clear list of what to fix first.
Not always. Some work can start from exports, scripts, screenshots, monitoring history, job output, and restore notes. Direct read-only access may make the review faster when the environment is messy.
Send the SQL Server version, topology, rough instance count, main concern, recent incidents, and any existing details around jobs, backups, monitoring, errors, waits, or restore tests.
Yes. Remote delivery is the default when the company can share enough context, details, and access for a proper review.
Scope depends on instance count, access, urgency, and how much useful detail already exists. I do not need a polished brief for the first message.
You get findings and a practical list of what to fix first. Some items can usually be handled internally. Others may need focused follow-up work around performance, recovery, upgrades, monitoring, or cleanup.