When an institution is seeing rating inconsistency, exception clustering, or examiner criticism of underwriting standards, the root cause almost always comes down to one of three things: weak decision governance, degraded data infrastructure, or an exception management framework that counts deviations without ever analyzing them.
These gaps tend to travel together. Weak governance lets officers apply policy selectively. Poor data means override rationale cannot be verified after the fact. An exception system that tallies up deviations but never breaks them down by officer, branch, or product creates a false sense of control. When all three gaps exist at once, portfolio drift moves faster than any annual review will catch.
Discretion is not the problem. Unguarded discretion is.
There is a persistent belief in lending culture that experienced officers exercising judgment produce better outcomes than structured rules. That belief is not wrong in principle. The Bank for International Settlements (BIS) identifies “sound, well-defined credit-granting criteria” as a foundational pillar of sound credit management, not the elimination of human judgment.⁵ The OCC similarly acknowledges that exception loans are often acceptable risks and should not be criticized just because they are exceptions.⁶
What the research shows, though, is that discretion without structure produces systematic drift, not random noise. In a study covering 242,011 retail loan applications across more than 1,000 branches, discretionary scoring manipulation was not confined to a handful of outliers. It was widespread. A one-standard-deviation increase in those manipulative scoring trials raised default rates by 0.3 to 0.4 percentage points relative to baseline, a 12–16% relative increase, with a profitability hit of 1.5 percentage points of ROE.¹ The individual officers were not all making reckless calls. The system just had no way to see or govern the aggregate pattern.
Gartner’s banking risk research has found that decision variability strongly correlates with portfolio instability.⁷ The mechanism is straightforward: inconsistent originations contaminate every downstream signal. When credit grades reflect the originator more than the borrower, the migration picture stops being reliable. When exception rates cluster in certain branches without triggering any alert, reserve models get built on inputs that no longer reflect actual risk.
The OCC puts it plainly: “when aggregated, even well-mitigated underwriting exceptions can significantly increase portfolio risk,” and financial institutions should track exception trends by department, loan officer, and over time, comparing the performance of exception loans against loans made within policy.⁶
Most institutions have exception tracking. Far fewer have exception analysis. Tracking tells you how many exceptions occurred. Analysis tells you which officers are generating them at rates well above the branch average, which product lines cluster during high-volume periods, and whether exceptions in specific geographies share common repayment risk factors.
The OCC’s 2023 consent order against United Fidelity Bank shows what happens when that analysis is missing. The order cited unsafe or unsound practices not just in underwriting and credit administration, but also in capital planning, stress testing, concentration risk management, ACL methodology, data management, and internal controls.⁸ That is not a coincidence. It is what happens when origination-level inconsistency propagates upward through every function that depends on rating reliability. The 2020 OCC action against Gateway Bank told a similar story: weak board and management supervision, inadequate internal audit, and failure to adhere to prudent loan administration.⁹ By the time those institutions were defending their practices in an exam, the origination-level problem had long since become an enterprise-level one.
The leading indicators are available well before losses show up: exception rates by officer, branch, and product; overrides clustering near score cutoffs; documentation exception trends; watch-list inflow velocity. FDIC research found that current underwriting-risk assessments improved prediction of asset-quality and CAMELS deterioration over the following calendar year.¹⁰ Acting on those signals requires infrastructure that surfaces them.
The Federal Reserve’s reliability threshold for internal rating systems is both specific and consequential. When examiner or internal loan-review downgrades reach 10% of loans reviewed or 5% of dollars reviewed, supervisors may treat the rating system as unreliable and direct the institution to reassess ACL and capital adequacy.² Under CECL, where reserve models are built on historical loss experience and current origination quality, grading drift in recent vintages flows directly into forecast error. At that point, a rating problem has become a capital planning problem built on a false baseline.
BIS is clear on this: internal ratings should be consistent, independently confirmed, and integrated into credit-risk analysis and capital adequacy assessment.⁵ When they are not, the portfolio on paper stops matching the portfolio that was actually built. Portfolio decay rarely starts with charge-offs. It starts with noisier grades, hidden concentrations, and delayed problem-loan recognition. By the time an institution is defending rating integrity in an exam, the governance failure happened long before.
High-performing financial institutions are not replacing credit officers with automation. They are building a governance layer that enforces policy consistently at every origination point, surfaces exception patterns in near real time, and routes non-standard cases to experienced officers with better information, not fewer guardrails.
McKinsey’s documented case study of a leading European financial institution is instructive. After redesigning its commercial lending workflow with automation and analytics, time-to-yes fell from 24–48 hours to four minutes for standard applications and origination costs dropped 30–40%. Roughly 40% of applications moved end-to-end without manual intervention. More complex cases were routed to credit officers with richer analytical support, improving portfolio-level oversight rather than cutting experienced judgment out of the loop.³
That architecture does something additional manual review layers simply cannot: it catches the exception at the moment it occurs, records the justification in a standardized format, and aggregates patterns by officer and branch in real time. It is worth noting that the OCC warns a lack of exceptions may signal a loan policy that is too general to set clear underwriting limits.⁶ The goal is not zero exceptions. It is controlled, visible, and analyzable exceptions. Structured decisioning delivers that.
Make Credit Decisions Governable
Portfolio performance does not drift by accident — it drifts when decisioning lacks structure, visibility, and accountability. The institutions that lead are not eliminating judgment; they are making it consistent, measurable, and aligned to policy at every origination point. That is what builds examiner confidence, strengthens reserve reliability, and creates portfolios that perform the way they are expected to. Baker Hill helps financial institutions bring discipline to credit decisioning — so outcomes are driven by design, not left to chance.
Because in credit, consistency isn’t control — it’s performance.
Baker Hill is the leading provider of lending technology for banks and credit unions across the United States. Each month, financial institutions use Baker Hill’s platform to process over $7 billion in loan originations. With trusted fintech innovation, AI-enabled automation, and deep banking expertise, we help institutions Lend Better, Lend Faster, and Lend More™.
We’re more than a platform. We’re your partner in building stronger communities through smarter banking.
Ready to build a faster, smarter SMB lending process?