Intelligence Layer Strategy
Before committing to proprietary model development, validate that the use case cannot be solved adequately with off-the-shelf capabilities, and define the threshold that would justify investing in a custom build.
Method
Most AI use cases in financial services can be addressed with commodity models and prompt engineering. This process establishes whether a use case is truly in that category — or whether proprietary investment is warranted.
Define the task precisely: input, required output, quality threshold, latency requirement, and acceptable error rate. Ambiguous use cases cannot be evaluated fairly against any model.
Run the use case against available commodity models with baseline prompting. Measure output quality against the defined threshold using representative real-world inputs.
Attempt to close the quality gap through structured prompting, few-shot examples, and chain-of-thought techniques. Establish the practical ceiling achievable without model customization.
Quantify the remaining quality gap and assess whether it represents a material business risk — or an acceptable trade-off given the time and cost of custom builds.
Make a documented decision: deploy the commodity solution, continue with prompt optimization, or escalate to proprietary model investment with a defined justification.
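As an illustration only, the five steps above can be sketched as a small evaluation harness. Everything named here (`UseCaseSpec`, `BenchmarkResult`, `summarize`, `decide`, the `pass_score` cut-off, and the `min_gain` stopping rule) is a hypothetical assumption for the sketch, not part of the method itself. Scores are assumed to be normalized to the range 0 to 1 by whatever grading approach the use case calls for.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Sequence, Tuple


@dataclass(frozen=True)
class UseCaseSpec:
    """Step 1: the task definition (hypothetical fields, illustrative only)."""
    quality_threshold: float   # minimum acceptable mean score, 0..1
    max_error_rate: float      # acceptable share of failing outputs, 0..1
    max_latency_s: float       # latency requirement per request, in seconds
    pass_score: float = 0.5    # assumed cut-off: a lower score counts as an error


@dataclass(frozen=True)
class BenchmarkResult:
    mean_quality: float
    error_rate: float
    worst_latency_s: float


def summarize(samples: Sequence[Tuple[float, float]], spec: UseCaseSpec) -> BenchmarkResult:
    """Steps 2 and 3: condense (score, latency_s) pairs from one benchmark run.

    Expects at least one sample drawn from representative real-world inputs.
    """
    scores = [score for score, _ in samples]
    latencies = [latency for _, latency in samples]
    errors = sum(1 for score in scores if score < spec.pass_score)
    return BenchmarkResult(mean(scores), errors / len(scores), max(latencies))


def meets_spec(result: BenchmarkResult, spec: UseCaseSpec) -> bool:
    return (
        result.mean_quality >= spec.quality_threshold
        and result.error_rate <= spec.max_error_rate
        and result.worst_latency_s <= spec.max_latency_s
    )


def quality_gap(result: BenchmarkResult, spec: UseCaseSpec) -> float:
    """Step 4: the shortfall against the threshold, never negative."""
    return max(0.0, spec.quality_threshold - result.mean_quality)


def decide(
    spec: UseCaseSpec,
    previous: BenchmarkResult,
    latest: BenchmarkResult,
    gap_is_material: bool,
    min_gain: float = 0.02,
) -> str:
    """Step 5: map the evidence to one of the three documented decisions.

    gap_is_material is a business judgment supplied by people, not computed.
    min_gain is an assumed rule of thumb: if the latest optimization round
    still improved mean quality by at least this much, the practical ceiling
    has probably not been reached yet.
    """
    if meets_spec(latest, spec) or not gap_is_material:
        return "deploy_commodity"
    if latest.mean_quality - previous.mean_quality >= min_gain:
        return "continue_prompt_optimization"
    return "escalate_to_proprietary"
```

In use, `summarize` would be called once on the baseline run and once per optimization round, with `decide` comparing the two most recent results. The returned label is only the headline; the benchmark figures and the reasoning behind `gap_is_material` are what make the decision documented.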
Outputs
Benchmark results for commodity models against the defined quality threshold for the use case.
Documentation of optimization attempts, results, and the practical ceiling achieved.
Documented decision with supporting evidence for proceeding with commodity or escalating to custom.
If proceeding with commodity: configuration, monitoring plan, and escalation triggers.
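One way an escalation trigger in the monitoring plan might look is a rolling quality check on production outputs. This is a minimal sketch under stated assumptions: the class name `EscalationMonitor`, the window size, and the minimum sample count are all hypothetical, and it assumes production outputs can be scored on the same 0 to 1 scale used during benchmarking.

```python
from collections import deque


class EscalationMonitor:
    """Fires when rolling mean quality drops below the agreed threshold.

    Hypothetical sketch: window and min_samples are illustrative defaults,
    not recommended values.
    """

    def __init__(self, quality_threshold: float, window: int = 100, min_samples: int = 20):
        self.quality_threshold = quality_threshold
        self.min_samples = min_samples
        # A bounded deque discards the oldest score once the window is full.
        self._scores = deque(maxlen=window)

    def record(self, score: float) -> bool:
        """Store one scored output; return True if the trigger fires."""
        self._scores.append(score)
        if len(self._scores) < self.min_samples:
            return False  # too little evidence to act on yet
        rolling_mean = sum(self._scores) / len(self._scores)
        return rolling_mean < self.quality_threshold
```

A fired trigger would reopen the commodity-versus-custom decision rather than escalate automatically; the threshold itself should be the one defined when the use case was framed.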
Engagement Cadence
Output: a clear, evidence-based decision on whether commodity models are sufficient — and if not, exactly what the custom build would need to achieve to be worth the investment.