Document fraud detection is one of the growing applications of AI in mortgage operations.
In this issue, I’ll show you how you can apply AI fraud detection to your mortgage operations.
I designed a solution to demonstrate how to automate the detecting of fake mortgage documents and fraud using one of the emerging AI fraud detection tech providers.
This solution design consists of the 3 parts:
I hope it will give you enough insight into AI fraud detection to gauge if it is something that can benefit your mortgage operations and how to apply it if it is.
It is easy to get caught up in all the shiny innovations and waste resources on a tech that might not add value to the company.
So, before diving into a solution, I think it’s crucial to understand what problems in mortgage operations we’re trying to solve.
Below, you can find my analysis of the following:
Mortgage lenders receive thousands of loan applications.
They make lending decisions based on the documents representing applicants' employment, income, assets, and debts.
The job to be done is to ensure that documents provided by the borrower are authentic to prevent fraudsters from getting loans they don’t qualify for.
The document types that we need to identify the authenticity of:
Mortgage documents are highly susceptible to fraud. Fraudsters can inflate their bank balance and hide evidence of bad spending to access larger loan amounts.
When loans are given to those who don't qualify, they possess a significantly higher risk of defaults and buybacks.
So, if a lender or broker is negligent in preventing document fraud, it results in the following:
Like most processes worth automating, there’s already a manual solution in place to get the job done.
From my research, the old way of mortgage fraud detection works like this:
The manual document review process has several limitations:
These limitations lead to significant operational challenges, including:
The new way of fraud detection is to leverage AI to automate the process.
AI fraud detection products run 100s of checks on every document. And in seconds, they give you a risk score and reasoning for the score.
It’s hard to say what exactly checks the fraud detection tech runs as each tech provider might have different approaches.
But there are common layers of checks that I think almost every fraud detection product relies on:
AI document tampering detection uses computer vision to identify forged pixels in documents that might be invisible to the human eye.
ML fraud detection relies on extracting data from the documents and running it through models trained on historical fraud data to make predictions.
3d party validation relies on cross-validating details extracted from the documents with data from their databases and from 3d party providers.
AI fraud detection unlocks a few new capabilities that weren't possible with the old approach:
These new capabilities drive positive business outcomes like:
With WHY we’re doing what we’re doing out of the way, let’s dive into HOW to do it.
Below is an overview of the automated mortgage fraud detection solution using AI.
Each section below has a more detailed dive into each solution node.
The first problem to solve is that our fraud detection system has no access to documents.
Since we have no automation without documents, the first step of the solution is pulling files into our system.
You can usually pull all the documents you need via API from LOS or POS.
Sometimes, you might need to capture documents from an Email inbox, DropBox folder, or FTP server folder.
To process documents, our systems need:
The problem is that we obtain Files from upstream integration. And Files aren’t the same thing as Documents.
It’s common for the file to contain multiple documents within.
The solution is to run files through a document classification system that will provide a list of classified single-document files ready for processing.
You can find more info about document classification in this post I wrote.
Though fraud detection with AI is much more cost-effective than manual review, it still costs money.
Yeah, that sucks, I know.
Since most of us don’t have unlimited money, we need to be mindful and not throw whatever documents we’ve got into the fraud detection node.
So, using the data from document classification, we’ll route for processing only documents that are most susceptible to fraud.
That’s the part where all the magic happens.
In this step, we feed the routed documents into the AI fraud detection tech (Resistant.ai, in my case).
Note: You can find more about my product choice for this solution in the section below.
Interaction through the API is pretty straightforward:
Resistant.ai runs 500+ checks in under 30 seconds and provides a Risk score and Indicators as an output.
Risk score reflects where the document sits on the scale from Trusted to High Risk. Risk sore value might be one of these 4: Trusted, Normal, Warning, High Risk.
Indicators contain the reasoning of the system for the score it set. Some indicators may be positive, while others may be negative.
Then, the Risk score and Indicators are fed into the next node of the solution.
The output from Resistant.ai is a document risk score and the reasoning for this score.
Humans are still the ones who make the decision whether the document is authentic or fake.
The vast majority of the documents going to be authentic. So, it might make sense to automate decisions for clear-cut cases to save time.
In my solution, I automatically label a document as authentic if it has a Trusted or Normal score and as fraud, if it has a score equal to High Risk.
If the document has a score equal to Warning, I escalate the case and forward the document for further human investigation.
Once a document is flagged for investigation, someone must review it and decide whether it is authentic or fraudulent.
We display the indicators contributing to the document's score to reduce the time spent on the investigation.
Reasons to trust the documents and reasons why it might be a fraud.
Most of the tech providers have their own UI for this purpose.
I think that in most cases, the information provided by Indicators should be enough to make a decision without being an expert in document fraud detection.
After either an automated decision or a case investigation, we know whether the document is fake or not.
The last step in the solution is to let downstream systems know this information so they can act accordingly.
It can be adding a tag to a file within a LOS, creating a ticket, or sending an automatic notification to a relevant person.
I designed this solution around an AI fraud detection product. Below is my analysis of the tech providers available on the market and my preferred choice for this solution.
You always have the option to build your fraud detection tech using low-level infrastructure providers like AWS, GoogleCloude, or Azure. Or buy one from the product company.
In my opinion, in most cases, it doesn’t make sense for mortgage lenders to build custom fraud detection tech.
It is a highly sophisticated piece of software. It will cost more, take more time, and have lower accuracy than buying from a tech provider.
So, when selecting tech providers for this solution, I only focused on those offering complete AI fraud detection products.
Here’s the most relevant AI fraud detection tech for mortgage lenders that I came across when doing research for this solution:
I think any of the 4 products listed above will be able to meet your AI fraud detection needs.
When designing this solution, my eye fell on the Document Forensics product from Resistant.ai.
Though I haven’t been on a call with them, from reviewing their website and documentation, these are the things that contributed to my decision:
I hope this post helped you understand how to use AI to automate mortgage document fraud detection.
If you’d like to stay on top of the latest mortgage tech and how it can be applied to mortgage operations, consider joining our mortgage technology newsletter.