Financial Inclusion

    Alternative Credit Scoring for India's Invisible Economy: How OpenCredit Works

    450 million Indians are credit-invisible despite consistent economic activity. OpenCredit uses mobile recharge, utility payments and business flows to deliver fair, open-source credit scoring — achieving 40% higher approvals.

    OpenCredit TeamJan 5, 20268 min read

    Alternative Credit Scoring for India's Invisible Economy: How OpenCredit Works

    An estimated 450 million Indians are effectively invisible to the formal credit system. They earn income, pay bills, run businesses, and meet financial obligations consistently — but because none of this activity appears in a bureau like CIBIL or Experian India, they are classified as NTC (new-to-credit) or thin-file, and their loan applications are declined before they are meaningfully assessed.

    This is not a niche problem. It is the central obstacle to financial inclusion for the world's most populous country — and it has a technical solution.

    OpenCredit is Setient's open-source credit scoring system designed specifically for thin-file and NTC applicants. It uses alternative data sources — mobile recharge patterns, utility payment histories, business transaction flows — to build credit scores that are fair, explainable, and as predictive as traditional bureau scores for the populations they cover.

    The Scale of India's Credit Invisibility Problem

    India's formal credit infrastructure covers approximately 300 million people. For the remaining 450 million credit-invisible adults, the options are limited: informal moneylenders charging 30–60% annual interest, geographically constrained microfinance institutions, or no credit access at all.

    The irony is that many of these individuals are creditworthy by any reasonable measure. A vegetable trader who has maintained her mobile prepaid plan for eight years, paid her electricity bill within three days of issue for six years, and maintained consistent positive cash flows in her UPI-linked business account is not a credit risk. She is an underserved customer.

    The problem is not her creditworthiness. It is the measurement system.

    Traditional credit scoring was built for salaried employees with bank accounts, credit cards, and prior loan history. It is a measurement instrument designed for a population it covers well — and structurally blind to everyone else.

    What Is Alternative Credit Scoring?

    Alternative credit scoring uses data sources beyond traditional credit bureau history to assess creditworthiness. Where traditional scoring models rely on prior loan repayment history, EMI track records, and credit card usage patterns, alternative scoring models draw from:

    Mobile and Telecom Data

    Prepaid recharge behaviour is a surprisingly powerful credit signal. Consistent, regular top-ups — particularly when maintained during financially stressed periods — indicate financial discipline and planning. Irregular or lapsing patterns suggest cash flow volatility.

    Key signals include:

    • Recharge frequency and regularity over 12–24 month windows
    • Recharge value consistency and trend direction
    • Gap analysis: missed recharges and recovery patterns
    • ARPU (average revenue per user) trends relative to regional benchmarks

    Utility Payment History

    Electricity, water, and gas bills represent fixed obligations that creditworthy individuals prioritise. Payment timing — how many days before or after the due date — is a direct measure of obligation management discipline.

    In India, where utilities are increasingly billed digitally, this data is accessible at scale and validated against actual payment outcomes. For MSME applicants, commercial utility accounts provide an additional dimension of business viability assessment.

    Business Transaction Flows

    For self-employed individuals and MSME operators, UPI and digital banking transaction flows provide a window into business health that no bureau score captures. Revenue consistency, seasonal patterns, supplier payment behaviour, and working capital cycles all contribute to a richer picture of repayment capacity than a thin-file bureau report.

    OpenCredit's MSME module integrates with bank statement APIs via the Account Aggregator framework to extract structured business signals from transaction data — with explicit user consent at each step.

    Behavioural and Psychometric Signals

    For populations where digital footprint is limited, behavioural assessment instruments — adapted for Indian literacy contexts and validated against actual repayment outcomes — contribute a small but meaningful signal to the ensemble model.

    The Technical Architecture of OpenCredit

    Feature Engineering Pipeline

    OpenCredit's feature engineering transforms raw alternative data into structured credit signals. The pipeline handles:

    • Time-series regularisation: Converting irregular transaction timestamps into consistent periodic features
    • Recency weighting: More recent behaviour receives higher weight, with configurable decay functions per signal type
    • Signal normalisation: Adjusting for geographic, demographic, and seasonal variation in base rates
    • Missing data handling: Principled imputation strategies for applicants with partial data availability across signal categories

    Ensemble Model Architecture

    The scoring model uses a gradient-boosting ensemble that combines:

    1. Bureau component: Traditional CIBIL/Experian signals where available, with full weight for bureau-positive applicants and zero weight for NTC applicants
    2. Alternative data component: The primary scoring layer for thin-file and NTC applicants
    3. Behavioural component: Psychometric and digital behaviour signals for low-footprint applicants

    The ensemble produces a score on a 300–900 scale consistent with CIBIL conventions, enabling direct comparison and integration with existing lending workflows without systems changes for partner lenders.

    Explainability by Design

    Every OpenCredit score can be traced to specific contributing factors and their weights. This is a core design requirement, not an optional feature. When a lender declines a loan using OpenCredit scores, the applicant can receive a clear explanation of which factors drove the decision and what could be improved.

    This satisfies both RBI's 2023 guidelines on explainable AI in digital lending and the practical trust requirements of extending credit to populations that have historically been excluded by opaque systems.

    Bias Detection and Fairness Monitoring

    OpenCredit includes continuous monitoring for disparate impact across protected characteristics:

    • Geographic bias: Urban/rural approval rate disparities relative to risk-adjusted baselines
    • Gender bias: Score distributions and approval rates across gender categories
    • Caste proxy bias: Correlated features that may act as proxies for protected characteristics
    • Religion-correlated signal bias: Features that may carry demographic correlation beyond creditworthiness signal

    Where disparate impact is detected above configurable thresholds, the monitoring framework flags the affected feature and triggers retraining review. Results are published in the public audit trail with each model release.

    Results: What Alternative Credit Scoring Achieves in Practice

    Working with partner NBFCs and fintech lenders across three deployment cohorts in Maharashtra, Karnataka, and Rajasthan:

    MetricTraditional ScoringOpenCredit Alternative
    NTC applicant approval rate12–18%47–55%
    90-day default rate (approved loans)2.1%2.4%
    Average loan size₹85,000₹62,000
    Time to decision3–5 days4–8 hours
    Explainability coverage95%100%

    The 40% higher approval rate comes at a modest 0.3 percentage point increase in default rate — a trade-off that most lenders find acceptable given the expanded addressable market, reduced customer acquisition cost, and the alignment with RBI's financial inclusion directives.

    Why Open Source?

    Credit scoring has historically operated as a closed system. Algorithms that determine who gets financial access — and at what cost — are proprietary secrets, unauditable by the communities they affect, and slow to adapt to new evidence.

    OpenCredit takes the opposite approach. The algorithm is Apache 2.0 licensed. The methodology is documented. Training data distribution, feature importance rankings, and bias audit results are published with each model release.

    This transparency serves three purposes:

    Trust: Communities that have been historically excluded by credit systems cannot be asked to simply trust a new algorithm they cannot see. Auditability is a prerequisite for trust, particularly in populations with justified historical scepticism of institutional finance.

    Improvement: Researchers, NBFCs, fintechs, and regulators can identify errors, suggest improvements, and contribute to the model. Proprietary systems improve in private; open systems improve in public, and faster.

    Accountability: When the model makes a mistake — and all models make mistakes — the error is visible, traceable, and correctable. There is no black box to hide behind when a lender faces questions about its decisioning methodology.

    How to Get Involved

    OpenCredit is a community project. We welcome involvement across the ecosystem:

    • NBFCs and banks: Deploy OpenCredit as an alternative scoring layer for NTC applicants — contact us for integration documentation and lender onboarding
    • Fintechs: Integrate via the OpenCredit API for thin-file credit assessment in embedded lending flows
    • Data scientists: Contribute to feature engineering, model architecture, bias monitoring, and regional data partnerships
    • Policy researchers: Use the open methodology for academic research on financial inclusion, credit access, and algorithmic fairness
    • Regulators: The open audit trail is available for regulatory review at any time — no access request required

    Frequently Asked Questions

    Is alternative credit scoring accepted by Indian regulators? Yes. The RBI's 2023 framework for digital lending explicitly permits alternative data in credit assessment, subject to explainability requirements and data sourcing compliance with applicable privacy law. OpenCredit is designed to meet these requirements. Lenders using OpenCredit should conduct their own regulatory assessment for their specific licence category and product structure.

    How does OpenCredit handle data privacy? OpenCredit processes aggregated and anonymised features — not raw transaction data. The scoring model is trained on de-identified datasets. Individual applicant data is not stored beyond the scoring transaction. For the MSME module using bank statement APIs, all data access follows Account Aggregator framework consent requirements, with explicit applicant authorisation for each data category.

    What is the minimum data required to generate an OpenCredit score? A minimum of six months of mobile recharge history or utility payment data can generate a meaningful alternative score. Score predictive accuracy improves significantly with 12 or more months of data and with access to additional signal categories. The model is designed to degrade gracefully when specific data sources are unavailable.

    How does OpenCredit compare to CIBIL scoring? OpenCredit is not a replacement for bureau scores — it is complementary. For applicants with CIBIL scores above 650, traditional scoring is predictive and OpenCredit adds marginal value. For NTC and thin-file applicants where bureau scores are absent or uninformative, OpenCredit is the primary assessment tool. Many lenders use both in tandem.

    Is OpenCredit available for lending outside India? The current model is trained and validated on Indian data and calibrated for Indian regulatory requirements. The open-source architecture is adaptable to other markets — the methodology for alternative credit scoring transfers, but feature engineering and bias monitoring must be rebuilt on local data. Contact us if you are evaluating OpenCredit for other emerging market contexts.

    Learn more about OpenCredit and explore the open-source codebase at /products/opencredit. Financial institutions interested in deployment should contact our team.

    Want to learn more?

    Get in touch to discuss how we can help your organisation.