DG Developing Open Source AI Tool to Help with the Fight against Corruption
Development Gateway: An IREX Venture (DG) is pleased to announce that—with support from Accountability Lab (AL)—we’re developing an open-source AI tool that we hope to register as a Digital Public Good and can be used by those fighting corruption as they develop innovative digital tools. The need for the tool was identified in DG’s work with AL’s HackCorruption initiative.
Once the tool is developed, users will be able to extract data from various types of documents. Having an open source tool for this type of work is essential to anti-corruption efforts because a lack of machine-readable data that cannot be easily aggregated adds to the opacity that can hide corruption. This tool will increase accessibility of and transparency around hard-to-access information, which will advance accountability.
HackCorruption is a series of events and policy work led by AL and aimed at supporting innovative solutions to identify and combat corruption. HackCorruption is supported by the Bureau of International Narcotics and Law Enforcement in the U.S. Department of State and the USAID Countering Transnational Corruption Grand Challenge for Development, in partnership with the Center for International Private Enterprise and DG.
Why develop this tool?
The need for this tool was identified as part of DG’s work with HackCorruption regional hackathon teams. Several teams that participated in the South America hackathon in April 2023 discovered that they were limited in the work they could do by not having access to free tools to scrape information from documents, including government contracts. Having access to this sort of information allows anti-corruption tools to aggregate, monitor, and identify potential corruption.
Acting as a mentor to several of these teams, DGer Gabriel Inchauspe knew such AI tools exist, but none of them are open source and therefore, anti-corruption workers are limited in accessing them. Inchauspe and DGer Kelley Sams then collaborated with AL to obtain funding and define the scope of a project to create such a tool, which will support and ease the administrative burden faced by anti-corruption workers.
What will the tool do?
The AI tool—a beta version of which DG plans to launch by the end of 2024—will be based on publicly available source code and will be trained on ethically sourced data, with multiple rounds of testing done to ensure that the tool is statistically unlikely to produce non-existent or inaccurate information (i.e., hallucinate).
In the first phase of developing the tool, it will be able to capture information from documents in English that are in a standardized format (e.g., pdf, JSON, Excel, OpenContracting formatting, etc.). A planned second phase is to expand the tool to be able to extract information from documents or photo files that are handwritten and/or contain languages other than English, including those using non-Roman alphabets.
In addition to our long history of creating and implementing technical solutions to advance transparency, DG is uniquely positioned to develop this tool given our previous success in ensuring digital solutions are used and supported for years after development. After all, many tools in the anti-corruption space exist or are being created, but fewer are used and scaled. As we develop this tool, DG is committed to ensuring the tool’s design and usability are responsive to anti-corruption workers’ needs and ultimately, strengthen anti-corruption and accountability efforts.
Stay tuned for more as the tool is developed!
Share
Recent Posts
Diving into the DaYTA Program’s Data Collection Process
This blog explores key insights from the DaYTA program, offering practical guidance for researchers on effective data collection, overcoming field challenges, and leveraging local partnerships to enhance tobacco control efforts. This piece is especially timely following DaYTA’s workshop convening all 3 study country stakeholders to review the survey results and strategize on how best to disseminate this data to target audiences. This workshop took place from in Lagos, Nigeria, from November 18-20th.
Demystifying interoperability: Key takeaways from our new white paper
This blog post gives an overview on our latest paper on interoperability, implementing interoperable solutions in partnership with public administrations. Based on over 20 years of DG’s experience, the paper demystifies key components needed to build robust, resilient, and interoperable data systems, focusing on the “how” of data standardization, data governance, and implementing technical infrastructure.
More Smoke, More Stroke
In honor of this year’s World Stroke Day, observed annually on October 29th, this piece aims to raise awareness of the substantial burden of non-communicable diseases–particularly stroke incidents–using the case study of Nigeria, one of the main tobacco production hubs on the continent, in addition to Kenya.