Dec 3, 2024

Preparing Your Files for AI Success

Note: This is the second post in a series about how financial firms can prepare their internal data for AI search. It is written with a non-technical or semi-technical audience in mind. If you have any questions about the content here, feel free to shoot a note to john@rogo.ai.

Intro

In the first article in this series, we explored why you, as a financial firm, need to “get your files ready” to leverage AI for internal search. For anyone who’s attempted to implement internal search at scale, the reasons we discussed will sound familiar. Here’s a quick recap:

With that foundation covered, let’s dive into the next step: How do you prepare your files for AI search?

Start with the Users

The most critical (and often overlooked) step is starting with your end users. This may sound obvious, but like eating well or exercising to improve mental health, it’s a fundamental truth we often resist.

The way you prepare your file system must align with the specific user workflows you aim to support.

Here’s the good news: You don’t need to interview every single employee. In a 1,000-person firm, talking to 30 users will likely surface recurring themes. Focus your conversations on these three questions:

  1. What use cases are they targeting?

  2. What files do they need? Which files are irrelevant or distracting?

  3. What context do they have when asking a question?

The last question is especially important. Do users know the company name, date, or sector? Are they searching for a specific valuation method? Or are they coming in blind?

This context—the information users supply at query time—is what the AI system uses to filter and refine its search. Your system must be structured to interpret and act on this context effectively.

Identify Key “Swim Lanes”

“Swim lanes” are structured use cases: predictable searches where users can reliably expect strong results. While the goal remains open-ended search across your files, swim lanes help focus your efforts initially.

A different article in this series will cover which use cases are best suited for internal search. For now, here are some illustrative swim lanes:

  1. Industry Credentials

    • Example: What recent IPOs have we done in EdTech?

  2. Pulling Numbers

    • Example: What were our entry EBITDA multiples for these transactions?

  3. Finding Comps

    • Example: What high-growth SaaS companies did we comp XYZ to in last year’s pitch?

  4. Summarizing Content

    • Example: How have we positioned sponsor-owned CPG companies in distress?

  5. Tracking Deal Teams

    • Example: Who was the ECM banker on these pitches?

Once you identify swim lanes, focus on designing workflows that make these searches as seamless as possible.

Map Out Informal Knowledge

For each swim lane, create 5-10 example questions. Then, work with users to document how they would answer those questions manually.

This step uncovers the informal knowledge embedded in your firm. It might explain why a user prefers an EMEA folder for an American company, or why they check the valuation report dated right after a deal closes rather than the latest one.

This is your golden ticket. Documenting these workflows not only highlights your firm’s unique practices but also reveals patterns that can guide your AI system.

Organize and Process Your Files

Now comes the hard part: translating user workflows into a structured, AI-ready file system. For each swim lane, describe how an analyst would answer a question. Then, think about how the AI system can replicate that process.

Here’s a toy example:

  • Swim Lane: Industry Credentials

  • Example Question: What are our recent EdTech IPOs?

  • Workflow:

    1. Pull all deal folders in the sector.

    2. Identify which deals were IPOs.

    3. Cross-check dates for relevance.

    4. Locate tombstone pages and extract IPOs.

    5. Compile results into a centralized list, excluding deals that didn’t close.

From this workflow, it’s clear the AI system will need metadata for sector, deal type, and date. This metadata might come from file content, tags, or folder structures, depending on what’s available.

Address Common Challenges

As you proceed, you’ll likely encounter hurdles. Here are a few common ones and how to tackle them:

Iterate and Refine

With workflows and metadata structures in place, the real work begins: testing and iterating. Start small, gather user feedback, and refine your processes.

Key Takeaways

  • Focus on high-impact use cases first.

  • Embrace feedback to refine workflows.

  • Remember: The goal is progress, not perfection.

In the next article, we’ll delve into the technical details of indexing, tagging, and embedding files, as well as strategies for keeping your system up to date. But for now, the groundwork you’ve laid is the most critical step.

By aligning your file system with user workflows, you’re not just enabling AI search—you’re building the foundation for your firm’s future in data-driven decision-making.

Learn how Rogo can help your firm

Book a demo to get started

Learn how Rogo can help your firm

Book a demo to get started

Learn how Rogo can help your firm

Book a demo to get started