Building a documentary fund: collecting data useful for technological or competitive monitoring

Building a knowledge base is a key step in effective technological or competitive intelligence. Despite the rise of generative AI, only a methodical, tool-based, and multi-source approach can build a reliable knowledge base. TKM shares 20 years of expertise to structure and leverage a relevant, enriched, and strategic knowledge base.

Building a documentary fund: collecting data useful for technological or competitive monitoring

Building a knowledge base is a key step in effective technological or competitive intelligence. Despite the rise of generative AI, only a methodical, tool-based, and multi-source approach can build a reliable knowledge base. TKM shares 20 years of expertise to structure and leverage a relevant, enriched, and strategic knowledge base.

How to build a documentary fund? What a strange question! Especially at a time when chat and generative AI would have us believe that it has never been easier to find useful information to manage your innovation strategy. And yet, the risk of missing out on key information has never been higher. 

And magically generative AIs don't work miracles. Without even mentioning hallucinations, we at TKM find that a search on any of the major models (we test them all, even if we have a preference for Mistral... and that will be the subject of a future article!) only brings back a small portion of the useful information. 

Skeptical? Cécile is a construction engineer working for a large group with which she has already filed six patents. During a recent research project on her favorite topic (adhesive and cement additives), and despite several attempts, only two of her patents have been "resurfaced"... 

As part of another watch, the "cat" brought us into its clutches a scientific article that appeared extremely relevant (with the university, the authors... and even the DOI!). We checked immediately and this article did not exist... 

DOI: acronym for Digital Object Identifier , it is a unique, permanent and verifiable identifier assigned to a digital publication in order to ensure its traceability and long-term access.


Generative AI produces many useful services, and it is integrated into our IPMetrix software suite , but it cannot reasonably be used to power robust and methodical monitoring. 

In this context, carrying out a state of the art , of the competition or monitoring your technological and industrial environment therefore depends above all on your capacity (or that of your team in charge of monitoring) to collect all the information potentially useful for your decisions. 

In other words: to constitute an exhaustive and relevant documentary fund , based on one (or more often several) research strategy. 

Furthermore, thanks to this fund, once the monitoring team has conducted the analyses, R&D or IP (industrial property) analysts will be able to export, comment on, and share a particularly interesting and inspiring patent, thesis, or collaborative project. But for this, it must first be collected. 

The purpose of this article is to share with you some tips and tricks inherited from more than 20 years of experience of TKM in monitoring companies of all sizes (SMEs, ETIs, startups or GG). 

Read also → Which technology monitoring tool should you choose in the face of the explosion in the number of patents?

Build up a complete and relevant documentary fund  

Building a useful and effective documentary fund is one of the first steps of monitoring, of which it constitutes the cornerstone. Not only will it be necessary to be: 

  • exhaustive (collect everything that is likely to be relevant), 
  • while limiting noise (which we ultimately accept as irrelevant documents but impossible to extract at the search strategy stage), 
  • and above all by spending the necessary time and no more , the most important effort having to be reserved for analysis, exploitation, dissemination and collective and strategic enrichment. 

TKM's IPMetrix
software seamless, integrated process that will streamline data collection, enrich raw data, and leave maximum space and functionality for analysis, sharing, and capitalization. But let's start at the beginning. 

What is a documentary fund?? 

A documentary fund brings together all the documents, data and sources of information collected on a topic of interest (a technological field or an industrial sector). 

This organized collection provides businesses and analyst teams with a reliable and relevant knowledge base to support the strategic management of their organization. 

More than just a collection of documents, a well-structured documentary fund allows rapid access to verified, enriched, organized information ready to be analyzed by knowledgeable teams (IP, R&D, Innovation or strategic marketing). 

The importance of the documentary fund 

In the current context of infobesity , where information is abundant but not always of good quality, having a complete and structured documentary fund becomes a strategic asset . 

It allows you to: 

  • Centralize knowledge from various sources. 
  • Filter and enrich raw information. 
  • Capitalize on this information. 

Infobesity refers to the overload of available information, the quantity of which often exceeds our processing capacity and makes it difficult to identify truly relevant content.

Building a documentary collection : a major challenge 

Creating and managing a documentary fund represents a major challenge, particularly due to the diversity of information sources (specialized databases, websites, scientific publications, patents, etc.) and the need to constantly update the data collected. 

Moreover, the process of collecting, organizing, and analyzing information requires a considerable investment of time , not to mention the specific skills needed to distinguish relevant information from superfluous data. 

Read also → Prior art search: a strategic step for all companies, essential for SMEs

The steps to follow to build a documentary fund

The creation of a documentary fund involves the development of a clear and formalized research and collection strategy, directly linked to the strategic question(s) to which the monitoring is supposed to respond.

A research strategy corresponds to the structured method (keywords, sources, operators, filters) implemented to collect in an exhaustive and relevant manner the information necessary for monitoring.

This is a process that involves several key steps , each requiring attention and methodology. Here is a guide to navigate these steps and develop a structured and relevant documentary collection.

1. Determine the search scope

The first step is to clearly define the topic of your monitoring. This involves specifying the area to be investigated, the specific questions you wish to answer, and the objectives of your research. This essential step allows you to more effectively target subsequent research and avoid collecting unnecessary information .

2. Select relevant sources

Not all sources of information are equal. And a single source of information is rarely enough. This is why, since its creation in 2004, TKM has always prioritized the ability to collect and analyze a wide variety of information sources in the development of its software suite : patents, scientific publications, the web, theses, conferences, collaborative projects, startups, etc.

It is therefore essential to identify those that will be the most relevant before embarking on the creation of a state of the art or the configuration of a periodic monitoring. Some databases are free, others are paid. The costs can quickly become a limit. This is why TKM also offers privileged access to its own Data Lake , a global database of patents, scientific articles, projects and startups.

3. Implement a collection strategy

Once the search scope is known and the sources identified, it is time to list the keywords that will enable this exhaustive and relevant collection. Each database has its own " grammar ," and it is useful to know it.

However, most often Boolean operators (AND, OR, NOT), truncation operators or even proximity operators (NEAR, W..) will help you to fine-tune sets of keywords that will allow you to combine efficiency and exhaustiveness. Ideally, we will tend to keep a little noise (10 to 15%) rather than risk missing useful information.

But if this exercise becomes complicated, at TKM the team of analysts, composed exclusively of high-level scientists, has developed real expertise on the subject over the years.

4. Organize and index documents

Nowadays, and whatever the subject, collection most often results in a significant volume of documents (several hundred to several thousand or even tens of thousands). To begin with and before the analysis phase, a phase of curation, enrichment and careful organization is required .

This can involve adding useful metadata, extracting named entities useful for your research, or normalizing affiliations (names of organizations) and semantic indexing of documents.

Then the implementation of a classification system, or even tags or selection will be useful to facilitate subsequent searches : yours but also those of the client teams in the end: management, R&D, IP, innovation, corporate venture, etc.

5. Extract useful information

Simply collecting information isn't enough. But if your collection of documents includes several hundred documents, you probably won't want to read everything... and in fact, it's not desirable.

Tools for semantic segmentation, research, weak signal detection and even “outliers” will be extremely useful for:

  • reinforce your vision of the field (what you already knew),
  • (emerging technology, startup, new patent, strategic pivot of a competitor, etc.).

By segmenting your documentary collection , and without having to read everything, you will be able to build strategic intelligence on the subject , initiate an internal collective intelligence process and thus support the strategic management of your organization.

And finally, some particularly inspiring or strategic documents (opportunity or threat) will require in-depth reading.

The purpose of the document collection is to give you the opportunity to approach your strategic issues at a global level, while allowing for back-and-forth at the level of individual documents.

The strategic intelligence you build then takes on another dimension, and never leaves your internal customers or your management indifferent. The goal is to transform raw data into useful knowledge for decision-making.

6. Feed and update your fund

A documentary collection is a living tool that may require constant updating and enrichment to remain relevant. Even in the case of one-off monitoring (mapping, state of the art, landscaping, etc.), it is quite common for the decision to upgrade it to periodic monitoring.

The pace of this update will depend on the strategic intensity of the issue, but also on the technological movement in the field. Thus, monitoring AI or cosmetics requires a monthly update at a minimum, given the volume of knowledge produced by research and industry players.

For this reason, among others, it is essential to keep a precise formal record of the search strategies used .

Then apply them regularly to the different sources, unless you have access to a global document fund like the TKM Data Lake, thanks to which updates can be easily automated.

You'll also need to run the new data through the various steps outlined in this article. This process can be tedious and can quickly turn into a nightmare for the monitoring analyst. But certain tools can save you valuable time.

IPMetrix : the ideal tool for building your complete, capitalized and regularly updated documentary funds 

With its integrated approach, IPMetrix transforms the complexity of intelligence into a structured and efficient process. Let's see why and how! 

Complete processing of standby kinematics 

From data collection to dissemination, including research and analysis, IPMetrix covers all the steps necessary for effective strategic monitoring . First of all, it allows analysts to concentrate in a single knowledge base all the sources useful for resolving the strategic question, the origin of the monitoring need. 

There are thus more than 250 types of sources or databases that can be grouped within a project in IPMetrix.  

Data import and integration 

IPMetrix offers flexible solutions for adding new data to the document collection, whether it is external documents, database connections via API, or website integration by entering URLs . This flexibility guarantees comprehensive and up-to-date monitoring. 

Creation of relevant documentary funds 

With IPMetrix , users can create documentary collections on specific subjects by integrating data from multiple sources. The ability to create organized and thematic document collections then facilitates the work of analyzing and building strategic intelligence on the subject. 

Designed for collaborative monitoring 

The collaborative aspect of IPMetrix is ​​a major asset. By allowing the sharing of comments, analyses, documents and insights within the team, it allows the emergence of collective intelligence, of which monitoring is then the catalyst. This group dynamic not only enriches the quality of monitoring but also, and above all, it places it at the heart of the company's strategic management. 

Advanced alert management 

IPMetrix 's customizable alerts ensure continuous monitoring, notifying users of any relevant news based on their areas of interest. Extending IPMetrix into a simplified interface, the Front View, extends the monitoring and innovation community to everyone in the company. 

Powerful analysis tools 

The tool stands out for its data analysis and , allowing it to detect trends, compare information, and synthesize complex insights. These features transform the document collection into a strategic resource that can be directly used for decision-making. 

Focus on the use cases of TKM and IPMetrix 

TKM and its IPMetrix offer versatile intelligence and decision support solutions that can be adapted to a variety of use cases in the field of R&D and innovation . Whether companies are looking to conduct a 360° study on a specific area, identify emerging trends or monitor the competition, TKM offers a personalized approach that meets the specific needs of each client. 

Here's how TKM and IPMetrix position themselves as invaluable resources in different usage contexts. 

For businesses seeking a comprehensive understanding of a field 

When a company wants to obtain a complete overview or a state of the art on a specific sector but lacks internal expertise or time, TKM intervenes by creating a detailed documentary fund in IPMetrix . This service provides the client with a solid basis for conducting their analysis, based on an exhaustive and relevant collection of information. 

Autonomy or personalized support 

With the acquisition of IPMetrix , customers have the choice between operating the tool completely independently or benefiting from personalized support via TKM's integrated services. For customers who prefer a turnkey approach, TKM offers customized monitoring services , including the regular collection and cleaning of documents, as well as their integration into the client's document collection on IPMetrix. 

Hybrid delivery : a flexible model 

This hybrid delivery model underscores the flexibility of TKM's offering. It adapts to each client's internal structure and capabilities. For companies without dedicated intelligence staff, or for those that prefer to focus their resources on strategically analyzing information rather than collecting it, TKM handles the entire process.  

This frees companies from the operational burden of monitoring, allowing them to focus on leveraging insights to support their strategic decisions. 

Personalized data collection 

IPMetrix is ​​designed to facilitate the collection of targeted data based on the specific needs of its clients. Whether through databases you already have access to, or through targeted web searches, IPMetrix allows you to gather a rich and relevant documentary collection. You can import documents, connect databases via API, or simply add URLs to enrich your monitoring. 

By offering a complete monitoring platform, from data collection to analysis, IPMetrix transforms information overload into a competitive advantage. Whether companies opt to use the tool independently or prefer to benefit from tailored support, TKM adapts its services to each specific need.  

By making business intelligence more accessible, IPMetrix empowers decision-makers to confidently navigate a complex information landscape and make informed decisions for the future of their business. Want to learn more? Contact us.

Other articles

Banner of the article on TKM's commitment to the climate

Climate Business Convention: TKM on the road to responsible innovation

Through the Business Climate Convention, TKM is strengthening its commitment to responsible and sustainable innovation. In this interview, Julie Lambert, Director of Customer Experience and "Champion for the Planet," shares the concrete actions taken to reduce the company's impact, support local initiatives, and assist customers in their sustainable technology choices.

Learn more

FOLLOW OUR NEWS BY SUBSCRIBE TO THE NEWSLETTER

Optimized with PageSpeed ​​Ninja