I spent last weekend reconfiguring and rebuilding the search indexes for a client. We indexed something like 3.5 million items, consisting of about three terabytes of data. It was a lengthy process, which gave me a lot of time to think about best practices for document management, document storage, and enterprise search. The first thing that struck me was the sheer magnitude of the document store. Granted, my client is a pretty big company, but I find it difficult to believe all those files are necessary for the day-to-day operations of the firm. In fact, we weren't indexing and searching the totality of documents in the organization, only those items contained within a particular collaboration/content management system.
Most of the content had no metadata associated with it beyond the obvious: author, time of creation, type of file, and so on. The net result was a full-text index of all those documents, which is not all that useful, because corporate documents tend to use the same terms over and over, producing search results that are difficult to sift through.
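To make that point concrete, here is a minimal sketch of why metadata matters. The field names and sample documents are hypothetical examples of my own, not anything from the client's system: a pure full-text query over repetitive corporate documents matches nearly everything, while the same query scoped by a couple of metadata fields narrows the results to something a person can actually review.

```python
from dataclasses import dataclass, field

@dataclass
class Document:
    doc_id: str
    text: str
    metadata: dict = field(default_factory=dict)  # e.g. {"department": "claims", "doc_type": "report"}

# Hypothetical sample documents; real corporate stores repeat terms like "policy" constantly.
DOCS = [
    Document("1", "quarterly claims report covering policy renewals",
             {"department": "claims", "doc_type": "report"}),
    Document("2", "policy renewal checklist for underwriting",
             {"department": "underwriting", "doc_type": "checklist"}),
    Document("3", "claims policy handbook with renewal procedures",
             {"department": "claims", "doc_type": "handbook"}),
]

def full_text_search(docs, term):
    """Naive full-text match: common corporate terms hit nearly every document."""
    return [d for d in docs if term.lower() in d.text.lower()]

def metadata_search(docs, term, **filters):
    """Same term match, but restricted to documents whose metadata matches every filter."""
    hits = full_text_search(docs, term)
    return [d for d in hits if all(d.metadata.get(k) == v for k, v in filters.items())]

if __name__ == "__main__":
    # Full text alone: every document matches "policy".
    print([d.doc_id for d in full_text_search(DOCS, "policy")])   # ['1', '2', '3']
    # Adding metadata filters returns a manageable result set.
    print([d.doc_id for d in metadata_search(DOCS, "policy",
                                             department="claims",
                                             doc_type="handbook")])  # ['3']
```

The same idea scales up in any real indexing platform: the full-text engine does the term matching, but it is the metadata fields that let users (and the relevance ranking) separate three useful documents from three million near-identical ones.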
Paper to Bits?