Digital Information: Gaps in Knowledge Understanding and Access

January 30, 2013

Stephen Arnold took a conversation he and a few of our colleagues had and wrote more about it in his Beyond Search blog. “Thoughts about Commercial Databases: 2013” is worth a review and I’ve added a few of my own thoughts here for your consideration.

The conclusion of our discussion is summed up nicely  by Arnold in that the digital future of information companies is gloomy and his post outlines a few familiar names in the world of libraries.

  • Ebsco Electronic Publishing (everything but the kitchen sink coverage)
  • Elsevier (scientific and technical with Fast Search in its background)
  • ProQuest (everything but the kitchen sink coverage plus Dialog)
  • Thomson Reuters (multiple disciplines, including financial real time info)
  • Wolters Kluwer (mostly legal and medical and a truckload of individual brands)

During our discussion the questions was posed, how can database companies grow? The short answer is their are no obvious growth patterns beyond acquiring other information publishers. A point that caused one in the group to say, eventually the beasts will begin eating themselves because of the hunger when there is no fresh meat. Amusing and yet frightening.

Arnold quotes “Why Acquisitions Fail: The Five Main Factory by Pearson Education” to explain the key factors in why acquisitions result in problems rather than soaring success.

The fact that library budgets continue to shrink, open access continues to grow and  large database companies fail to adjust business models for these realities causes deep concern for the researcher in me. As Arnold states:

The business model for these firms has been built on selling “must have” information to markets who need the information to do their job. The reason for the stress on this group of companies is that the traditional customers are strapped for cash or have lower cost alternatives.

Other concerns abound as well. As libraries continue to limit access to physical collections thanks to the value of library real estate, strain is placed on the serendipity of browsing researchers. Digital research presents its own challenges. It often leaves one feeling as though they have retrieved a a good match but is it really the best and is it complete? When you add in that many of today’s students, even those training to be librarians,  do not successfully distinguish between source and provider in the electronic age, the concerns for access, understanding and knowledge abound.

While Arnold concentrates on the outlook of commercial databases and even suggests that an acquisition by Google to monetize the content with ads could be a shift in the future of information publishing, there are other concerns to ponder.  Curated content has a future, but what that future holds in terms of commercial versus open access is yet to be thought out in light of what Arnold suggests as the trend for 2013 commercial databases.

Those who think that public search companies are keeping the archive of all digital information are in for a rude awakening. Librarians and information professionals need to get beyond teaching people how to search. As professionals, we have a duty to understand the business pressures of our information suppliers, free or fee, and what those pressures do to the availability of yesterday’s information in today’s reality of right now access.

Information professionals must think about and prepare for the inevitability of lost information. The Way Back Machine may be expanding their database but they are not archiving the complete history of companies that are no longer in business. Think about the number of start-ups that are no longer around, who were the corporate officers, what was their credit history? The gaps in corporate information mean that there will be gaps in ongoing competitive intelligence.

This is a simple issue on the surface with unfolding complexities that warrant thought and planning and action. Just as the burning of the Timbuktu library means of loss of valuable information, so too do the cost pressures and lack of access and exposure to digital data.

Innovation on the commercial side seems nearly impossible. Curation and access on the public search side is limited by the ability for providers to drive their profit in light of their own business models. Open access is being challenged to the point where advocates such as Aaron Swartz ends his own life. The Library of Congress is archiving Twitter when they may be better serving the longevity of knowledge and information by archiving the “free” information on the world wide web.

Of course, the practical part of me that understands that daily life grinds on no matter what understands that this is a good intellectual argument. In the long run will this have a significant impact on daily life? Probably not. It is something that when I think about the history of knowledge and culture, gets my mind whirling. Business will do what businesses do, libraries will do what libraries do and maybe just maybe the digital gaps won’t cause overwhelming repetition of mistakes.

Either way, it is fun to think and share and get input from intelligent colleagues.

Constance Ard, January 30, 2013



Friends of Answer Maven Launch New Business Blog

January 14, 2013

I’m so excited to share news with you about a colleagues latest venture. They have released The Team Building Factory Blog today.  The posts will cover leadership development, employee engagement and other business savvy advice.

Is this meeting space or Living Room The folks at Workshop, the Creative Workplace have a very unique business and one that offers me a great space for meetings for both myself and my clients. In addition, they work with businesses throughout the United States to help develop their teams into more productive, more successful and more cohesive units.

Each team adventure has a give back component that strengthens not only the team’s presence within the Louisville community, it also underlines Workshop’s desire and dedication to paying it forward.

So check out their blog and give them a shout out for this great new undertaking. I promise you will gain added value from the effort.

Constance Ard, January 14, 2013


Big Data Meets Predictive Coding: Economic Impact To Be Determined

December 10, 2012

Litigation, especially for high profile companies means big money. In the age of big data, litigation and eDiscovery means big money too. So this Corporate Counsel article, “Change Is Coming: The Evolution of E-Discovery Economics,” caught my attention.

If you have ever done a Google keyword search, even if it is a custom search, you realize that keyword search is not efficient in returning narrow, accurate, relevant results. In a deluge of data, when time and costs matters, keyword searching is probably not the most efficient method of reviewing and producing documents in response to a discovery request. However, the article does make the valid point that until recently keyword search was the best we had available.

Now we enter the age of predictive coding. This technology opens an entirely new view of the massive landscape of structured and unstructured data.

Predictive coding is software that is trained by a user to predict which documents in a document set will be responsive and which will be non-responsive. Predictive coding goes by many names, including computer-assisted review and technology-assisted review.

Predictive coding aims to reduce the number of documents reviewed by ranking the documents according to a calculated level of responsiveness. Instead of looking at every email written by a custodian over a three-year time period, predictive coding uses a number of factors including keywords, writing style, subject matter of the writing, and even punctuation style to determine the chain of documents that are most relevant to the matter. These underlying programmable algorithms vary between software brands.

The discussion in the Corporate Counsel article is lengthy and worth reading carefully. Predictive coding is hot in the technology solutions landscape and Recommind was one of the first in the market space. However, as we all know, the world of law is a bit slow to embrace the newest technology. Someone else needs to test the water and find out if it is too hot, too cold, or maybe it is the Baby Bear solution to eDiscovery and offers the just right option. 

I think the massive amount of information that is being created, retained and therefore open for discovery needs more than predictive technology. I think active information management policies and procedures will be a key to the effective cost control of the evolving land of litigation.

Constance Ard, December 10, 2012


Lawyers Hold Ultimate Responsibility for eDiscovery Processes: Are You Prepared?

December 7, 2012

Sometimes risk management as related to eDiscovery takes on new levels of quality control. As I read a recent Metropolitan Corporate Counsel article, “Risk Management and Quality Control of E-Discovery Vendors” the complexity of eDiscovery was driven home.

Understanding the capabilities, quality and costs related to using eDiscovery vendors is an important responsibility. The reason being, the attorney has ultimate responsibility.

This article offers some very good reminders about what should be asked, inspected and known about your vendors. It is more than a processing capability. Is the data truly secure? Is your vendor using other providers?

As the article points out:

The consequences are steep for failing to be fully engaged with e-discovery vendors. In J-M Mfg. Co., Inc. v. McDermott Will & Emery, No. BC 462 832 (Cal. App. Dep’t Super. Ct. L.A. Cnty. filed June 2, 2011), the defendant faced a legal malpractice suit when it allegedly did not carefully review the work of contract attorneys at an e-discovery vendor, resulting in the production of almost 4,000 privileged documents to the federal government in a whistleblower suit.

The need to be involved and to understand the capabilities of your vendors is important. Understanding in-house processes is just as important. The dimensions of where information is stored, how it is accessed and the management involved are critical components of knowing what, when, how and who related to eDiscovery processes.

Constance Ard, December 7, 2012


Information Management: The Lost Leg of EDRM

December 6, 2012

How big is eDiscovery? Really big! In a CMS Wire article, “Overview of e-Discovery Across the Microsoft Office Platform #spc12” we see that Microsoft is pushing into the technology wolf pack that is circling the corporate eDiscovery wagon.

At a recent Sharepoint Conference, author Mike Ferrera sat in on a session by Quentin Christensen, Program Manager at Microsoft and learned just how Microsoft was addressing the eDiscovery landscape.

Quentin began the session by talking about the nuances of e-Discovery, specifically Identify and Preserve, Search and Process, Review and Produce, and how Microsoft plans to address these. The main driver behind Microsoft’s e-Discovery push is in-place hold, query and export, which will give the legal team the ability to create case sites very quickly without having to call IT.

Like many technology vendors that offer eDiscovery solutions, Microsoft is paying less attention to the front end of the EDRM, especially the information management portion.

A company that seeks to get the best bang for their information technology and information management investments will do well to create a data map, and put in place governance measures that make the preserve, search and process work more cost-effective to manage.

Constance Ard, December 6, 2012


Model to Collaborate and Communicate Information Governance

December 5, 2012

I was browsing around the EDRM site last week and took a look at the developments for the Information Governance Reference Model. If you haven’t looked at this before, now might be a good time to do so.

This framework addresses some model behaviors that can lead to a high value business process that includes protective measures.

The IGRM Project aims to offer guidance to Legal, IT, Records Management, line-of-business leaders and other business stakeholders within organizations. The Project seeks to facilitate dialogue among these stakeholders by providing a common language and reference for discussion and decision-making based on the needs of the organization.

The various departments and stakeholders involved in the use, management and compliance needs of organizational information may be able to use this tool to communicate standards and policies that will allow for better protection. Enterprise information is a high value commodity within any company. Using it, managing it and keeping it fresh can be a strong competitive edge. This model may be instrumental in helping you find a way to remain competitive.

Constance Ard, December 5, 2012


Follow

Get every new post delivered to your Inbox.

Join 351 other followers