Interview 
                        Moreover, the News Aggregator: Interview
                        with Jim Pitkow 
                        by Paula J. Hane 
  When it comes to real-time news, Moreover offers a wealth of options harvested
  continuously from more than 8,000 online news and information sources across
  115 countries in 23 languages. Articles are aggregated from a broad range of
  publications, including premium international and regional news sites, corporate
  Web sites, government press pages, Weblogs, discussion boards, and more. Moreover
  delivers content for use in custom applications, Web sites, search engines,
  corporate intranets, and enterprise portals. Jim Pitkow has led Moreover as
  CEO since May 2002. Previously, he was president and chairman of Outride, Inc.,
  a spinout from the Xerox Palo Alto Research Center (PARC) that was acquired
  by Google. He received a Ph.D. in computer science from the Georgia Institute
  of Technology in 1997 and is considered an expert in Web characterization,
  information retrieval, and human-computer interaction. In late January, we
  talked about Moreover's role as middleman in news provision as well as the
  company's plans for 2005. 
  Q:  Since the company began in 1998, its model has been one of aggregation
  rather than syndication. Can you explain how this works?  
 A:  Moreover was founded on the core premise that, with the Internet,
  new forms of distribution of content were possible. The requirement of actually
  having a physical copy of a publication delivered by a publisher was now antiquated.
  This fundamentally changed the nature of information dissemination and delivery
  and has allowed new business models to evolve. We think the core premise of
  most publications is circulation and growing their readership. The Internet
  enables a wider readership than was ever possible before. Part of what Moreover
  does is to deliver increased circulation and distribution, and, thus, enhanced
  audiences, to publishers.  
 What has really catalyzed our business in the last several years has been
  the role of aggregation, especially in the consumer marketplace. Businesses
  have long known the value of getting news from multiple sources. Look at what
  happened in 2004 when Yahoo! News and CNN were going tit for tat for the top
  online news destination site. One's an aggregated model, the other is a direct
  brand. Though Yahoo! has been in business for 6 or 7 years, it was covering
  the events of 2004 that actually showed the true power of aggregated news.  
 The industry is now beginning to reflect this. We now see AOL moving to an
  aggregated model and providing content from outside its garden wall. We see
  this with MSNBC and the Newsbot and, most recently, with Moreover's announcement
  of RSS feed aggregation for MyMSN. Content used to be restricted to MSNBC and
  its partners, but with this arrangement, users can get content from anywhere
  on the Internet. So, the walls are coming down. Aggregation and the ability
  to pull information from whatever sources you wantthis is what 2005 is
  all about. It puts the choice of information back in the hands of the consumer.
  This represents 100-percent disintermediation of the traditional publishing
  value chain.  
 Q:  How does this work with subscription-based sites such as The
    Wall Street Journal?  
 A:  We work with the publishers, and it's fine if they want to work
  within a subscription model. The Moreover model handles it gracefully. These
  sites are clearly demarcated within our system, and our customers can choose
  to filter them out or [to] include them. For example, MSN prefers not to have
  barriers put up to [its] users. But, for our corporate customers, The Wall
  Street Journal flows freely through our system, if you have a subscription.
  For us, it's just a flick of the switch to validate a publication.  
 So, we enable choice on both sides of the equation. Publishers can choose
  whatever model makes the most sense to them. Consumers and businesses can make
  their information choices. Moreover is the pure aggregatorthe middleman.  
 Q:  You have several versions of what you call CI-Metabase (Connected
  Intelligence), I believe, a consumer one and an enterprise version?  
 A:  Yes, we have various segmentations of our CI-Metabaseone
  is geared toward consumer interests; the other primary segmentation is for
  corporations. [The latter] tends to have niche, B-to-B trade publications that
  are very industry-specific [and] that would probably not be of interest to
  consumers on the open Internet. We work with partners on both sides and constantly
  refine these. And, there are many more subscription-based sources included
  in the enterprise version.  
 Q:  At one time, individuals could visit your site and sign up to
  receive e-mail with news headlines for specific topics. This is no longer available.  
 A:  We decided that the best business model was to deal with individuals
  indirectly, so we work through our partners now to reach those peopleand
  as a result, we reach many more people than we could have otherwise. And, this
  creates a clear channel for us. We don't compete with MSN for their userswe
  partner with them instead. MSN might compete with Yahoo! or Ask Jeeves, but
  they're not competing with us. We've become very neutral within the marketplace,
  and, thus, Moreover is the choice of multiple vendors.  
 We have the same dynamic in the search engine industry that we do in the
  reputation management industry. Companies like Biz360, 2B, Delahaye, and others
  have relationships with Moreover and rely on us to deliver that piece of their
  value chain, reliably and consistently. In comparison, a news aggregator like
  Topix actually has a destination Web site. We feel that this can create channel
  conflict. [Editor's Note: 2B has since been acquired by Factiva. Moreover
  has announced it will supply news for Factiva's Insight line of reputation
  management solutions.]  
 More importantly, looking at this from the end-user's experience, it's better
  for them to be able to get the news from these portal and destination sites
  than [to] have to come to us just for news. It makes more sense to have news
  integrated into a MyMSN page, along with weather, financial information, etc.  
 Q: Moreover has deals to supply news for some of the major search
  enginesYahoo! and Ask Jeeves. What about others, and what about Google?  
 A: Yes, we do supply many of the search engines and we have pending
  deals with several others. Google has an "invent our own" culturethey
  like to own the pieces. In a funny twist, it turns out that I went to grad
  school at Georgia Tech with the person who wrote Google Newsa great guy
  named Krishna Bharat. He's been into news for the past 10 years. Our solution
  was out in the marketplace with AltaVista 2 years before Google's offering.
  Every other company that faced the buy-build decision and looked at what Moreover
  is able to do found that we're able to amortize the costs and provide it at
  very reasonable price points. It's a naturalbuy from Moreover.  
 Q:  Google News uses an automated system to compile its news. In contrast,
  Moreover's system does categorization and provides human intervention in its
  process. Can you comment on the difference for users?  
 A:  Clearly, when you look at the limitations of the automated systems
  and the errors created, there's a lot of agreement that this creates a less
  desirable experience. News has a long tradition of using editors and human
  judgment to determine what is important, and we feel that this is the right
  treatment for this type of content. Our corporate customers clearly value the
  editorial work and the level of metadata that we add.  
 Q: I suspect that people who see your headlines on various sites may
  not be aware of the behind-the-scenes analytics that Moreover doesenriching
  articles with metadata and doing categorization.  
 A: We spent 100-plus person years building our editorial system from
  scratch. It's a combination of human editorial input with automated systems
  underneath, coupled with the latest in information extraction technology. We
  use a combination of lexical, semantic, and contextual analysis to reliably
  extract things like location. Unless there are sophisticated technologies behind
  the scenes, you just won't get precision and accuracy.  
 A lot of our data ends up in reputation management systems, which plow through
  hundreds of thousands of articles a day. If there are errors in a system and
  it scales across that number of articles, the system won't prove to be reliable.
  We have very high demands placed on our product from these enterprise environments;
  the Web search engine requirements for our product are actually an order of
  magnitude less demanding.  
 Q:  Where do you think search is headedwill some newer technologies
  (like clustering and visualization) be necessary add-ins to improve the search
  experience?  
 A:  I certainly agree that the tools for interpreting and accessing
  information have a long way to go. There are also behind-the-scenes things
  that need to happen, like spam detection and removal. There are also aspects
  of integrationunifying a user's desktop with Internet search and the
  search of specific databases. There's work to be done in personalizationbuilding
  trust with the user so that the more you know, the better chance there is of
  finding what they want. There are specific technologies that you mentioned,
  such as categorization and visualization, that are in service of those larger
  goals. Localization is another piece of this. These are some of the high-level
  themes that I know most of the search engines are working on. There's also
  multimedia. We just saw the news of Google moving into video, Yahoo! moving
  into video, Google moving into print, etc.  
 Q:  Doesn't Moreover have a broadcast content package that is sold
  just to enterprise clients?  
 A:  Yes, we launched that last year.  
  As opposed to the current offering from Google, we enable people to get an
    actual clip of the broadcast and the transcript. We either work with closed
    captioning or do the speech-to-text translation ourselves. We're starting
    to see the opening up and repurposing of mediabroadcast, print, [and]
    proprietary databases are all making their way onto the Internet.  
 Q:  With your news of supplying RSS feed aggregation to MSN, can we
  expect to see Moreover supplying broadcast content to the consumer side as
  well?  
 A:  That's a larger issue that is beyond the scope of this conversation.
  But, let me address for a moment the RSS news with Microsoft. Basically, RSS
  provides the ability for consumers to subscribe to content on their own termsagain,
  putting control back in the hands of users.  
 There are three things that make this an important announcement. First, this
  is the first consumer-facing product by Microsoft that embraces RSS. With the
  format war between Atom and RSS, this amounts to the 800-pound gorilla saying, "It's
  RSS." This is big in the blogging and standards communities. The second piece
  we touched uponMicrosoft has opened up the wall to include all content
  for the users' experience. Third is that there has been no major search engine
  or portal that searches the content of the blogsphere. Our solution enables
  users to find content, not only if the topic is in the description or metadata
  of the feed, but also if there are postings that incorporate the topic as well.
  This is powerful and makes MyMSN the first major portal to embrace that depth
  of search technology for blogs. This sets the stage for a possible blog tabjust
  as there was a video tab introduced today.  
 Q:  What are Moreover's plans for 2005?  
 A:  We see 2005 as a strong continuation of the momentum begun in
  2004. We'll continue to evolve our work with broadcast, do some cutting-edge
  things with RSS, and expand the content in our corporate products. We started
  with current awareness, expanded that to blogs, and, recently, to broadcast.
  We continually examine what makes the most sense to our users.  
 Q:  What value proposition does Moreover offer to prospective corporate
  customers as opposed to choosing a service like NewsEdge or Factiva?  
 A:  On the corporate side, our differentiation is pretty clear. Nobody
  can provide as much information as finely filtered and as fast as Moreover.
  The need for real-time, highly structured information from a variety of sources
  continues to be the business requirement. The limitations of the older models
  are beginning to be felt, particularly in timeliness. And, the information
  extraction we do to structure the data is key. We have 30-plus tagsnobody
  can compete with that.  
 Q:  Are you saying that Moreover's categorization is superior to what
  is done by Factiva or NewsEdge?  
 A:  Yes, we've had customers say they prefer our tagging and categorization
  over that of other companies.  
 Q:  But, of course, those services also offer rich archivessomething
  Moreover hasn't done.  
 A:  The standard Factiva product is 90 days of content, and the customer
  pays more for access to the archive. Our focus is on current awareness. We
  find the actual business use of archival and research material is not enterprisewide.
  It actually turns out to be a small group of individuals [who need that access].
  We find that the bigger opportunity is providing timely information to sales
  account reps, for example. Current awareness information can fold into the
  tapestry of an organization's information infrastructureit's a lot richer
  than it used to be.  
 Q:  What do you think is the biggest challenge you face as a company?  
 A:  Our biggest challenge is just scaling our growth. We have very
  clear signals from the marketplace that what we're doing provides tremendous
  value to our customers. We have ridiculously high retention rates. We need
  to remain smart and focused and move opportunistically into expanding directions
  to further compound profits.  
   
 
Paula J. Hane is Information Today, Inc.'s news bureau chief and editor of
NewsBreaks. Her e-mail address is phane@infotoday.com.
  |