CHI '95 ProceedingsTopIndexes
DemonstrationsTOC

Interactive Data Visualization at AT&T Bell Laboratories

Stephen G. Eick and Brian S. Johnson * AT&T Bell Laboratories 1000 East Warrenville Road, Room 1U-328 Naperville,Illinois, 60566 USA {eick,bsj}@research.att.com 708-713-4070

© ACM

Abstract

Visualization is a key technology forunderstanding large bodies of data. Our approach to visualizing abstract, non- geometric data involves a reduced-representation overview, multiple linked views, filtering and focusing techniques to reduce visual clutter, color, anda highly- interactive user interface. The reduced representations allow users to see the entire data set in one view while still providing immediate accessto relevant detail and answers to specific questions in the linked views. Wehave developed a software infrastructure embodying our design principles for producing novel, high-bandwidth visualizations of corporate datasets. Our approach to abstract data visualization is one the best off-ramps on theinformation superhighway.

Keywords:

Visualization, Graphic Interaction, Abstract DataVisualization, Database Visualization, Data Mining

Introduction

Just as spreadsheets revolutionized our ability to understand small amounts of data, visualization will revolutionize the way we understand large corporate datasets. Our research focuses on extracting the information latent in large databases using novel visualizations. The difficulty in extracting this information is understanding the complexity of the databases. To aid in this task, we have created many novel, highly interactive visualizations of large datasets. Our research involves developing the techniques, software tools, and infrastructure to mine knowledge fromcorporate databases so that it can be put to competitive and commercial advantage.

INTERACTIVE VISUALIZATION

Most of today's interfaces to large data sets show only a few aggregate items at a time. Our goal is to use every available screen pixel to show as much data as possible, thereby providing local detail in a global context. To achieve this goal we use interactive techniques to solve the clutter problems associated with information-dense displays.

Compact Graphic Representation

Navigation is frequently anissue in the design of interactive systems dealing with large information spaces. Compact graphic representations can provide global perspective in aninformation-rich environment, thereby maximizing data accessibility and minimizing navigation. These compact representations take full advantage of perceptual cues (size, position, color, depth, sound, etc.).

Highly Interactive Linked Views

The power of our representations is magnified through the use of interaction and linked views Each view, whether custom or standard (color keys, bar charts, box plots,histograms, scatter plots, etc.), functions both as a display and a control panel. Selecting and filtering data in one view instantly propagates to the otherviews, thereby providing additional insights. Linking multiple views interactively provides an integrated visualization far more powerful than the sum of the individual views.

SYSTEMS

Our systems have been used to successfully analyze and present software version control information, file systems, budgets, network traffic patterns, consumer shopping patterns, relational database integrity constraints, resource usageon a compute server, etc. The amount of information that our systems present on a single screen is between 10,000 and 1,000,000 records. The systems we have built include:
SeeData relational data
SeeDiff file system differences
SeeLib bibliographic databases
NicheWorks[1] abstractnetworks
SeeLog time-stamped log reports
SeeNet[2] linked geographic data
SeeSlice[3] program slices and codecoverage
SeeSoft[4] lines of text in files
SeeSys[5] hierarchical software modules
SeeTree[6] hierarchical data

Since the needs of each user are unique, our visualizations are task-oriented. Our most successful visualizations help frame interesting questions as well as answer them. Our visualizations:

EVALUATION

Our research interest is in visualization techniques that scale to industrial-sized systems. By applying our tools toreal problems and producing working software, we gain an increased understanding of how visualization methodology works in practical situations. This enables us to discover the fundamental insights and formulate the guiding principles for effectively extracting information using visualization. By exercising our tools, we gain insights into critical issues, which enables to us refine our design principles and improve our capability for rapidly producing novel, custom, information-rich displays. Our systems have enabled users both to 1) gain newinsight into their data and 2) improve performance of existing data analysis tasks. For example, a formal evaluation of 40 subjects presented with a budgetanalysis task found a graphic visualization interface to be 50% faster overallthan a comparable dynamic outline interface [6].

SOFTWARE AND TECHNOLOGY

Underlying all of our visualizations is a common infrastructure embodied in a C++ library that handles interaction, graphics, and data linking. This C++ Visualization Library helps us to:

Minimize our development time,
Encapsulate expertise and design principles,
Build cross-platform systems (UNIX/X11, Open GL, and PC/Windows), and
Keep visualization application code small.

CONCLUSION

Visualization is a key technology that canhelp users understand the complexity in industrial-sized systems. We havedeveloped many interesting and novel visualizations of a variety of large andcomplex data sets. We would like to share and discuss our approach to abstract data visualization with others working in the area of interactive interfaces to complex information.

ABOUT THE AUTHORS:

Stephen G. Eick is the Technical Manager of the Data VisualizationResearch group at AT&T Bell Laboratories. His educational background includes a B.A. from Kalamazoo College (1980), M.A. from the University of Wisconsin at Madison (1981), and a Ph.D. in Statistics from the University of Minnesota (1985). Eick is an active researcher, is widely published, and holds severalsoftware patents. He is particularly interested in visualizing databases associated with large software projects, networks, and building high-interaction user interfaces. Brian S.Johnson is a Member of Technical Staff in the Data Visualizaion Research groupin the Software and Systems Research Center at AT&T Bell Laboratories. Hereceived his B.S. in Computer Science from the University of Minnesota (1987),and his Ph.D. in Computer Science from the University of Maryland (1993). Heis actively involved in using visualization techniques to understand the abstract information associated with large corporate data sets.

Acknowledgments

The research presented is a joint effort of the following people: Jackie M. Antis, David L. Atkins, Thomas J.Ball, Kenneth C. Cox, Stephen G. Eick, Brian S. Johnson, Paul J. Lucas, John D.Pyrce, Anselm Spoerri, Joseph L. Steffen, Graham J. Wills.

DEMO References

  1. Eick, S.G. andWills, G.J. Navigating Large Networks with Hierarchies, in Proc. IEEEVisualization ‘93, 1993
  2. Becker, R.A., Eick, S.G.,Miller, E.O., and Wilks, A.R. Dynamic Graphical Analysis of Network Data. inProc. ISI, 1989.
  3. Ball, T.J. and Eick, S.G. Visualizing Program Slices, in Proc. IEEE VisualLanguages, 1994.
  4. Eick, S.G., Steffen, J.L., and Sumner E.E.: Seesoftô- A Tool forVisualizing Line Oriented Software, IEEE Transaction on Software Engineering11,18 1992.
  5. Baker, M.J., and Eick, S.G. Space-Filling SoftwareVisualization, in Proc. Int. Conf. Software Engineering, 1993.
  6. Johnson, B.S. Treemaps: Visualizing Hierarchical and Categorical Data, Ph.D.Dissertation, University of Maryland, 1993.