gms | German Medical Science

50. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie (gmds)
12. Jahrestagung der Deutschen Arbeitsgemeinschaft für Epidemiologie (dae)

Deutsche Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie
Deutsche Arbeitsgemeinschaft für Epidemiologie

12. bis 15.09.2005, Freiburg im Breisgau

WISDOM: Supporting Biomedical Research Through a System for Web-based Interactive Study Design, Operation and Management

Meeting Abstract

  • Claudia Peissert - The Rockefeller University, New York, NY
  • Alexandre Peschansky - The Rockefeller University, New York, NY
  • Knut M. Wittkowski - The Rockefeller University, New York, NY

Deutsche Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie. Deutsche Arbeitsgemeinschaft für Epidemiologie. 50. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie (gmds), 12. Jahrestagung der Deutschen Arbeitsgemeinschaft für Epidemiologie. Freiburg im Breisgau, 12.-15.09.2005. Düsseldorf, Köln: German Medical Science; 2005. Doc05gmds604

The electronic version of this article is the complete one and can be found online at: http://www.egms.de/en/meetings/gmds2005/05gmds399.shtml

Published: September 8, 2005

© 2005 Peissert et al.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc-nd/3.0/deed.en). You are free: to Share – to copy, distribute and transmit the work, provided the original author and source are credited.


Outline

Text

Introduction

As the amount of genetic, genomic, proteomic, and phenomic data is increasing at an unprecedented rate, the complexity of data management is also increasing. In addition, comprehensive analyses of data ranging from diplotypes of SNPs to gene expression and side effect profiles, require flexible statistical tools.

The traditional role of IT (informatics) support to clinical investigators has involved assisting them in the design of their databases after the study had been designed with assistance from a biostatistician. Once the data for an interim or final analysis has been collected, informatics support is provided for statistical analyses and the biostatistician assists in the analysis. At the RUH GCRC, we are developing a Web-based Interactive Study Design, Operation and Management module (WISDOM) to improve the overall quality of clinical trials by making knowledge about the study design available during all stages of development and progress of a clinical study.

Material and Methods

In a typical research environment, various software tools are used to design studies, write protocols, manage and audit data entry and storage, monitor safety, analyze data statistically. Maintaining consistency across such different, often idiosyncratic programs and documents is cumbersome and error-prone, because of redundancy. To match flexible tools precisely to a particular aspect of an analysis, both in terms of the experimental design and the biological background, a precise declaration of the medical and experimental context in which the data was observed (meta data) is necessitated. As statistical methods play an increasingly important role, the metadata to be represented depends to a large extend on the needs of the statistical methods. Formally distinguishing declarative and procedural knowledge between medical background, experimental design, and scientific objectives led to a representation of metadata in five layers corresponding to different levels of abstraction [1].

The ACCESS layer contains knowledge of how to store and retrieve data (e.g., SQL statements) from the different data bases and how to send it to different software packages. The SEMANTICS level contains a description of the experimental design, e.g., how the variables relate to each other and the design matrix underlying each of the statistical methods. The STRATEGY layer contains a description of the variables’ characteristics, such as causality (strata / interventions / observations), discrete / continuous, nominal < ordinal < interval < absolute, measurement units (MODEL), and the rules on how to find corresponding methods (STATISTICS). The DOMAIN layer describes knowledge about the variables, such as ranges/interactions or expected adverse experiences (MEDICINE), and specifications of the output, such as level of detail (PRESENTATION). At the INTERACTION layer, an audit of previous dialog actions is kept and the rights and preferences of each user are being described.

Results

Having meta data entered into the data base management system during the design stage of a study and made available during all subsequent steps allows for providing new informatics services, where the users are freed from the need of repeatedly entering such information [2]. Figure 1 [Fig. 1] shows how DESIGN and MODEL knowledge is entered into WISDOM. After the first year of development, WISDOM already provides several novel biomedical informatics services that utilize this meta data.

  • Preparing structured information to be included with study protocols, e.g., study time line, inclusion / exclusion criteria, anticipated AEs.
  • Automatically creating a data base (currently: Oracle) on a secure server without the need for any additional user interaction.
  • Providing Web-based access to the data base through browser windows, where users can exchange rectangular ranges of data between commercially available spreadsheet programs and the database using standard cut-and-paste functionality.
  • Generating CRFs for data entry, either on paper, or as Web forms.

Discussion

A previous knowledge-based approach [3] suffered from technological shortcomings. In particular, it lacked sufficiently user-friendly tools for knowledge acquisition. Thus, when it was commercialized by SAS, Inc. in 1989 as JMP, only a small portion of its potential was realized. As new technologies for developing graphical user interfaces become available we have now developed a knowledge acquisition paradigm (Figure 1 [Fig. 1]), where clinical trials, genetic studies, or gene expression experiments can be described interactively in sufficient detail to extract DESIGN and MODEL knowledge sufficient to support a wide range of study related activities.

The current version of WISDOM already provides tangible benefits to our investigators. Data is being stored on a secure server with routine back up and Web access independent of operating systems. We are now working on incorporating these data into study management and other components of the overall GCRC IT. In particular, we will draw on the PANOS experience to facilitate graphical and statistical analyses. As WISDOM will contain sufficient information to perform the statistical analyses addressing the primary objectives, these analyses will be carried out automatically after the data has been entered or at any predefined interim time point.

The modular design also allows for applications in other areas or research than clinical studies. Currently, we are working on developing a MODEL knowledge base for microarray experiments [4], so that WISDOM will be able to function as a knowledge based interface to https://mustat.rockefeller.edu, our multi-processor Web server for multivariate ordinal data, in general, and for data mining based on genetic and genomic profiles [5] in stratified designs [6].

Acknowledgements

This work was partially supported by National Center for Research Resources at the National Institutes of Health; contract/grant number: M01-RR00102.


References

1.
Wittkowski KM. Eine wissensbasierte Schnittstelle zu statistischen Auswertungssystemen. In: Böcker HD, Glatthaar W, Strothotte T, editors. Mensch-Computer-Kommunikation: Benutzergerech-te Systeme auf dem Weg in die industrielle Gesellschaft. Berlin, D: Springer; 1993. p. 166-78.
2.
Wittkowski KM. A structured visual language for a knowledge-based front-end to statistical analysis systems in biomedical research. Comput Methods Programs Biomed 1991; 35: 59-67.
3.
Wittkowski KM. A knowledge-based interface to statistical analysis systems - report on the current status of the MS-DOS implementation. In: Faulbaum F, editor. SoftStat '91. Stuttgart, D: Fischer; 1992. p. 57-64
4.
Suárez-Fariñas M, Haider A, Wittkowski KM. "Harshlighting" small blemishes on microarrays. BMC Bioinformatics 2005; 6: 65
5.
Wittkowski KM, Lee E, Nussbaum R, Chamian FN, Krueger JG. Combining several ordinal measures in clinical studies. Stat Med 2004; 23: 1579-92.
6.
Wittkowski KM, Liu X. A statistically valid alternative to the TDT. Hum Hered 2002; 54: 157-64