SciDB: A Database Management System for Applications with Complex Analytics | Semantic Scholar (2024)

@article{Stonebraker2013SciDBAD, title={SciDB: A Database Management System for Applications with Complex Analytics}, author={Michael Stonebraker and Paul G. Brown and Donghui Zhang and Jacek Becla}, journal={Computing in Science \& Engineering}, year={2013}, volume={15}, pages={54-62}, url={https://api.semanticscholar.org/CorpusID:15021242}}
  • M. Stonebraker, P. Brown, D. Zhang, J. Becla
  • Published in Computing in science… 1 May 2013
  • Computer Science, Engineering
  • Computing in Science & Engineering

A description and discussion of the SciDB database management system focuses on lessons learned, application areas, performance comparisons against other solutions, and additional approaches to

177 Citations

  • Highly Influential Citations: 19
  • Background Citations: 77
  • Methods Citations: 62
  • Results Citations: 1

Figures and Tables from this paper

  • table 1
  • figure 2
  • figure 4

177 Citations

Evaluating Genomic Big Data Operations on SciDB and Spark
    Simone Cattani, S. Ceri, Abdulrahman Kaitoua, Pietro Pinoli

    Biology, Computer Science

    ICWE

  • 2017

A new, holistic data management system for genomics is under development; it provides high-level abstractions for querying large genomic datasets and is designed to leverage data management engines for low-level data access.

  • 4
Data Management in Machine Learning Systems
    Matthias Boehm, Arun Kumar, Jun Yang

    Computer Science

    Data Management in Machine Learning Systems

  • 2019

This work states that large-scale data analytics using machine learning (ML) underpins many modern data-driven applications and provides means of specifying and executing these ML workloads in an efficacious manner.

  • 38
Using SIMD Instructions to Accelerate Sequence Similarity Searches Inside a Database System
    Sidath Randeni Kadupitige, Uwe Röhm

    Computer Science

    ADC

  • 2018

Although many database management systems (DBMS) are extensible via stored procedures to implement transactions or complex algorithms, these stored procedures are usually unable to leverage the inbuilt optimizations provided by the query engine, so other optimization avenues must be explored.

Mining Big and Fast Data: Algorithms and Optimizations for Real-Time Data Processing
    Muhammad Anis Uddin Nasir

    Computer Science, Engineering

  • 2018

In the last decade, real-time data processing has attracted much attention from both the academic community and industry, as the meaning of big data has evolved to also incorporate the speed of data

  • 2
Database Architectures: Current State and Development
    J. Pokorný, Karel Richta

    Computer Science

    DATA

  • 2015

The paper briefly presents the history and development of database management tools over the last decade, including Big Data and Big Analytics as driving forces that, together with progress in hardware development, led to new DBMS architectures.

AIDA - Abstraction for Advanced In-Database Analytics
    Joseph Vinish D'silva, Florestan De Moor, Bettina Kemme

    Computer Science

    Proc. VLDB Endow.

  • 2018

AIDA is proposed - an abstraction for advanced in-database analytics that emulates the syntax and semantics of popular data science packages but transparently executes the required transformations and computations inside the RDBMS.

  • 38
A Paradigm for Scalable, Transactional, and Efficient Spatial Indexes
    Ning Gao

    Computer Science, Engineering

  • 2018

Approximation is a higher level and expressive abstraction for scalable fine-grained systems and can be used for modeling, simulation, and control of distributed systems.

  • 1
SciDB DBMS Research at
    M. Stonebraker, Jennie Duggan, L. Battle, Olga Papaemmanouil

    Computer Science, Engineering

  • 2013

The work on making SciDB elastic, providing skew-aware join strategies, and producing scalable visualizations of scientific data is summarized.

The Rise of NoSQL Systems: Research and Pedagogy
    A. Bajaj, Wade Bick

    Computer Science, Education

    J. Database Manag.

  • 2020

This review article discusses how teaching of NoSQL may be incorporated into traditional undergraduate database courses in information systems curricula and reviews recent research on each type of system.

  • 4
SciDB DBMS Research at M.I.T
    M. Stonebraker, Jennie Duggan, L. Battle, Olga Papaemmanouil

    Computer Science, Engineering

    IEEE Data Eng. Bull.

  • 2013

The work on making SciDB elastic, providing skew-aware join strategies, and producing scalable visualizations of scientific data is summarized.

  • 60


18 References

The Architecture of SciDB
    M. Stonebraker, Paul Brown, A. Poliakov, Suchi Raman

    Computer Science

    SSDBM

  • 2011

This paper presents the main design decisions of SciDB, including decisions concerning a high-level, SQL-like query language, the issues facing the query optimizer and executor and efficient storage management for arrays.

  • 202
Agrios: A Hybrid Approach to Scalable Data Analysis Systems
    Patrick Leyshock

    Computer Science

  • 2012

At the heart of Agrios lies Bonneville, an extension of the Columbia database optimizer. Bonneville utilizes Columbia’s methods for exploring the search space, but differs in several ways.

  • 4
A comparison of approaches to large-scale data analysis
    Andrew Pavlo, Erik Paulson, M. Stonebraker

    Computer Science

    SIGMOD Conference

  • 2009

A benchmark consisting of a collection of tasks that are run on an open source version of MR as well as on two parallel DBMSs shows a dramatic performance difference between the two paradigms.

  • 1,241
A Demonstration of DBWipes: Clean as You Query
    Eugene Wu, S. Madden, M. Stonebraker

    Computer Science

    Proc. VLDB Endow.

  • 2012

DBWipes is presented, a novel data cleaning system that allows users to execute aggregate queries, and interactively detect, understand, and clean errors in the query results.

  • 12
Report from the first Workshop on Extremely Large Databases
    J. Becla, Kian-Tat Lim

    Computer Science

    Data Sci. J.

  • 2008

This paper is the final report of the discussions and activities at the workshop on extremely large databases, and focuses on practical solutions or influencing DBMS vendors.

Efficient Versioning for Scientific Array Databases
    Adam Seering, P. Cudré-Mauroux, S. Madden, M. Stonebraker

    Computer Science

    2012 IEEE 28th International Conference on Data…

  • 2012

This paper describes a versioned database storage manager for SciDB, designed to efficiently store and retrieve array-oriented data, exposing a "no-overwrite" storage model in which each update creates a new "version" of an array.

  • 52
ArrayStore: a storage manager for complex parallel array processing
    Emad Soroush, M. Balazinska, Daniel L. Wang

    Computer Science

    SIGMOD '11

  • 2011

It is shown that ArrayStore outperforms previously proposed storage management strategies in the context of its diverse target workload, and develops a new and efficient storage-management mechanism that enables parallel processing of operations that must access data from adjacent partitions.

  • 134
EarthDB: scalable analysis of MODIS data using SciDB
    Gary Planthaber, M. Stonebraker, J. Frew

    Computer Science, Environmental Science

    BigSpatial '12

  • 2012

EarthDB is presented, a system that eliminates a cruel dilemma by enabling painless importing of MODIS Level 1B data into SciDB, a highly scalable science-oriented database platform that abstracts away the complexity of distributed storage and analysis of complex multi-dimensional data.

  • 66
The Design of the POSTGRES Storage System
    M. Stonebraker

    Computer Science

    VLDB

  • 1987

The design of the storage system for the POSTGRES data base system under construction at Berkeley is novel in several ways and suggests that it is performance competitive with WAL systems in many situations.

  • 302
Database Architecture Evolution: Mammals Flourished long before Dinosaurs became Extinct
    P. Boncz, S. Manegold, M. Kersten

    Computer Science, Biology

    Proc. VLDB Endow.

  • 2009

A trip report on the quest to find a database architecture solution that is Scalable & Speedy, to run on anything from small ARM processors up to globally distributed compute clusters, Stable & Secure, to service a broad user community, Small & Simple.

  • 142
