-
JoinBoost: Grow Trees Over Normalized Data Using Only SQL
Zezhou Huang, Rathijit Sen, Jiaxiang Liu, Eugene Wu
VLDB 2023
-
Saibot: A Differentially Private Data Search Platform
Zezhou Huang, Jiaxiang Liu, Daniel Alabi, Raul Castro Fernandez, Eugene Wu
VLDB 2023
-
GAMUT: Matrix Multiplication-Like Tasks on GPUs
Xincheng Xie, Junyoung Kim, Kenneth Ross
ADMS 2023
-
Amulet: Adaptive matrix-multiplication-like tasks
Junyoung Kim, Kenneth A Ross, Eric Sedlar, Lukas Stadler
DaMoN 2023
-
Interactive Interface Generation in Notebooks
Jeffrey Tao, Yiru Chen, Eugene Wu
SIGMOD (demo) 2022
-
PI2: Generating Visual Analysis Interfaces From Queries
Yiru Chen, Eugene Wu
SIGMOD 2022
-
Reptile: Aggregation-level Explanations for Hierarchical Data
Zachary Huang, Eugene Wu
SIGMOD 2022
-
Enabling SQL-based training data debugging for federated learning
Young Wu, Yejia Liu, Lampros Flokas, Jiannan Wang, Eugene Wu
VLDB 2022
-
Complaint-Driven Training Data Debugging at Interactive Speeds
Lampros Flokas, Young Wu, Jiannan Wang, Nakul Verma, Eugene Wu
SIGMOD 2022
-
Adaptive Code Generation for Data-Intensive Analytics
Wangda Zhang, Junyoung Kim, Kenneth A. Ross, Eric Sedlar, Lucas Stadler
VLDB 2021
-
Quantifying the effects of COVID-19 on restaurant reviews
Ivy Cao, Zizhou Liu, Giannis Karamanolakis, Daniel Hsu, Luis Gravano
SocialNLP 2021
-
Physical Visualization Design
Lana Ramjit, Zhaoning Kong, Ravi Netravali, Eugene Wu
SIGMOD (demo) 2020
-
VIP: A SIMD Vectorized Analytical Query Engine
Orestis Polychroniou, Kenneth A. Ross
VLDB Journal 2020
-
Parallel Prefix Sum with SIMD
Wangda Zhang, Yanbin Wang, Kenneth A. Ross
ADMS 2020
-
Permutation Index: Exploiting Data Skew for Improved Query Performance
Wangda Zhang, Kenneth A. Ross
ICDE 2020
-
Exploiting Data Skew for Improved Query Performance
Wangda Zhang, Kenneth A. Ross
IEEE TKDE 2020
-
Efficient Search over Genomic Short Read Data
Wangda Zhang, Mengdi Lin, Kenneth A. Ross
SSDBM 2020
-
Towards Complaint-driven ML Workflow Debugging
Lampros Flokas, Young Wu, Jiannan Wang, Eugene Wu
MLOps 2020
-
Monte Carlo Tree Search for Generating Interactive Data Analysis Interfaces
Yiru Chen, Eugene Wu
Intelligent Process Automation (IPA) 2020
-
Towards Practical Vectorized Analytical Query Engines
Orestis Polychroniou, Kenneth A. Ross
DaMoN 2019
-
Master of None Acceleration: A Comparison of Accelerator Architectures for Analytical Query Processing
Andrea Lottarini, João Pedro Cerqueira, Thomas J. Repetti, Stephen A. Edwards, Kenneth A. Ross, Mingoo Seok, Martha A. Kim
ISCA 2019
-
Acorn: Aggressive Result Caching in Spark SQL
Alana Ramjit, Matteo Interlandi, Eugene Wu, Ravi Netravali
SOCC 2019
-
Towards Democratizing Relational Data Visualization
Nan Tang, Eugene Wu, Guoliang Li
SIGMOD 2019 Tutorial
-
Precision Interfaces
Qianrui Zhang, Haoci Zhang, Viraj Rai, Thibault Sellam, Eugene Wu
SIGMOD 2019
-
Progressive Deep Web Crawling Through Keyword Queries For Data Enrichment
Pei Wang, Jiannan Wang, Ryan Shea, Eugene Wu
SIGMOD 2019
-
DeepBase: Deep Inspection of Neural Networks
Thibault Sellam, Kevin Lin, Ian Yiran Huang, Michelle Yang, Carl Vondrick, Eugene Wu
SIGMOD 2019
-
Distributed Joins and Data Placement for Minimal Network Traffic
Orestis Polychroniou, Wangda Zhang, Kenneth A. Ross
TODS 2018
-
Ten Years of Web Tables
Michael Cafarella, Alon Halevy, Daisy Zhe Wang, Hongrae Lee, Jayant Madhavan, Cong Yu, Eugene Wu,
PVLDB 2018 Invited Paper,
-
At a Glance: Approximate Entropy as a Measure of Line Chart Visualization Complexity
Gabriel Ryan, Abigail Mosca, Remco Chang, Eugene Wu
InfoVIS 2018
-
Provenance in Interactive Visualizations
Fotis Psallidas, Eugene Wu
HILDA 2018
-
Leveraging Quality Prediction Models for Automatic Writing Feedback
Hamed Nilforoshan, Eugene Wu
ICWSM 2018
-
Precision Interfaces for Different Modalities
HaoCi Zhang, Viraj Rai, Thibault Sellam, Eugene Wu
SIGMOD (demo) 2018
-
Demonstration of Smoke: A Deep Breath of Data-Intensive Lineage Applications
Fotis Psallidas, Eugene Wu
SIGMOD (demo) 2018
-
Deeper: A Data Enrichment System Powered by Deep Web.
Pei Wang, Yongjun He, Ryan Shea, Jiannan Wang, Eugene Wu.
SIGMOD (demo) 2018
-
“I Like the Way You Think!” Inspecting the Internal Logic of Recurrent Neural Networks
Thibault Sellam, Kevin Lin, Ian Yiran Huang, Carl Vondrick, Eugene Wu
SysML 2018
-
Smoke: Fine-grained Lineage at Interactive Speeds
Fotis Psallidas, Eugene Wu
VLDB 2018
-
BoostClean: Automated Error Detection and Repair for Machine Learning
Sanjay Krishnan, Michael J. Franklin, Ken Goldberg, Eugene Wu
Tech Report 2017
-
Network Synthesis for Database Processing Units
Andrea Lottarini, Stephen A. Edwards, Kenneth A. Ross, Martha A. Kim
DAC 2017
-
Deadlock-free joins in DB-mesh, an asynchronous systolic array accelerator
Bingyi Cao, Kenneth A. Ross, Stephen A. Edwards, Martha A. Kim
DAMON 2017
-
Combining Design and Performance in a Data Visualization Management System
Eugene Wu, Fotis Psallidas, Zhengjie Miao, Haoci Zhang,Laura Rettig, Yifan Wu, Thibault Sellam
CIDR 2017
-
A DeVIL-ish Approach to Inconsistency in Interactive Visualizations
Yifan Wu, Joe Hellerstein, Eugene Wu
Hilda 2016
-
PFunk-H: Approximate Query Processing using Perceptual Models
Daniel Alabi, Eugene Wu
Hilda 2016
-
Towards Reliable Interactive Data Cleaning: A User Survey and Recommendations
Sanjay Krishnan, Daniel Haas, Michael J. Franklin, Eugene Wu
Hilda 2016
-
ActiveClean: An Interactive Data Cleaning Framework For Modern Machine Learning
Sanjay Krishnan, Michael Franklin, Ken Goldberg, Jiannan Wang, Eugene Wu
SIGMOD 2016 Demo
-
SIMD-accelerated regular expression matching
E. A. Sitaridi, O. Polychroniou, K. A. Ross
DAMON 2016
-
k-Shape: Efficient and Accurate Clustering of Time Series
J. Paparrizos and L. Gravano
SIGMOD Record 2016
-
Detecting Devastating Diseases in Search Logs
J. Paparrizos, R. W. White, and E. Horvitz
SIGKDD 2016
-
Screening for Pancreatic Adenocarcinoma Using Signals From Web Search Logs: Feasibility Study and Results
J. Paparrizos, R. W. White, and E. Horvitz
Journal of Oncology Practice
-
CLAMShell: Speeding up Crowds for Low-latency Data Labeling
D. Haas, J. Wang, E. Wu, and M J. Franklin
VLDB 2016
-
Massively-Parallel Lossless Data Decompression
Evangelia A. Sitaridi, RenŽ MŸller, Tim Kaldewey, Guy M. Lohman, Kenneth A. Ross
ICPP 2016
-
A Course on Programming and Problem Solving
S. Sheth, C. Murphy, K. A. Ross, D. E. Shasha
SIGCSE 2016
-
GPU-accelerated string matching for database applications
E. Sitaridi and K. A. Ross
VLDB Journal 2016
-
Exploiting SSDs in operational multiversion databases
M. Sadoghi, K. A. Ross, M. Canim, B. Bhattacharjee
VLDB Journal 2016
-
Towards Perception-aware Interactive Data Visualization Systems
E. Wu and A. Nandi
DSIA 2015
-
SampleClean: Fast and Reliable Analytics on Dirty Data
S. Krishnan, J. Wang, M. J. Franklin, K. Goldberg, T. Kraska, T. Milo, and E. Wu
Overview paper
-
The Q100 Database Processing Unit
L. Wu, A. Lottarini, T. K. Paine, M. A. Kim, K. A. Ross
IEEE MICRO 2015
-
Efficient Lightweight Compression Alongside Fast Scans
O. Polychroniou and K. A. Ross
DAMON 2015
-
Implementing Latency-Insensitive Dataflow Blocks
B. Cao, K. A. Ross, M. A. Kim, and S. A. Edwards
MEMOCODE 2015
-
Wisteria: Nurturing Scalable Data Cleaning Infrastructure (Demo)
D. Haas, S. Krishnan, J. Wang, M. J. Franklin, and E. Wu
VLDB 2015
-
Collaborative Data Analytics with Datahub (Demo)
A. Bhardwaj, A. Deshpande, A. Elmore, D. Karger, S. Madden, A. Parameswaran, H. Subramanyam, E. Wu, and R. Zhang
VLDB 2015
-
Ranking Deep Web Text Collections for Scalable Information Extraction
P. Barrio, L. Gravano, and C. Develder
CIKM 2015
-
k-Shape: Efficient and Accurate Clustering of Time Series
J. Paparrizos and L. Gravano
SIGMOD 2015
-
Learning to Rank Adaptively for Scalable Information Extraction
P. Barrio, G. Sim›es, H. Galhardas, and L. Gravano
EDBT 2015
-
Rethinking SIMD Vectorization for In-Memory Databases
O. Polychroniou, A. Raghavan, K. A. Ross
SIGMOD 2015
-
The Case for Data Visualization Management Systems
E. Wu, L. Battle, and S. Madden
VLDB 2014
-
Hardware Partitioning for Big Data Analytics
L. Wu, R. J. Barker, M. A. Kim, K. A. Ross:
IEEE MICRO 2014
-
Reducing Database Locking Contention Through Multi-version Concurrency
M. Sadoghi, M. Canim, B. Bhattacharjee, F. Nagel, K. A. Ross
PVLDB 2014
-
Energy Analysis of Hardware and Software Range Partitioning
L. Wu, O. Polychroniou, R. J. Barker, M. A. Kim, and K. A. Ross
TOCS 2014
-
Coherent Somatic Mutation in Autoimmune Disease
K. A. Ross
PLoS One 2014
-
Vectorized Bloom Filters for Advanced SIMD Processors
O. Polychroniou and K. A. Ross
DAMON 2014
-
Q100: The Architecture and Design of a Database Processing Unit
L. Wu, A. Lottarini, T. K. Paine, M. A. Kim, and K. A. Ross
ASPLOS 2014
-
A Comprehensive Study of Main-memory Partitioning and its Application to Large-scale Comparison- and Radix-sort
O. Polychroniou and K. A. Ross
SIGMOD 2014
-
Track Join: Distributed Joins with Minimal Network Traffic
O. Polychroniou, R. Sen, and K. A. Ross
SIGMOD 2014
-
Detecting Foodborne Disease Outbreaks Using Social Media (demonstration)
F. Psallidas, L. Gravano, and many others
NYC Media Lab's Annual Summit, 2014
-
Information Extraction from Social Media for Public Health
N. Elhadad, L. Gravano, D. Hsu, S. Balter, V. Reddy, and H. Waechter
KDD at Bloomberg Workshop, Data Frameworks Track (KDD 2014), 2014
-
REEL: A Relation Extraction Learning Framework (poster)
P. Barrio, G. Sim›es, H. Galhardas, and L. Gravano
JCDL 2014
-
Using Online Reviews by Restaurant Patrons to Identify Unreported Cases of Foodborne Illness Ñ New York City, 2012Ð2013
C. Harrison, M. Jorder, H. Stern, F. Stavinsky, V. Reddy, H. Hanson, H. Waechter, L. Lowe, L. Gravano, and S. Balter
Centers for Disease Control and Prevention Morbidity and Mortality Weekly Report 2014
-
Data In Context: Aiding News Consumers while Taming Dataspaces
E. Wu, A. Marcus, and S. Madden
DBCrowd 2013
-
Mobile applications need Targeted Micro-updates
A. Cheung, L. Ravindranath, E. Wu, S. Madden, and H. Balakrishnan
APSYS 2013
-
Scorpion: Explaining Away Outliers in Aggregate Queries
E. Wu and S. Madden
VLDB 2013 (Selected as one of the best papers of the conference!)
-
SubZero: a Fine-Grained Lineage System for Scientific Databases
E. Wu, S. Madden, and M. Stonebraker
ICDE 2013 (Selected as one of the best papers of the conference!)
-
Making Updates Disk-I/O Friendly Using SSDs
M. Sadoghi, K. A. Ross, M. Canim, and B. Bhattacharjee
PVLDB 2013
-
Navigating Big Data with High-Throughput, Energy-Efficient Data Partitioning
L. Wu, R. J. Barker, M. A. Kim, and K. A. Ross
ISCA 2013 (A version of this article appears in IEEE Micro 2014 as one of the "Top Picks"
from 2013.)
-
Optimizing Select Conditions on GPUs
E. Sitaridi and K. A. Ross
DAMON 2013
-
High Throughput Heavy Hitter Aggregation for Modern SIMD Processors
O. Polychroniou and K. A. Ross
DAMON 2013
-
When Speed Has a Price: Fast Information Extraction Using Approximate Algorithms
G. Sim›es, H. Galhardas, and L. Gravano
VLDB 2013
-
Effective Event Identification in Social Media
F. Psallidas, H. Becker, M. Naaman, and L. Gravano
IEEE Data Eng. Bull. 2013
-
Using Restaurant Review Websites to Identify Unreported Complaints of Foodborne Illness
C. Harrison, M. Joarder, H. Stern, F. Stavinsky, V. Reddy, L. Gravano, and S. Balter
CSTE Annual Conference, Poster 2013
-
A Demonstration of DBWipes: Clean as You Query
E. Wu, S. Madden, and M. Stonebraker
VLDB 2012
-
Human-powered Sorts and Joins
A. Marcus, E. Wu, D. Karger, S. Madden, and Robert Miller
VLDB 2012
-
Path Processing Using Solid State Storage
M. Athanassoulis, B. Bhattacharjee, M. Canim, and K. A. Ross
ADMS 2012
-
Ameliorating Memory Contention of OLAP Operators on GPU Processors
E. Sitaridi and K. A. Ross
DAMON 2012 (Best Paper Award winner.)
-
Answering General Time-Sensitive Queries
W. Dakka, L. Gravano, and P. Ipeirotis
TKDE 2012
-
Identifying Content for Planned Events Across Social Media Sites
H. Becker, D. Iter, M. Naaman, and L. Gravano
WSDM 2012
-
Partitioning Techniques for Fine-Grained Indexing
E. Wu and S. Madden
ICDE 2011
-
Demonstration of Qurk: A Query Processor for Human Operators
A. Marcus, E. Wu, D. Karger, S. Madden, and R. Miller
SIGMOD 2011
-
No Bits Left Behind
Eugene Wu, Carlo Curino, and Sam Madden
CIDR 2011
-
Crowdsourced Databases: Query Processing with People
A. Marcus, E. Wu, S. Madden, and R. Miller
CIDR 2011
-
Relational Cloud: A Database-as-a-Service for the Cloud
C. Curino, E. Jones, R. Popa, N. Malviya, E. Wu, S. Madden, H. Balakrishnan, and N. Zeldovich
CIDR 2011
-
Column-Oriented Query Processing for Row Stores
A. El-Helw, K. A. Ross, B. Bhattacharjee, C. A. Lang, and G. A. Mihaila
International Workshop on Data Warehousing and OLAP 2011
-
Thread-Level Parallel Indexing of Update Intensive Moving-Object Workloads
D. Sidlauskas, K. A. Ross, C. S. Jensen, and S. Saltenis
International Symposium on Advances in Spatial and Temporal Databases 2011
-
Scalable Aggregation on Multicore Processors
Y. Ye, K. A. Ross, and N. Vesdapunt
DAMON 2011
-
Enhancing Recovery using an SSD Buffer Pool Extension
B. Bhattacharjee, K. A. Ross, C. A. Lang, G. A. Mihaila, and M. Banikazemi
DAMON 2011
-
SkylineSearch: Semantic Ranking and Result Visualization for Pubmed
J. Stoyanovich, M. Lodha, W. Mee, and K. A. Ross
SIGMOD 2011 Demo
-
Evidence for somatic gene conversion and deletion in bipolar disorder, Crohn's disease, coronary artery disease, hypertension, rheumatoid arthritis, type-1 diabetes, and type-2 diabetes
K. A. Ross
BMC Medicine 2011
-
Hip and Trendy: Characterizing Emerging Trends on Twitter
M. Naaman, H. Becker, and L. Gravano
JASIST 2011
-
Beyond Trending Topics: Real-World Event Identification on Twitter
H. Becker, M. Naaman, and L. Gravano
ICWSM 2011
-
Selecting Quality Twitter Content for Events
H. Becker, M. Naaman, and L. Gravano
ICWSM 2011
-
Quality Impact of Value Matching and Scoring in Top-k Entity Attribute Extraction
M. Solomon, L. Gravano, and C. Yu
DBRank 2011
-
Automatic Identification and Presentation of Twitter Content for Planned Events (demonstration)
H. Becker, F. Chen, D. Iter, M. Naaman, and L. Gravano
ICWSM 2011
-
Relational Cloud: The Case for a Database Service
C. Curino, E. Jones, Y. Zhang, E. Wu, and S. Madden
MIT CSAIL Technical Report
-
TrajStore: An Adaptive Storage System for Very Large Trajectory Data Sets
P. Cudre-Mauroux, E. Wu, and S. Madden
ICDE 2010
-
Storage Class Memory Aware Data Management
B. Bhattacharjee, M. Canim, C. Lang, G. Mihaila, and K. A. Ross
IEEE Data Eng. Bull. 2010
-
SSD Bufferpool Extensions for Database Systems
M. Canim, G. Mihaila, B. Bhattacharjee, K. A. Ross, and C. Lang
VLDB 2010
-
Buffered Bloom Filters on Solid State Storage
M. Canim, G. Mihaila, B. Bhattacharjee, C. Lang, and K. A. Ross
ADMS 2010
-
Automatic Contention Detection and Amelioration for Data Intensive Operations
J. Cieslewicz, K. A. Ross, K. Satsumi, and Y. Ye
SIGMOD 2010
-
Optimizing Read Convoys in Main-Memory Query Processing
K. A. Ross
DAMON 2010
-
Semantic Ranking and Result Visualization for Life Sciences Publications
J. Stoyanovich, W. Mee, and K. A. Ross
ICDE 2010
-
Learning Similarity Metrics for Event Identification in Social Media
H. Becker, M. Naaman, and L. Gravano
WSDM 2010
-
Popularity-Guided Top-k Extraction of Entity Attributes
M. Solomon, C. Yu, and L. Gravano
WebDB 2010
-
Exploiting Social Links for Event Identification in Social Media (poster)
H. Becker, B. Xiao, M. Naaman, and L. Gravano
SSM 2010
-
Demonstration of the TrajStore System
E. Wu, P. Cudre-Mauroux, and S. Madden
VLDB 2009
-
The Case for RodentStore: An Adaptive, Declarative Storage System
P. Cudre-Mauroux, E. Wu, and S. Madden
CIDR 2009
-
Efficient Index Compression in DB2 LUW
B. Bhattacharjee, L. Lim, T. Malkemus, G. Mihaila, K. A. Ross, S. Lau, C. McCarthur, Z. Toth, and R. Sherkat
PVLDB 2009
-
An Object Placement Advisor for DB2 Using Solid State Storage
M. Canim, B. Bhattacharjee, G. Mihaila, C. Lang, and K. A. Ross
PVLDB 2009
-
Cache Conscious Buffering for Database Operators with State
J. Cieslewicz, W. Mee, and K.A. Ross
DAMON 2009
-
Optimal Splitters for Database Partitioning with Size Bounds
K. A. Ross and J. Cieslewicz
ICDT 2009
-
Evaluating Application Mapping Scenarios on the Cell/B.E
A. L. Varbanescu, H. J. Sips, K. A. Ross, Q. Liu, A. Natsev, J. R. Smith, and L. K. Liu
Concurrency and Computation: Practice and Experience 2009
-
Join Optimization of Information Extraction Output: Quality Matters!
A. Jain, P. Ipeirotis, A. Doan, and L. Gravano
ICDE 2009
-
Event Identification in Social Media
H. Becker, M. Naaman, and L. Gravano
WebDB 2009
-
WebTables: Exploring the Power of Tables on the Web
M. Cafarella, A. Halevy, D. Wang, and E. Wu, Y. Zhang
VLDB 2008
-
Uncovering the Relational Web
M. Cafarella, N. Khoussainova, D. Wang, E. Wu, Y. Zhang, and A. Halevy
WebDB 2008
-
QueryScope: visualizing queries for repeatable database tuning
L. Hu, K. A. Ross, Y. C. Chang, C. A. Lang, and D. Zhang
VLDB Demo 2008
-
Modeling the Performance of Algorithms on Flash Memory Devices
K.A. Ross
DAMON 2008
-
Data Partitioning on Chip Multiprocessors
J. Cieslewicz and K.A. Ross
DAMON 2008
-
Database Optimizations for Modern Hardware
J. Cieslewicz and K.A. Ross
Proceedings of the IEEE 2008
-
Schema Polynomials and Applications
K. A. Ross and J. Stoyanovich
EDBT 2008
-
Classification-Aware Hidden-Web Text Database Selection
P. Ipeirotis and L. Gravano
TOIS 2008
-
Answering General Time-Sensitive Queries
W. Dakka, L. Gravano, and P. Ipeirotis
CIKM 2008
-
Optimizing SQL Queries over Text Databases
A. Jain, A. Doan, and L. Gravano
ICDE 2008
-
Building Query Optimizers for Information Extraction: The SQoUT Project
A. Jain, P. Ipeirotis, and L. Gravano
SIGMOD Record, Special Issue on "Managing Information Extraction," 2008
-
SASE: Complex Event Processing over Streams (Demo)
D. Gyllstrom, E. Wu, H. J. Chae, Y. Diao, P. Stahlberg, and G. Anderson
CIDR 2007
-
Adaptive Aggregation on Chip Multiprocessors
J. Cieslewicz and K. A. Ross
VLDB 2007
-
An Effective Strategy for Porting C++ Applications on Cell
A. L. Varbanescu, H. J. Sips, K. A. Ross, Q. Liu, L. K. Liu, A. Natsev, and J. R. Smith
ICPP 2007
-
Digital Media Indexing on the Cell Processor
L. K. Liu, Q. Liu, A. Natsev, K. A. Ross, J. R. Smith, and A. L. Varbanescu
ICME 2007
-
Running Applications on Cell BE - a Performance Study
A. L. Varbanescu, H. J. Sips, K. A. Ross, Q. Liu, A. Natsev, J. R. Smith, and L. K. Liu
Workshop on Compilers for Parallel Computers 2007
-
Parallel Buffers for Chip Multiprocessors
J. Cieslewicz, K.A. Ross, and I. Giannakakis
DAMON 2007
-
A Faceted Query Engine Applied to Archeology
K.A. Ross, A. Janevski, and J. Stoyanovich
Internet Archeology 2007
-
On the Adequacy of Partial Orders for Preference Composition
K. A. Ross
DBRank 2007
-
Practical Preference Relations for Large Data Sets
K. A. Ross, P. J. Stuckey, and A. Marian
DBRank 2007
-
Efficient Hash Probes on Modern Processors
K. A. Ross
ICDE 2007
-
Partitioned Optimization of Complex Queries
D. Chatziantoniou and K. A. Ross
Information Systems 2007
-
Finding Shapes in a Set of Points
K. A. Ross, D. Vespe, D. Hessing, and P. Jain
SIGMOD Record 2007
-
Towards a Query Optimizer for Text-Centric Tasks
P. Ipeirotis, E. Agichtein, P. Jain, and L. Gravano
TODS 2007
-
Modeling and Managing Changes in Text Databases
P. Ipeirotis, A. Ntoulas, J. Cho, and L. Gravano
TODS 2007
-
Efficient Summarization-Aware Search for Online News Articles
W. Dakka and L. Gravano
JCDL 2007
-
Efficient Keyword Search Across Heterogeneous Relational Databases
M. Sayyadian, H. LeKhac, A. Doan, and L. Gravano
ICDE 2007
-
SQL Queries Over Unstructured Text Databases
A. Jain, A. Doan, and L. Gravano
ICDE 2007
-
High-performance complex event processing over streams
E. Wu, Y. Diao, and S. Rizvi
SIGMOD 2006
-
SASE: Complex Event Processing over Streams
D. Gyllstrom, E. Wu, H. J. Chae, Y. Diao, P. Stahlberg, and G. Anderson
CoRR 2006
-
Probabilistic Data Management for Pervasive Computing: The Data Furnace Project
M. N. Garofalakis, K. P. Brown, M. J. Franklin, J. M. Hellerstein, D. Z. Wang, E. Michelakis, L. Tancau., E. Wu, S. R. Jeffery, and R. Aipperspach
IEEE Data Eng. Bull 2006
-
Alpha Radiation is a Major Germ-Line Mutagen over Evolutionary Timescales
K. A. Ross
Evolutionary Ecology Research 2006
-
Realizing Parallelism in Database Operations: Insights from a Massively Multithreaded Architecture
J. Cieslewicz, J. Berry, B. Hendrickson, and K.A. Ross
DAMON 2006 (Best Paper Award winner.)
-
To Search or to Crawl? Towards a Query Optimizer for Text-Centric Tasks
P. Ipeirotis, E. Agichtein, P. Jain, and L. Gravano
SIGMOD 2006
-
Design Considerations for High Fan-In Systems: The HiFi Approach
M. J. Franklin, S. R. Jeffery, S. Krishnamurthy, F. Reiss, S. Rizvi, E. Wu, O. Cooper, A. Edakkunni, and W. Hong
CIDR 2005
-
Academic Dishonesty and the Internet
K. A. Ross
Communications of the ACM 2005
-
A Faceted Query Engine Applied to Archaeology
K. A. Ross, A. Janevski, and J. Stoyanovich
VLDB 2005 Demo
-
Improving Database Performance on Simultaneous Multithreading Processors
J. Zhou, J. Cieslewicz, K. A. Ross, and M. Shah
VLDB 2005
-
Architecture Sensitive Database Design: Examples from the Columbia Group
K. A. Ross, J. Cieslewicz, J. Rao, and J. Zhou
IEEE Data Eng. Bull. 2005
-
Modeling and Managing Content Changes in Text Databases
P. Ipeirotis, A. Ntoulas, J. Cho, and L. Gravano
ICDE 2005
-
HiFi: A Unified Architecture for High Fan-in Systems
O. Cooper, A. Edakkunni, M. J. Franklin, W. Hong, S. R. Jeffery, S. Krishnamurthy, F. Reiss, S. Rizvi, and E. Wu
VLDB 2004 Demo
-
Symmetric Relations and Cardinality Bounded Multisets in Database Systems
K. A. Ross and J. Stoyanovich
VLDB 2004
-
Querying Faceted Databases
K. A. Ross and A. Janevski
SWDB Workshop 2004
-
Buffering Database Operations for Enhanced Instruction Cache Performance
J. Zhou and K. A. Ross
SIGMOD 2004
-
FlowPuter: A Cluster Architecture Unifying Switch, Server and Storage Processing
A. Aho, A. D. Keromytis, V. Misra, J. Nieh, K. A. Ross, and Y. Yemini
First International Workshop on Data Processing and Storage Networking: Towards Grid Computing 2004
-
Selection Conditions in Main Memory
K. A. Ross
TODS 2004
-
Optimizing Top-k Selection Queries over Multimedia Repositories
S. Chaudhuri, L. Gravano, and A. Marian
TKDE 2004
-
Evaluating Top-k Queries over Web-Accessible Databases
A. Marian, N. Bruno, and L. Gravano
TODS 2004
-
Learning to Find Answers to Questions on the Web
E. Agichtein, S. Lawrence, and L. Gravano
TOIT 2004
-
When one Sample is not Enough: Improving Text Database Selection Using Shrinkage
P. Ipeirotis and L. Gravano
SIGMOD 2004
-
Selectivity Estimation for String Predicates: Overcoming the Underestimation Problem
S. Chaudhuri, V. Ganti, and L. Gravano
ICDE 2004
-
Buffering Accesses to Memory-Resident Index Structures
J. Zhou and K. A. Ross
VLDB 2003
-
A Multi-Resolution Block Storage Model for Database Design
J. Zhou and K. A. Ross
IDEAS 2003
-
QProber: A System for Automatic Classification of Hidden-Web Databases
L. Gravano, P. Ipeirotis, and M. Sahami
TOIS 2003
-
Categorizing Web Queries According to Geographical Locality
L. Gravano, V. Hatzivassiloglou, and R. Lichtenstein
CIKM 2003
-
Efficient IR-Style Keyword Search over Relational Databases
V. Hristidis, L. Gravano, and Y. Papakonstantinou
VLDB 2003
-
Text Joins in an RDBMS for Web Data Integration
L. Gravano, P. Ipeirotis, N. Koudas, and D. Srivastava
WWW 2003
-
Querying Text Databases for Efficient Information Extraction
E. Agichtein and L. Gravano
ICDE 2003
-
Navigation- vs. Index-Based XML Multi-Query Processing
N. Bruno, L. Gravano, N. Koudas, and D. Srivastava
ICDE 2003
-
Text Joins for Data Cleansing and Integration in an RDBMS
L. Gravano, P. Ipeirotis, N. Koudas, and D. Srivastava
ICDE 2003
-
Modeling Query-Based Access to Text Databases
E. Agichtein, P. Ipeirotis, and L. Gravano
WebDB 2003
-
QXtract: A Building Block for Efficient Information Extraction from Text Databases (demonstration)
E. Agichtein and L. Gravano
SIGMOD 2003
-
Selection Conditions in Main Memory
K. A. Ross
PODS 2002
-
Implementing Database Operations Using SIMD Instructions
J. Zhou and K. A. Ross
SIGMOD 2002
-
Top-k Selection Queries over Relational Databases: Mapping Strategies and Performance Evaluation
N. Bruno, S. Chaudhuri, and L. Gravano
TODS 2002
-
Distributed Search over the Hidden-Web: Hierarchical Database Sampling and Selection
P. Ipeirotis and L. Gravano
VLDB 2002
-
Evaluating Top-k Queries over Web-Accessible Databases
N. Bruno, L. Gravano, and A. Marian
ICDE 2002
-
Extending SDARTS: Extracting Metadata from Web Databases and Interfacing with the Open Archives Initiative
P. Ipeirotis, T. Barry, and L. Gravano
JCDL 2002
-
Query- vs. Crawling-based Classification of Searchable Web Databases
L. Gravano, P. Ipeirotis, and M. Sahami
IEEE Data Eng. Bull. 2002
-
Cost-Based Unbalanced R-Trees
K. A. Ross, I. Sitzmann, and P. J. Stuckey
SSDBM 2001
-
Filtering Algorithms and Implementation for Very Fast Publish/Subscribe
F. Fabret, H. A. Jacobsen, F. Llirbat, J. Pereira, K. A. Ross, and D. Shasha
SIGMOD 2001
-
Approximate String Joins in a Database (Almost) for Free
L. Gravano, P. Ipeirotis, H. V. Jagadish, N. Koudas, S. Muthukrishnan, and D. Srivastava
VLDB 2001
-
Probe, Count, and Classify: Categorizing Hidden Web Databases
P. Ipeirotis, L. Gravano, and M. Sahami
SIGMOD 2001
-
STHoles: A Multidimensional Workload-Aware Histogram
N. Bruno, S. Chaudhuri, and L. Gravano
SIGMOD 2001
-
SDLIP + STARTS = SDARTS: A Protocol and Toolkit for Metasearching
N. Green, P. Ipeirotis, and L. Gravano
JCDL 2001
-
PERSIVAL, a System for Personalized Search and Summarization over Multimedia Healthcare Information
K. McKeown, S.-F. Chang, J. Cimino, S. Feiner, C. Friedman, L. Gravano, V. Hatzivassiloglou, S. Johnson, D. Jordan, J. Klavans, A. Kushniruk, V. Patel, and S. Teufel
JCDL 2001
-
Learning Search Engine Specific Query Transformations for Question Answering
E. Agichtein, S. Lawrence, and L. Gravano
WWW 2001
-
Snowball: A Prototype System for Extracting Relations from Large Text Collections (demonstration)
E. Agichtein, L. Gravano, J. Pavel, V. Sokolova, and A. Voskoboynik
SIGMOD 2001
-
PERSIVAL Demo: Categorizing Hidden-Web Resources (demonstration)
P. Ipeirotis, L. Gravano, and M. Sahami
JCDL 2001
-
Using q-grams in a DBMS for Approximate String Processing
L. Gravano, P. Ipeirotis, H. V. Jagadish, N. Koudas, S. Muthukrishnan, L. Pietarinen, and D. Srivastava
IEEE Data Eng. Bull. 2001
-
Simplifying Data Access: The Energy Data Collection Project
J. L. Ambite, Y. Arens, E. Hovy, A. Philpot, L. Gravano, V. Hatzivassiloglou, and J. Klavans
IEEE Computer 2001
-
Independence Diagrams: A Technique for Data Visualization
S. Berchtold, H. V. Jagadish, and K. A. Ross
Journal of Electronic Imaging 2000
-
Publish/Subscribe on the Web at Extreme Speed
J. Pereira, F. Fabret, F. Llirbat, R. Preotiuc-Pietro, K. A. Ross, and D. Shasha
VLDB 2000 Demo
-
Optimizing Selections over Datacubes
K. A. Ross and K. A. Zaman
SSDBM 2000
-
Serving Datacube Tuples from Main Memory
K. A. Ross and K. A. Zaman
SSDBM 2000
-
Making B+-Trees Cache Conscious in Main Memory
J. Rao and K. A. Ross
SIGMOD 2000
-
Computing Geographical Scopes of Web Resources
J. Ding, L. Gravano, and N. Shivakumar
VLDB 2000
-
An Investigation of Linguistic Features and Clustering Algorithms for Topical Document Clustering
V. Hatzivassiloglou, L. Gravano, and A. Maganti
SIGIR 2000
-
Snowball: Extracting Relations from Large Plain-Text Collections
E. Agichtein and L. Gravano
JCDL 2000
-
Automatic Classification of Text Databases through Query Probing
P. Ipeirotis, L. Gravano, and M. Sahami
WebDB 2000, Also in LNCS Series no. 1997, Springer, 2001
-
Combining Strategies for Extracting Relations from Text Collections
E. Agichtein, E. Eskin, and L. Gravano
DMKD 2000
-
Characterizing Web Resources for Improved Search
L. Gravano
Position paper for the First NSF-DELOS Workshop on Information Seeking, Searching, and Querying in Digital Libraries, 2000
-
Cache Conscious Indexing for Decision-Support in Main Memory
J. Rao and K. A. Ross
VLDB 1999
-
Fast Joins Using Join Indices
Z. Li and K. A. Ross
VLDB Journal 1999
-
GlOSS: Text-Source Discovery over the Internet
L. Gravano, H. Garcia-Molina, A. Tomasic
TODS 1999
-
Evaluating Top-k Selection Queries
S. Chaudhuri and L. Gravano
VLDB 1999
-
Exploiting Geographical Location Information of Web Pages
O. Buyukkokten, J. Cho, H. Garcia-Molina, L. Gravano, and N. Shivakumar
WebDB 1999
-
Database Research at Columbia University
S. F. Chang, L. Gravano, G. E. Kaiser, K. A. Ross, and S. Stolfo
SIGMOD Record 1998
-
Reusing Invariants: A New Strategy for Correlated Queries
J. Rao and K. A. Ross
SIGMOD 1998
-
Complex Aggregation at Multiple Granularities
K. A. Ross, D. Srivastava, and D. Chatziantoniou
EDBT 1998
-
Foundations of Aggregation Constraints
K. A. Ross, D. Srivastava, P. J. Stuckey and S. Sudarshan
Theoretical Computer Science 1998
-
Mediating and Metasearching on the Internet
L. Gravano and Y. Papakonstantinou
IEEE Data Eng. Bull 1998
-
The Stanford InfoBus and Its Service Layers: Augmenting the Internet with Higher-Level Information Management Protocols
M. Roscheisen, M. Baldonado, C.-C. K. Chang, L. Gravano, S. Ketchpel, and A. Paepcke
Digital Libraries in Computer Science: The MeDoc Approach, LNCS Series, 1998
-
The New Jersey Data Reduction Report
D. Barbara, W. DuMouchel, C. Faloutsos, P. J. Haas, J. M. Hellerstein, Y. Ioannidis, H. V. Jagadish, T. Johnson, R. Ng, V. Poosala, K. A. Ross, and K. C. Sevcik
IEEE Data Eng. Bull. 1997
-
Attribute-Oriented View Definitions in Relational and Deductive Databases
I. S. Mumick and K. A. Ross
Fifth International Conference on Deductive and Object-Oriented Databases 1997
-
Fast Computation of Sparse Datacubes
K. A. Ross and D. Srivastava
VLDB 1997
-
Groupwise Processing of Relational Queries
D. Chatziantoniou and K. A. Ross
VLDB 1997
-
Implementing Incremental View Maintenance in Nested Data Models
A. Kawaguchi, D. Lieuwen, I. S. Mumick, and K. A. Ross
International Workshop on Database Programming Languages 1997
-
Faster Joins, Self-Joins and Multi-Way Joins Using Join Indices
H. Lei and K. A. Ross
International Workshop on Next Generation Information Technologies and Systems 1997
-
Supporting Multiple View Maintenance Policies
L. Colby, A. Kawaguchi, D. Lieuwen, I. Mumick, and K. A. Ross
SIGMOD 1997
-
Concurrency Control Theory for Deferred Materialized Views
A. Kawaguchi, D. Lieuwen, I. Mumick, D. Quass, and K. A. Ross
ICDT 1997
-
Monotonic Aggregation in Deductive Databases
K. A. Ross and Y. Sagiv
Journal of Computer and System Sciences 1997
-
The Stanford Digital Library Metadata Architecture
M. Baldonado, C.-C. K. Chang, L. Gravano, and A. Paepcke
International Journal on Digital Libraries 1997
-
Data Structures for Efficient Broker Implementation
A. Tomasic, L. Gravano, C. Lue, P. Schwarz, and L. Haas
TOIS 1997
-
Merging Ranks from Heterogeneous Internet Sources
L. Gravano and H. Garcia-Molina
VLDB 1997
-
Metadata for Digital Libraries: Architecture and Design Rationale
M. Baldonado, C.-C. K. Chang, L. Gravano, and A. Paepcke
JCDL 1997
-
STARTS: Stanford Proposal for Internet Meta-Searching
L. Gravano, C.-C. K. Chang, H. Garcia-Molina, and A. Paepcke
SIGMOD 1997
-
Resource Indexing and Discovery In a Globally Distributed Digital Library
L. Gravano
Position paper for the NSF-EU Digital Library Collaboratory Working Group, 1997
-
Querying Multiple Document Collections across the Internet
L. Gravano
Ph.D. Dissertation, Stanford University (advisor: H. Garcia-Molina), August 1997
-
Querying Multiple Features of Groups in Relational Databases
D. Chatziantoniou and K. A. Ross
VLDB 1996
-
Materialized View Maintenance and Integrity Constraint Checking: Trading Space for Time
K. A. Ross, D. Srivastava, and S. Sudarshan
SIGMOD 1996
-
Tail Recursion Elimination in Deductive Databases
K. A. Ross
ACM Transactions on Database Systems 1996
-
dSCAM: Finding Document Copies across Multiple Databases
H. Garcia-Molina, L. Gravano, and N. Shivakumar
PDIS 1996
-
Optimizing Queries over Multimedia Repositorie
S. Chaudhuri and L. Gravano
SIGMOD 1996
-
Optimizing Queries over Multimedia Repositories
S. Chaudhuri and L. Gravano
IEEE Data Eng. Bull 1996
-
Informal Internet Standards at Stanford
L. Gravano, C.-C. K. Chang, H. Garcia-Molina, and A. Paepcke
Position paper for the 1996 World-Wide Web Consortium (W3C) Distributed Indexing/Searching Workshop 1996
-
Adapting Materialized Views After Redefinitions: Techniques and a Performance Study
A. Gupta, I. S. Mumick, J. Rao, and K. A. Ross
SIGMOD 1995
-
Efficiently Following Object References for Large Object Collections and Small Main Memory
K. A. Ross
Fourth International Conference on Deductive and Object-Oriented Databases 1995
-
Structural Totality and Constraint Stratification
K. A. Ross
PODS 1995
-
PERF Join: An Alternative To Semijoin and Bloom Join
Z. Li and K. A. Ross
CIKM 1995
-
Generalizing GlOSS to Vector-Space Databases and Broker Hierarchies
L. Gravano and H. Garcia-Molina
VLDB 1995
-
Efficient Incremental Evaluation of Queries with Aggregation
R. Ramakrishnan, K. A. Ross, D. Srivastava, and S. Sudarshan
International Symposium on Logic Programming 1994
-
A Syntactic Stratification Condition Using Constraints
K. A. Ross
International Symposium on Logic Programming 1994
-
Constraint Stratification in Deductive Databases
K. A. Ross
ICLP Workshop on Deductive Databases 1994
-
Modular Stratification and Magic Sets for Datalog Programs with Negation
K. A. Ross
JCSS 1994
-
On Negation in HiLog
K. A. Ross
Journal of Logic Programming 1994
-
Storage-Efficient, Deadlock-Free Packet Routing Algorithms for Torus Networks
R. Cypher and L. Gravano
TC 1994
-
Requirements for Deadlock-Free, Adaptive Packet Routing
R. Cypher and L. Gravano
SIAM Journal on Computing 1994
-
Adaptive Deadlock- and Livelock-Free Routing with All Minimal Paths in Torus Networks
L. Gravano, G. Pifarre, P. Berman, and J. Sanz
TPDS 1994
-
Adaptive Deadlock- and Livelock-Free Routing in the Hypercube Network
G. Pifarre, L. Gravano, G. Denicolay, and J. Sanz
TPDS 1994
-
Fully Adaptive Minimal Deadlock-Free Packet Routing in Hypercubes, Meshes, and Other Networks: Algorithms and Simulations
G. Pifarre, L. Gravano, S. Felperin, and J. Sanz
TPDS 1994
-
Precision and Recall of GlOSS Estimators for Database Discovery
L. Gravano, H. Garcia-Molina, and A. Tomasic
PDIS 1994
-
The Effectiveness of GlOSS for the Text-Database Discovery Problem
L. Gravano, H. Garcia-Molina, and A. Tomasic
SIGMOD 1994
-
`Noodle: A Language for Declarative Querying in an Object-Oriented Database
I. S. Mumick and K. A. Ross
Third International Conference on Deductive and Object-Oriented Databases 1993
-
An Architecture for Declarative Object-Oriented Databases
I. S. Mumick and K. A. Ross
JILPS Workshop on Deductive Databases 1992
-
Relations with Relation Names as Arguments: Algebra and Calculus
K. A. Ross
PODS 1992
-
A Procedural Semantics for Well-Founded Negation in Logic Programs
K. A. Ross
Journal of Logic Programming 1992
-
Requirements for Deadlock-Free, Adaptive Packet Routing
R. Cypher and L. Gravano
PODC 1992
-
Adaptive, Deadlock-Free Packet Routing in Torus Networks with Minimal Storage
R. Cypher and L. Gravano
ICPP 1992
-
Adaptive Deadlock- and Livelock-Free Routing with All Minimal Paths in Torus Networks
P. Berman, L. Gravano, G. Pifarre, and J. Sanz
SPAA 1992
-
Adaptive Deadlock-Free Worm-Hole Routing in Hypercubes
L. Gravano, G. Pifarre, G. Denicolay, and J. Sanz
IPPS 1992
-
Modular Acyclicity and Tail Recursion in Logic Programs
K. A. Ross
PODS 1991
-
On Negation in HiLog
K. A. Ross
PODS 1991
-
Blending in the Ends of Chevron Stockpiles
G. K. Robinson and K. A. Ross
Bulk Solids Handling 1991
-
The Well-Founded Semantics for General Logic Programs
A. V. Gelder, K. A. Ross, and J. S. Schlipf
JACM 1991
-
Glue-Nail: A Deductive Database System
G. Phipps, M. A. Derr, and K. A. Ross
SIGMOD 1991
-
Fully-Adaptive Routing: Packet Switching Performance and Worm-Hole Algorithms
S. Felperin, L. Gravano, G. Pifarre, and J. Sanz
SC 1991
-
Fully-Adaptive Minimal Deadlock-Free Packet Routing in Hypercubes, Meshes, and Other Networks
G. Pifarre, L. Gravano, S. Felperin, and J. Sanz
SPAA 1991
-
Routing Techniques for Massively Parallel Communication
S. Felperin, L. Gravano, G. Pifarre, and J. Sanz
Proceedings of the IEEE 1991
-
Modular Stratification and Magic Sets for Datalog Programs with Negation
K. A. Ross
PODS 1990
-
A Procedural Semantics for Well-Founded Negation in Logic Programs
K. A. Ross
PODS 1989
-
The Well-Founded Semantics for Disjunctive Logic Programs
K. A. Ross
First International Conference on Deductive and Object Oriented Databases 1989
-
Unfounded Sets and Well-Founded Semantics for General Logic Programs
A. V. Gelder, K. A. Ross, and J. S. Schlipf
PODS 1988
-
Inferring Negative Information From Disjunctive Databases
K. A. Ross and R. W. Topor
Journal of Automated Reasoning 1988
-
Iteration of Some Discretizations of the Nonlinear Schrodinger Equation
K. A. Ross and C. J. Thompson
Physica 1986
-
Chaotic Planar States of the Discrete Dynamical Anisotropic Heisenberg Spin Chain
C. J. Thompson, K. A. Ross, B. J. P. Thompson, and M. Lakshmanan
Physica 1985