Selected Publications (Show All)

  1. Adaptive Code Generation for Data-Intensive Analytics
    Wangda Zhang, Junyoung Kim, Kenneth A. Ross, Eric Sedlar, Lucas Stadler
    VLDB 2021
  2. Physical Visualization Design
    Lana Ramjit, Zhaoning Kong, Ravi Netravali, Eugene Wu
    SIGMOD (demo) 2020
  3. VIP: A SIMD Vectorized Analytical Query Engine
    Orestis Polychroniou, Kenneth A. Ross
    VLDB Journal 2020
  4. Parallel Prefix Sum with SIMD
    Wangda Zhang, Yanbin Wang, Kenneth A. Ross
    ADMS 2020
  5. Permutation Index: Exploiting Data Skew for Improved Query Performance
    Wangda Zhang, Kenneth A. Ross
    ICDE 2020
  6. Exploiting Data Skew for Improved Query Performance
    Wangda Zhang, Kenneth A. Ross
    IEEE TKDE 2020
  7. Efficient Search over Genomic Short Read Data
    Wangda Zhang, Mengdi Lin, Kenneth A. Ross
    SSDBM 2020
  8. Towards Complaint-driven ML Workflow Debugging
    Lampros Flokas, Young Wu, Jiannan Wang, Eugene Wu
    MLOps 2020
  9. Monte Carlo Tree Search for Generating Interactive Data Analysis Interfaces
    Yiru Chen, Eugene Wu
    Intelligent Process Automation (IPA) 2020
  10. Towards Practical Vectorized Analytical Query Engines
    Orestis Polychroniou, Kenneth A. Ross
    DaMoN 2019
  11. Master of None Acceleration: A Comparison of Accelerator Architectures for Analytical Query Processing
    Andrea Lottarini, João Pedro Cerqueira, Thomas J. Repetti, Stephen A. Edwards, Kenneth A. Ross, Mingoo Seok, Martha A. Kim
    ISCA 2019
  12. Precision Interfaces
    Qianrui Zhang, Haoci Zhang, Viraj Rai, Thibault Sellam, Eugene Wu
    SIGMOD 2019
  13. DeepBase: Deep Inspection of Neural Networks
    Thibault Sellam, Kevin Lin, Ian Yiran Huang, Michelle Yang, Carl Vondrick, Eugene Wu
    SIGMOD 2019
  14. Distributed Joins and Data Placement for Minimal Network Traffic
    Orestis Polychroniou, Wangda Zhang, Kenneth A. Ross
    TODS 2018
  15. Ten Years of Web Tables
    Michael Cafarella, Alon Halevy, Daisy Zhe Wang, Hongrae Lee, Jayant Madhavan, Cong Yu, Eugene Wu,
    PVLDB 2018 Invited Paper,
  16. At a Glance: Approximate Entropy as a Measure of Line Chart Visualization Complexity
    Gabriel Ryan, Abigail Mosca, Remco Chang, Eugene Wu
    InfoVIS 2018
  17. Provenance in Interactive Visualizations
    Fotis Psallidas, Eugene Wu
    HILDA 2018
  18. Leveraging Quality Prediction Models for Automatic Writing Feedback
    Hamed Nilforoshan, Eugene Wu
    ICWSM 2018
  19. Precision Interfaces for Different Modalities
    HaoCi Zhang, Viraj Rai, Thibault Sellam, Eugene Wu
    SIGMOD (demo) 2018
  20. “I Like the Way You Think!” Inspecting the Internal Logic of Recurrent Neural Networks
    Thibault Sellam, Kevin Lin, Ian Yiran Huang, Carl Vondrick, Eugene Wu
    SysML 2018
  21. Smoke: Fine-grained Lineage at Interactive Speeds
    Fotis Psallidas, Eugene Wu
    VLDB 2018
  22. BoostClean: Automated Error Detection and Repair for Machine Learning
    Sanjay Krishnan, Michael J. Franklin, Ken Goldberg, Eugene Wu
    Tech Report 2017
  23. Network Synthesis for Database Processing Units
    Andrea Lottarini, Stephen A. Edwards, Kenneth A. Ross, Martha A. Kim
    DAC 2017
  24. Deadlock-free joins in DB-mesh, an asynchronous systolic array accelerator
    Bingyi Cao, Kenneth A. Ross, Stephen A. Edwards, Martha A. Kim
    DAMON 2017
  25. Combining Design and Performance in a Data Visualization Management System
    Eugene Wu, Fotis Psallidas, Zhengjie Miao, Haoci Zhang,Laura Rettig, Yifan Wu, Thibault Sellam
    CIDR 2017
  26. A DeVIL-ish Approach to Inconsistency in Interactive Visualizations
    Yifan Wu, Joe Hellerstein, Eugene Wu
    Hilda 2016
  27. PFunk-H: Approximate Query Processing using Perceptual Models
    Daniel Alabi, Eugene Wu
    Hilda 2016
  28. Towards Reliable Interactive Data Cleaning: A User Survey and Recommendations
    Sanjay Krishnan, Daniel Haas, Michael J. Franklin, Eugene Wu
    Hilda 2016
  29. ActiveClean: An Interactive Data Cleaning Framework For Modern Machine Learning
    Sanjay Krishnan, Michael Franklin, Ken Goldberg, Jiannan Wang, Eugene Wu
    SIGMOD 2016 Demo
  30. SIMD-accelerated regular expression matching
    E. A. Sitaridi, O. Polychroniou, K. A. Ross
    DAMON 2016
  31. k-Shape: Efficient and Accurate Clustering of Time Series
    J. Paparrizos and L. Gravano
    SIGMOD Record 2016
  32. Detecting Devastating Diseases in Search Logs
    J. Paparrizos, R. W. White, and E. Horvitz
    SIGKDD 2016
  33. Screening for Pancreatic Adenocarcinoma Using Signals From Web Search Logs: Feasibility Study and Results
    J. Paparrizos, R. W. White, and E. Horvitz
    Journal of Oncology Practice
  34. CLAMShell: Speeding up Crowds for Low-latency Data Labeling
    D. Haas, J. Wang, E. Wu, and M J. Franklin
    VLDB 2016
  35. Massively-Parallel Lossless Data Decompression
    Evangelia A. Sitaridi, RenŽ MŸller, Tim Kaldewey, Guy M. Lohman, Kenneth A. Ross
    ICPP 2016
  36. A Course on Programming and Problem Solving
    S. Sheth, C. Murphy, K. A. Ross, D. E. Shasha
    SIGCSE 2016
  37. GPU-accelerated string matching for database applications
    E. Sitaridi and K. A. Ross
    VLDB Journal 2016
  38. Exploiting SSDs in operational multiversion databases
    M. Sadoghi, K. A. Ross, M. Canim, B. Bhattacharjee
    VLDB Journal 2016
  39. Towards Perception-aware Interactive Data Visualization Systems
    E. Wu and A. Nandi
    DSIA 2015
  40. SampleClean: Fast and Reliable Analytics on Dirty Data
    S. Krishnan, J. Wang, M. J. Franklin, K. Goldberg, T. Kraska, T. Milo, and E. Wu
    Overview paper
  41. The Q100 Database Processing Unit
    L. Wu, A. Lottarini, T. K. Paine, M. A. Kim, K. A. Ross
    IEEE MICRO 2015
  42. Efficient Lightweight Compression Alongside Fast Scans
    O. Polychroniou and K. A. Ross
    DAMON 2015
  43. Implementing Latency-Insensitive Dataflow Blocks
    B. Cao, K. A. Ross, M. A. Kim, and S. A. Edwards
    MEMOCODE 2015
  44. Wisteria: Nurturing Scalable Data Cleaning Infrastructure (Demo)
    D. Haas, S. Krishnan, J. Wang, M. J. Franklin, and E. Wu
    VLDB 2015
  45. Collaborative Data Analytics with Datahub (Demo)
    A. Bhardwaj, A. Deshpande, A. Elmore, D. Karger, S. Madden, A. Parameswaran, H. Subramanyam, E. Wu, and R. Zhang
    VLDB 2015
  46. Ranking Deep Web Text Collections for Scalable Information Extraction
    P. Barrio, L. Gravano, and C. Develder
    CIKM 2015
  47. k-Shape: Efficient and Accurate Clustering of Time Series
    J. Paparrizos and L. Gravano
    SIGMOD 2015
  48. Learning to Rank Adaptively for Scalable Information Extraction
    P. Barrio, G. Sim›es, H. Galhardas, and L. Gravano
    EDBT 2015
  49. Rethinking SIMD Vectorization for In-Memory Databases
    O. Polychroniou, A. Raghavan, K. A. Ross
    SIGMOD 2015
  50. The Case for Data Visualization Management Systems
    E. Wu, L. Battle, and S. Madden
    VLDB 2014
  51. Hardware Partitioning for Big Data Analytics
    L. Wu, R. J. Barker, M. A. Kim, K. A. Ross:
    IEEE MICRO 2014
  52. Reducing Database Locking Contention Through Multi-version Concurrency
    M. Sadoghi, M. Canim, B. Bhattacharjee, F. Nagel, K. A. Ross
    PVLDB 2014
  53. Energy Analysis of Hardware and Software Range Partitioning
    L. Wu, O. Polychroniou, R. J. Barker, M. A. Kim, and K. A. Ross
    TOCS 2014
  54. Coherent Somatic Mutation in Autoimmune Disease
    K. A. Ross
    PLoS One 2014
  55. Vectorized Bloom Filters for Advanced SIMD Processors
    O. Polychroniou and K. A. Ross
    DAMON 2014
  56. Q100: The Architecture and Design of a Database Processing Unit
    L. Wu, A. Lottarini, T. K. Paine, M. A. Kim, and K. A. Ross
    ASPLOS 2014
  57. A Comprehensive Study of Main-memory Partitioning and its Application to Large-scale Comparison- and Radix-sort
    O. Polychroniou and K. A. Ross
    SIGMOD 2014
  58. Track Join: Distributed Joins with Minimal Network Traffic
    O. Polychroniou, R. Sen, and K. A. Ross
    SIGMOD 2014
  59. Detecting Foodborne Disease Outbreaks Using Social Media (demonstration)
    F. Psallidas, L. Gravano, and many others
    NYC Media Lab's Annual Summit, 2014
  60. Information Extraction from Social Media for Public Health
    N. Elhadad, L. Gravano, D. Hsu, S. Balter, V. Reddy, and H. Waechter
    KDD at Bloomberg Workshop, Data Frameworks Track (KDD 2014), 2014
  61. REEL: A Relation Extraction Learning Framework (poster)
    P. Barrio, G. Sim›es, H. Galhardas, and L. Gravano
    JCDL 2014
  62. Using Online Reviews by Restaurant Patrons to Identify Unreported Cases of Foodborne Illness Ñ New York City, 2012Ð2013
    C. Harrison, M. Jorder, H. Stern, F. Stavinsky, V. Reddy, H. Hanson, H. Waechter, L. Lowe, L. Gravano, and S. Balter
    Centers for Disease Control and Prevention Morbidity and Mortality Weekly Report 2014