Publications

2026

  1. Flareon: Stealthy All2all Backdoor Injection via Poisoned Augmentation
    Flareon: Stealthy All2all Backdoor Injection via Poisoned Augmentation TKDD
    Tianrui Qin, Xuan Wang, Xianghuan He, and 4 more authors
    ACM Transactions on Knowledge Discovery from Data, 2026
  2. CaMeLs Can Use Computers Too: System-level Security for Computer Use Agents
    CaMeLs Can Use Computers Too: System-level Security for Computer Use Agents arXiv
    Hanna Foerster, Robert Mullins, Tom Blanchard, and 6 more authors
    arXiv preprint, 2026
  3. Heterogeneous Computing: The Key to Powering the Future of AI Agent Inference
    Heterogeneous Computing: The Key to Powering the Future of AI Agent Inference arXiv
    Yiren Zhao, and Junyi Liu
    arXiv preprint, 2026
  4. Deep Kernel Fusion for Transformers
    Deep Kernel Fusion for Transformers arXiv
    Zixi Zhang, Zhiwen Mo, Yiren Zhao, and 1 more author
    arXiv preprint, 2026
  5. Team of Thoughts: Efficient Test-time Scaling of Agentic Systems through Orchestrated Tool Calling
    Team of Thoughts: Efficient Test-time Scaling of Agentic Systems through Orchestrated Tool Calling arXiv
    Jeffrey T H Wong, Zixi Zhang, Junyi Liu, and 1 more author
    arXiv preprint, 2026
  6. KernelCraft: Benchmarking for Agentic Close-to-Metal Kernel Generation on Emerging Hardware
    KernelCraft: Benchmarking for Agentic Close-to-Metal Kernel Generation on Emerging Hardware arXiv
    Jiayi Nie, Haoran Wu, Yao Lai, and 9 more authors
    arXiv preprint, 2026
  7. Beyond GEMM-Centric NPUs: Enabling Efficient Diffusion LLM Sampling
    Beyond GEMM-Centric NPUs: Enabling Efficient Diffusion LLM Sampling arXiv
    Binglei Lou, Haoran Wu, Yao Lai, and 6 more authors
    arXiv preprint, 2026
  8. On the Existence and Behaviour of Secondary Attention Sinks
    On the Existence and Behaviour of Secondary Attention Sinks ICLR Workshop
    Jeffrey T. H. Wong, Cheng Zhang, Louis Mahon, and 3 more authors
    In ICLR 2026 Workshops on Geometry-grounded Representation Learning and Generative Modeling (GRaM) and Uncertainty-Aware, Robust, and Trustworthy AI for Computer Vision (UCRL), 2026
  9. Combating the Memory Walls: Optimization Pathways for Long-Context Agentic LLM Inference
    Combating the Memory Walls: Optimization Pathways for Long-Context Agentic LLM Inference ISCA
    Haoran Wu, Can Xiao, Jiayi Nie, and 13 more authors
    In International Symposium on Computer Architecture (ISCA), 2026
  10. Reasoning Introduces New Poisoning Attacks Yet Makes Them More Complicated
    Reasoning Introduces New Poisoning Attacks Yet Makes Them More Complicated SaTML
    Hanna Foerster, Ilia Shumailov, Yiren Zhao, and 4 more authors
    In 4th IEEE Conference on Secure and Trustworthy Machine Learning (SaTML), 2026

2025

  1. Architectural Neural Backdoors from First Principles
    Architectural Neural Backdoors from First Principles S&P
    Harry Langford, Ilia Shumailov, Yiren Zhao, and 2 more authors
    In IEEE Symposium on Security and Privacy (S&P), 2025
  2. QERA: an Analytical Framework for Quantization Error Reconstruction
    QERA: an Analytical Framework for Quantization Error Reconstruction ICLR
    Cheng Zhang, Jeffrey T H Wong, Can Xiao, and 2 more authors
    In International Conference on Learning Representations (ICLR), 2025
  3. AMPLE: Event-Driven Accelerator for Mixed-Precision Inference of Graph Neural Networks EuroMLSys
    Matheus Gimenes, Yiren Zhao, and George A Constantinides
    In 5th EuroMLSys: Machine Learning and Systems, 2025
  4. Cached Multi-Lora Composition for Multi-Concept Image Generation
    Cached Multi-Lora Composition for Multi-Concept Image Generation ICLR
    Xiandong Zou, Mingzhu Shen, Christos-Savvas Bouganis, and 1 more author
    In International Conference on Learning Representations (ICLR), 2025
  5. Hardware and Software Platform Inference
    Hardware and Software Platform Inference ICML
    Cheng Zhang, Hanna Foerster, Robert D Mullins, and 2 more authors
    In International Conference on Machine Learning (ICML), 2025
  6. Refining Salience-Aware Sparse Fine-Tuning Strategies for Language Models
    Refining Salience-Aware Sparse Fine-Tuning Strategies for Language Models ACL
    Xinxin Liu, Aaron Thomas, Cheng Zhang, and 3 more authors
    In Annual Meeting of the Association for Computational Linguistics (ACL), 2025
  7. Locking Machine Learning Models into Hardware
    Locking Machine Learning Models into Hardware SaTML
    Eleanor Clifford, Adhithya Saravanan, Harry Langford, and 5 more authors
    In IEEE Conference on Secure and Trustworthy Machine Learning (SaTML), 2025
  8. LLM4DV: Using Large Language Models for Hardware Test Stimuli Generation
    LLM4DV: Using Large Language Models for Hardware Test Stimuli Generation FCCM
    Zixi Zhang, Balint Szekely, Pedro Gimenes, and 5 more authors
    In IEEE International Symposium on Field-Programmable Custom Computing Machines (FCCM), 2025
  9. Refining Datapath for Microscaling ViTs
    Refining Datapath for Microscaling ViTs FPL
    Can Xiao, Jianyi Cheng, and Yiren Zhao
    In International Conference on Field-Programmable Logic and Applications (FPL), 2025
  10. Mixture of Weight-shared Heterogeneous Group Attention Experts for Dynamic Token-wise KV Optimization
    Mixture of Weight-shared Heterogeneous Group Attention Experts for Dynamic Token-wise KV Optimization EMNLP
    Guanghui Song, Dongping Liao, Yiren Zhao, and 3 more authors
    In Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
  11. An Efficient SRAM Architecture for Transposed and Non-Transposed Memory Access
    An Efficient SRAM Architecture for Transposed and Non-Transposed Memory Access FPT
    Can Xiao, Haoyang Wu, Xinheng Guo, and 2 more authors
    In International Conference on Field-Programmable Technology (ICFPT), 2025
  12. Omni-DNA: A Genomic Model Supporting Sequence Understanding, Long-context, and Textual Annotation
    Omni-DNA: A Genomic Model Supporting Sequence Understanding, Long-context, and Textual Annotation NeurIPS
    Zehui Li, Vallijah Subasri, Yifei Shen, and 4 more authors
    In Advances in Neural Information Processing Systems (NeurIPS), 2025
  13. A3: an Analytical Low-Rank Approximation Framework for Attention
    A3: an Analytical Low-Rank Approximation Framework for Attention arXiv
    Jeffrey T. H. Wong, Cheng Zhang, Xinye Cao, and 4 more authors
    arXiv preprint, 2025
  14. Microscaling Vision Transformers on FPGAs
    Microscaling Vision Transformers on FPGAs FCCM
    Can Xiao, Jianyi Cheng, and Yiren Zhao
    In IEEE Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM), 2025
  15. AMPLE: Event-Driven Accelerator for Mixed-Precision Inference of Graph Neural Networks
    AMPLE: Event-Driven Accelerator for Mixed-Precision Inference of Graph Neural Networks FPL
    Pedro Gimenes, Yiren Zhao, and George Constantinides
    In International Conference on Field-Programmable Logic and Applications (FPL), 2025
  16. ARIES: Autonomous Reasoning with LLMs on Interactive Thought Graph Environments
    ARIES: Autonomous Reasoning with LLMs on Interactive Thought Graph Environments arXiv
    Pedro Gimenes, Zeyu Cao, Jeffrey Wong, and 1 more author
    arXiv preprint, 2025
  17. Yorzoi: Predicting RNA-seq Coverage from DNA Sequence in Yeast
    Yorzoi: Predicting RNA-seq Coverage from DNA Sequence in Yeast arXiv
    Tim Schneider, Abdur Muntakim Rafi, Cameron Jensen, and 4 more authors
    arXiv preprint, 2025
  18. MC-LoRA: Fast Modular Composition for Multi-Character Diffusion Generation
    MC-LoRA: Fast Modular Composition for Multi-Character Diffusion Generation arXiv
    Enyan Xiao, Mingzhu Shen, Xiyu Zou, and 1 more author
    arXiv preprint, 2025

2024

  1. ImpNet: Imperceptible and blackbox-undetectable backdoors in compiled neural networks
    ImpNet: Imperceptible and blackbox-undetectable backdoors in compiled neural networks SaTML
    Eleanor Clifford, Ilia Shumailov, Yiren Zhao, and 2 more authors
    In IEEE Conference on Secure and Trustworthy Machine Learning (SaTML), 2024
  2. AI models collapse when trained on recursively generated data
    AI models collapse when trained on recursively generated data Nature
    Ilia Shumailov, Zakhar Shumaylov, Yiren Zhao, and 3 more authors
    Nature, 2024
  3. LQER: Low-Rank Quantization Error Reconstruction for LLMs
    LQER: Low-Rank Quantization Error Reconstruction for LLMs ICML
    Cheng Zhang, Jianyi Cheng, George A Constantinides, and 1 more author
    In International Conference on Machine Learning (ICML), 2024
  4. HASS: Hardware-Aware Sparsity Search for Dataflow DNN Accelerator
    HASS: Hardware-Aware Sparsity Search for Dataflow DNN Accelerator FPL
    Zhewen Yu, Sudarshan Sreeram, Krish Agrawal, and 6 more authors
    In International Conference on Field-Programmable Logic and Applications (FPL), 2024
  5. Absorb & Escape: Overcoming Single Model Limitations in Generating Heterogeneous Genomic Sequences
    Absorb & Escape: Overcoming Single Model Limitations in Generating Heterogeneous Genomic Sequences NeurIPS
    Zehui Li, Yuhao Ni, Guoxuan Xia, and 4 more authors
    In Advances in Neural Information Processing Systems (NeurIPS), 2024
  6. GV-Rep: A Large-Scale Dataset for Genetic Variant Representation Learning
    GV-Rep: A Large-Scale Dataset for Genetic Variant Representation Learning NeurIPS
    Zehui Li, Vallijah Subasri, Guy-Bart Stan, and 2 more authors
    In Advances in Neural Information Processing Systems (NeurIPS), Datasets and Benchmarks Track, 2024
  7. Enhancing Node Representations for Real-World Complex Networks with Topological Augmentation
    Enhancing Node Representations for Real-World Complex Networks with Topological Augmentation ECAI
    Xiangyu Zhao, Zehui Li, Mingzhu Shen, and 3 more authors
    In European Conference on Artificial Intelligence (ECAI), 2024
  8. ∆-DiT: A Training-Free Acceleration Method Tailored for Diffusion Transformers
    ∆-DiT: A Training-Free Acceleration Method Tailored for Diffusion Transformers arXiv
    Pengtao Chen, Mingzhu Shen, Peng Ye, and 5 more authors
    arXiv preprint arXiv:2406.01125, 2024
  9. Verification and Fault Injection Platform Based on MTB Stimulus Generation Method for L2 Deep Market Quote Decoder
    Le Yu, Zhiheng Liang, Yaqi Li, and 3 more authors
    In IEEE Access, 2024
  10. Unlocking the Global Synergies in Low-Rank Adapters
    Unlocking the Global Synergies in Low-Rank Adapters ICML Workshop
    Zixi Zhang, Cheng Zhang, Xitong Gao, and 3 more authors
    In ICML 2024 Workshop on Efficient Systems for Foundation Models II (ES-FoMo II), 2024
  11. Optimised Grouped-Query Attention Mechanism for Transformers
    Optimised Grouped-Query Attention Mechanism for Transformers ICML Workshop
    Yuang Chen, Cheng Zhang, Xitong Gao, and 3 more authors
    In ICML 2024 Workshop on Efficient Systems for Foundation Models II (ES-FoMo II), 2024
  12. MD-DiT: Step-aware Mixture-of-Depths for Efficient Diffusion Transformers
    MD-DiT: Step-aware Mixture-of-Depths for Efficient Diffusion Transformers NeurIPS Workshop
    Mingzhu Shen, Pengxiang Chen, Peng Ye, and 4 more authors
    In NeurIPS 2024 Workshop: Adaptive Foundation Models: Evolving AI for Personalized and Efficient Learning, 2024
  13. Scaling Laws for Mixed Quantization in Large Language Models
    Scaling Laws for Mixed Quantization in Large Language Models arXiv
    Zeyu Cao, Boyang Gu, Cheng Zhang, and 5 more authors
    arXiv preprint, 2024

2023

  1. Neural network activation compression with non-uniform mantissas Patent
    Daniel Lo, Amar Phanishayee, Eric S Chung, and 1 more author
    Jan 2023
    US Patent 11,562,247
  2. Architectural backdoors in neural networks
    Architectural backdoors in neural networks CVPR
    Mikel Bober-Irizar, Ilia Shumailov, Yiren Zhao, and 2 more authors
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Jan 2023
  3. Augmentation Backdoors
    Augmentation Backdoors ICLR Workshop
    Joseph Rance, Yiren Zhao, Ilia Shumailov, and 1 more author
    In ICLR 2023 Workshop on Backdoor Attacks and Defenses in Machine Learning (BANDS), Jan 2023
  4. Revisiting Structured Dropout
    Revisiting Structured Dropout
    Yiren Zhao, Oluwatomisin Dada, Xitong Gao, and 1 more author
    In Asian Conference on Machine Learning (ACML), Jan 2023
  5. Task-Agnostic Graph Neural Network Evaluation via Adversarial Collaboration
    Task-Agnostic Graph Neural Network Evaluation via Adversarial Collaboration ICLR Workshop
    Xiangyu Zhao, Hannes Stärk, Dominique Beaini, and 2 more authors
    In ICLR 2023 Workshop on Machine Learning for Drug Discovery, Jan 2023
  6. Dynamic Stashing Quantization for Efficient Transformer Training
    Dynamic Stashing Quantization for Efficient Transformer Training EMNLP
    Guo Yang, Daniel Lo, Robert Mullins, and 1 more author
    In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), Jan 2023
  7. Revisiting Automated Prompting: Are We Actually Doing Better?
    Revisiting Automated Prompting: Are We Actually Doing Better? ACL
    Yulin Zhou, Yiren Zhao, Ilia Shumailov, and 2 more authors
    In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL), Jan 2023
  8. Neural network activation compression with non-uniform mantissas Patent
    Daniel Lo, Amar Phanishayee, Eric S Chung, and 1 more author
    May 2023
    US Patent App. 18/092,876
  9. Adaptive Channel Sparsity for Federated Learning Under System Heterogeneity
    Adaptive Channel Sparsity for Federated Learning Under System Heterogeneity CVPR
    Dongping Liao, Xitong Gao, Yiren Zhao, and 1 more author
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, May 2023
  10. Hybrid Graph: A Unified Graph Representation with Datasets and Benchmarks for Complex Graphs
    Hybrid Graph: A Unified Graph Representation with Datasets and Benchmarks for Complex Graphs arXiv
    Zehui Li, Xiangyu Zhao, Mingzhu Shen, and 3 more authors
    arXiv preprint arXiv:2306.05108, May 2023
  11. Genomic Interpreter: A Hierarchical Genomic Deep Neural Network with 1D Shifted Window Transformer
    Genomic Interpreter: A Hierarchical Genomic Deep Neural Network with 1D Shifted Window Transformer ICML Workshop
    Zehui Li, Akashaditya Das, William AV Beardall, and 2 more authors
    In ICML 2023 Workshop on Computational Biology, May 2023
  12. Fast Prototyping Next-Generation Accelerators for New ML Models using MASE: ML Accelerator System Exploration
    Fast Prototyping Next-Generation Accelerators for New ML Models using MASE: ML Accelerator System Exploration arXiv
    Jianyi Cheng, Cheng Zhang, Zhewen Yu, and 4 more authors
    arXiv preprint arXiv:2307.15517, May 2023
  13. Will More Expressive Graph Neural Networks do Better on Generative Tasks?
    Will More Expressive Graph Neural Networks do Better on Generative Tasks? LoG
    Xiandong Zou, Xiangyu Zhao, Pietro Liò, and 1 more author
    In Learning on Graphs Conference (LoG 2023), May 2023
  14. DiscDiff: Latent Diffusion Model for DNA Sequence Generation
    DiscDiff: Latent Diffusion Model for DNA Sequence Generation NeurIPS Workshop
    Zehui Li, Yuhao Ni, William A V Beardall, and 4 more authors
    In NeurIPS 2023 Workshop: AI for Science: from Theory to Practice, May 2023
  15. MASE: An Efficient Representation for Software-Defined ML Hardware System Exploration
    MASE: An Efficient Representation for Software-Defined ML Hardware System Exploration NeurIPS Workshop
    Cheng Zhang, Jianyi Cheng, Zhihang Yu, and 1 more author
    In NeurIPS 2023 Workshop: Machine Learning for Systems, May 2023
  16. Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?
    Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference? EMNLP
    Cheng Zhang, Jianyi Cheng, Ilia Shumailov, and 2 more authors
    In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), May 2023
  17. MiliPoint: A Point Cloud Dataset for mmWave Radar
    MiliPoint: A Point Cloud Dataset for mmWave Radar NeurIPS
    Han Cui, Shu Zhong, Jiacheng Wu, and 3 more authors
    In Advances in Neural Information Processing Systems (NeurIPS), May 2023
  18. A Dataflow Compiler for Efficient LLM Inference using Custom Microscaling Formats
    A Dataflow Compiler for Efficient LLM Inference using Custom Microscaling Formats arXiv
    Jianyi Cheng, Cheng Zhang, Zhewen Yu, and 3 more authors
    arXiv preprint, May 2023

2022

  1. DAdaQuant: Doubly-adaptive quantization for communication-efficient Federated Learning
    DAdaQuant: Doubly-adaptive quantization for communication-efficient Federated Learning ICML
    Robert Hönig, Yiren Zhao, and Robert Mullins
    In International Conference on Machine Learning, May 2022
  2. FedDrop: Trajectory-weighted Dropout for Efficient Federated Learning
    FedDrop: Trajectory-weighted Dropout for Efficient Federated Learning ICLR
    Dongping Liao, Xitong Gao, Yiren Zhao, and 6 more authors
    In International Conference on Learning Representations (ICLR), May 2022
  3. Model Architecture Adaption for Bayesian Neural Networks
    Model Architecture Adaption for Bayesian Neural Networks arXiv
    Duo Wang, Yiren Zhao, Ilia Shumailov, and 1 more author
    arXiv preprint arXiv:2202.04392, May 2022
  4. Efficient Adversarial Training With Data Pruning
    Efficient Adversarial Training With Data Pruning arXiv
    Maximilian Kaufmann, Yiren Zhao, Ilia Shumailov, and 2 more authors
    arXiv preprint arXiv:2207.00694, May 2022
  5. Wide Attention Is The Way Forward For Transformers
    Wide Attention Is The Way Forward For Transformers NeurIPS Workshop
    Jason Ross Brown, Yiren Zhao, Ilia Shumailov, and 1 more author
    In NeurIPS 2022 Workshop on Attention: Challenges and Opportunities, May 2022
  6. DARTFormer: Finding The Best Type Of Attention
    DARTFormer: Finding The Best Type Of Attention NeurIPS Workshop
    Jason Ross Brown, Yiren Zhao, Ilia Shumailov, and 1 more author
    In NeurIPS 2022 Workshop: I Can’t Believe It’s Not Better – Understanding Deep Learning Through Empirical Falsification, May 2022
  7. Revisiting Embeddings for Graph Neural Networks
    Revisiting Embeddings for Graph Neural Networks LoG
    Skye Purchase, Yiren Zhao, and Robert D Mullins
    In Learning on Graphs Conference, May 2022
  8. Rapid Model Architecture Adaption for Meta-Learning
    Rapid Model Architecture Adaption for Meta-Learning NeurIPS
    Yiren Zhao, Xitong Gao, Ilia Shumailov, and 2 more authors
    In Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS), May 2022

2021

  1. Probabilistic Dual Network Architecture Search on Graphs
    Probabilistic Dual Network Architecture Search on Graphs AAAI Workshop
    Yiren Zhao, Duo Wang, Xitong Gao, and 3 more authors
    In AAAI 2021 Workshop on Deep Learning on Graphs: Method and Applications (DLG-AAAI’21), May 2021
  2. Sponge examples: Energy-latency attacks on neural networks
    Sponge examples: Energy-latency attacks on neural networks EuroS&P
    Ilia Shumailov, Yiren Zhao, Daniel Bates, and 3 more authors
    In 2021 IEEE European Symposium on Security and Privacy (EuroS&P), May 2021
  3. Learned Low Precision Graph Neural Networks
    Learned Low Precision Graph Neural Networks EuroMLSys
    Yiren Zhao, Duo Wang, Daniel Bates, and 3 more authors
    In 1st EuroMLSys: Machine Learning and Systems, May 2021
  4. Manipulating sgd with data ordering attacks
    Manipulating sgd with data ordering attacks NeurIPS
    Ilia Shumailov, Zakhar Shumaylov, Dmitry Kazhdan, and 4 more authors
    Advances in Neural Information Processing Systems, May 2021
  5. Markpainting: Adversarial Machine Learning meets Inpainting
    Markpainting: Adversarial Machine Learning meets Inpainting ICML
    David Khachaturov, Ilia Shumailov, Yiren Zhao, and 2 more authors
    In 38th International Conference on Machine Learning, May 2021

2020

  1. Blackbox attacks on reinforcement learning agents using approximated temporal information
    Blackbox attacks on reinforcement learning agents using approximated temporal information DSN-W Workshop
    Yiren Zhao, Ilia Shumailov, Han Cui, and 3 more authors
    In 2020 50th Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W), May 2020
  2. Towards certifiable adversarial sample detection
    Towards certifiable adversarial sample detection AISec Workshop
    Ilia Shumailov, Yiren Zhao, Robert Mullins, and 1 more author
    In Proceedings of the 13th ACM Workshop on Artificial Intelligence and Security, May 2020
  3. Pay Attention to Features, Transfer Learn Faster CNNs
    Pay Attention to Features, Transfer Learn Faster CNNs ICLR
    Kafeng Wang, Xitong Gao, Yiren Zhao, and 3 more authors
    In International Conference on Learning Representations, May 2020
  4. Nudge Attacks on Point-Cloud DNNs
    Nudge Attacks on Point-Cloud DNNs arXiv
    Yiren Zhao, Ilia Shumailov, Robert Mullins, and 1 more author
    arXiv preprint arXiv:2011.11637, May 2020
  5. Adjusting activation compression for neural network training Patent
    Daniel Lo, Bita Darvish Rouhani, Eric S Chung, and 3 more authors
    Aug 2020
    US Patent App. 16/276,395
  6. Neural network activation compression with narrow block floating-point Patent
    Daniel Lo, Amar Phanishayee, Eric S Chung, and 2 more authors
    Jul 2020
    US Patent App. 16/237,197
  7. Neural network activation compression with outlier block floating-point Patent
    Daniel Lo, Amar Phanishayee, Eric S Chung, and 2 more authors
    Jul 2020
    US Patent App. 16/237,202

2019

  1. Dynamic Channel Pruning: Feature Boosting and Suppression
    Dynamic Channel Pruning: Feature Boosting and Suppression ICLR
    Xitong Gao, Yiren Zhao, Lukasz Dudziak, and 2 more authors
    In International Conference on Learning Representations (ICLR), Jul 2019
  2. Sitatapatra: Blocking the Transfer of Adversarial Samples
    Sitatapatra: Blocking the Transfer of Adversarial Samples arXiv
    Ilia Shumailov, Xitong Gao, Yiren Zhao, and 3 more authors
    arXiv preprint arXiv:1901.08121, Jul 2019
  3. Focused quantization for sparse CNNs
    Focused quantization for sparse CNNs NeurIPS
    Yiren Zhao, Xitong Gao, Daniel Bates, and 2 more authors
    In Advances in Neural Information Processing Systems, Jul 2019
  4. Characterizing Sources of Ineffectual Computations in Deep Learning Networks
    Characterizing Sources of Ineffectual Computations in Deep Learning Networks ISPASS
    Miloš Nikolić, Mostafa Mahmoud, Andreas Moshovos, and 2 more authors
    In 2019 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), Jul 2019
  5. Automatic generation of multi-precision multi-arithmetic CNN accelerators for FPGAs
    Automatic generation of multi-precision multi-arithmetic CNN accelerators for FPGAs FPT
    Yiren Zhao, Xitong Gao, Xuan Guo, and 6 more authors
    In 2019 International Conference on Field-Programmable Technology (ICFPT), Jul 2019

2018

  1. Redundancy-reduced mobilenet acceleration on reconfigurable logic for imagenet classification
    Redundancy-reduced mobilenet acceleration on reconfigurable logic for imagenet classification ARC
    Jiang Su, Julian Faraone, Junyi Liu, and 4 more authors
    In Applied Reconfigurable Computing. Architectures, Tools, and Applications: 14th International Symposium, ARC 2018, Santorini, Greece, May 2-4, 2018, Proceedings 14, Jul 2018
  2. Mayo: A Framework for Auto-generating Hardware Friendly Deep Neural Networks
    Mayo: A Framework for Auto-generating Hardware Friendly Deep Neural Networks EMDL Workshop
    Yiren Zhao, Xitong Gao, Robert Mullins, and 1 more author
    In Proceedings of the 2nd International Workshop on Embedded and Mobile Deep Learning (EMDL 2018), Jul 2018
  3. To compress or not to compress: Understanding the Interactions between Adversarial Attacks and Neural Network Compression
    To compress or not to compress: Understanding the Interactions between Adversarial Attacks and Neural Network Compression MLSys
    Yiren Zhao, Ilia Shumailov, Robert Mullins, and 1 more author
    In The Conference on Systems and Machine Learning (SysML), Jul 2018
  4. The Taboo Trap: Behavioural Detection of Adversarial Samples
    The Taboo Trap: Behavioural Detection of Adversarial Samples arXiv
    Ilia Shumailov, Yiren Zhao, Robert Mullins, and 1 more author
    arXiv preprint arXiv:1811.07375, Jul 2018

2016

  1. An efficient implementation of online arithmetic
    An efficient implementation of online arithmetic FPT
    Yiren Zhao, John Wickerson, and George A Constantinides
    In 2016 International Conference on Field-Programmable Technology (FPT), Jul 2016