Machine Learning

Authors and titles for recent submissions

[ total of 1194 entries: 1-50 | 51-100 | 101-150 | 151-200 | ... | 1151-1194 ]
[ showing 50 entries per page: fewer | more | all ]

Mon, 3 Jun 2024 (showing first 50 of 166 entries)

[1] arXiv:2405.21064 [pdf, other]: Title: Recurrent neural networks: vanishing and exploding gradients are not the end of the story

Authors: Nicolas Zucchet, Antonio Orvieto

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC)
[2] arXiv:2405.21063 [pdf, other]: Title: Neural Network Verification with Branch-and-Bound for General Nonlinearities

Authors: Zhouxing Shi, Qirui Jin, Zico Kolter, Suman Jana, Cho-Jui Hsieh, Huan Zhang

Comments: Preprint

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3] arXiv:2405.21061 [pdf, other]: Title: Graph External Attention Enhanced Transformer

Authors: Jianqing Liang, Min Chen, Jiye Liang

Comments: In Proceedings of ICML 2024

Subjects: Machine Learning (cs.LG)
[4] arXiv:2405.21060 [pdf, other]: Title: Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

Authors: Tri Dao, Albert Gu

Comments: ICML 2024

Subjects: Machine Learning (cs.LG)
[5] arXiv:2405.21046 [pdf, other]: Title: Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF

Authors: Tengyang Xie, Dylan J. Foster, Akshay Krishnamurthy, Corby Rosset, Ahmed Awadallah, Alexander Rakhlin

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[6] arXiv:2405.21045 [pdf, ps, other]: Title: An Attention-Based Multi-Context Convolutional Encoder-Decoder Neural Network for Work Zone Traffic Impact Prediction

Authors: Qinhua Jiang, Xishun Liao, Yaofa Gong, Jiaqi Ma

Subjects: Machine Learning (cs.LG)
[7] arXiv:2405.21043 [pdf, other]: Title: Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation

Authors: Fengdi Che, Chenjun Xiao, Jincheng Mei, Bo Dai, Ramki Gummadi, Oscar A Ramirez, Christopher K Harris, A. Rupam Mahmood, Dale Schuurmans

Journal-ref: Proceedings of the 41 st International Conference on Machine Learning, 2024

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[8] arXiv:2405.21042 [pdf, other]: Title: Comparing information content of representation spaces for disentanglement with VAE ensembles

Authors: Kieran A. Murphy, Sam Dillavou, Dani S. Bassett

Comments: Code: this https URL

Subjects: Machine Learning (cs.LG)
[9] arXiv:2405.21036 [pdf, ps, other]: Title: A-PETE: Adaptive Prototype Explanations of Tree Ensembles

Authors: Jacek Karolczak, Jerzy Stefanowski

Subjects: Machine Learning (cs.LG)
[10] arXiv:2405.21021 [pdf, other]: Title: Beyond Conventional Parametric Modeling: Data-Driven Framework for Estimation and Prediction of Time Activity Curves in Dynamic PET Imaging

Authors: Niloufar Zakariaei, Arman Rahmim, Eldad Haber

Subjects: Machine Learning (cs.LG); Image and Video Processing (eess.IV); Dynamical Systems (math.DS)
[11] arXiv:2405.21018 [pdf, other]: Title: Improved Techniques for Optimization-Based Jailbreaking on Large Language Models

Authors: Xiaojun Jia, Tianyu Pang, Chao Du, Yihao Huang, Jindong Gu, Yang Liu, Xiaochun Cao, Min Lin

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[12] arXiv:2405.21012 [pdf, other]: Title: G-Transformer for Conditional Average Potential Outcome Estimation over Time

Authors: Konstantin Hess, Dennis Frauen, Valentyn Melnychuk, Stefan Feuerriegel

Subjects: Machine Learning (cs.LG); Methodology (stat.ME)
[13] arXiv:2405.21003 [pdf, other]: Title: Explaining Predictions by Characteristic Rules

Authors: Amr Alkhatib, Henrik Boström, Michalis Vazirgiannis

Comments: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2022

Journal-ref: In: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2022. Lecture Notes in Computer Science(), vol 13713. Springer, Cham (2023)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[14] arXiv:2405.20988 [pdf, other]: Title: Communication-Efficient Distributed Deep Learning via Federated Dynamic Averaging

Authors: Michail Theologitis, Georgios Frangias, Georgios Anestis, Vasilis Samoladas, Antonios Deligiannakis

Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[15] arXiv:2405.20986 [pdf, other]: Title: Uncertainty Quantification for Bird's Eye View Semantic Segmentation: Methods and Benchmarks

Authors: Linlin Yu, Bowen Yang, Tianhao Wang, Kangshuo Li, Feng Chen

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[16] arXiv:2405.20984 [pdf, other]: Title: Bayesian Design Principles for Offline-to-Online Reinforcement Learning

Authors: Hao Hu, Yiqin Yang, Jianing Ye, Chengjie Wu, Ziqing Mai, Yujing Hu, Tangjie Lv, Changjie Fan, Qianchuan Zhao, Chongjie Zhang

Comments: Forty-first International Conference on Machine Learning (ICML), 2024

Subjects: Machine Learning (cs.LG)
[17] arXiv:2405.20973 [pdf, other]: Title: LCQ: Low-Rank Codebook based Quantization for Large Language Models

Authors: Wen-Pu Cai, Wu-Jun Li

Comments: 10 pages, 5 figures

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[18] arXiv:2405.20971 [pdf, other]: Title: Amortizing intractable inference in diffusion models for vision, language, and control

Authors: Siddarth Venkatraman, Moksh Jain, Luca Scimeca, Minsu Kim, Marcin Sendera, Mohsin Hasan, Luke Rowe, Sarthak Mittal, Pablo Lemos, Emmanuel Bengio, Alexandre Adam, Jarrid Rector-Brooks, Yoshua Bengio, Glen Berseth, Nikolay Malkin

Comments: Code: this https URL

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[19] arXiv:2405.20954 [pdf, other]: Title: Aligning Multiclass Neural Network Classifier Criterion with Task Performance via $F_β$-Score

Authors: Nathan Tsoi, Deyuan Li, Taesoo Daniel Lee, Marynel Vázquez

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[20] arXiv:2405.20935 [pdf, other]: Title: Effective Interplay between Sparsity and Quantization: From Theory to Practice

Authors: Simla Burcu Harma, Ayan Chakraborty, Elizaveta Kostenok, Danila Mishin, Dongho Ha, Babak Falsafi, Martin Jaggi, Ming Liu, Yunho Oh, Suvinay Subramanian, Amir Yazdanbakhsh

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[21] arXiv:2405.20933 [pdf, ps, other]: Title: Concentration Bounds for Optimized Certainty Equivalent Risk Estimation

Authors: Ayon Ghosh, L.A. Prashanth, Krishna Jagannathan

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[22] arXiv:2405.20915 [pdf, other]: Title: Fast yet Safe: Early-Exiting with Risk Control

Authors: Metod Jazbec, Alexander Timans, Tin Hadži Veljković, Kaspar Sakmann, Dan Zhang, Christian A. Naesseth, Eric Nalisnick

Comments: 25 pages, 11 figures, 4 tables (incl. appendix)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[23] arXiv:2405.20905 [pdf, other]: Title: VENI, VINDy, VICI: a variational reduced-order modeling framework with uncertainty quantification

Authors: Paolo Conti, Jonas Kneifl, Andrea Manzoni, Attilio Frangi, Jörg Fehr, Steven L. Brunton, J. Nathan Kutz

Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE); Dynamical Systems (math.DS)
[24] arXiv:2405.20882 [pdf, other]: Title: Sheaf HyperNetworks for Personalized Federated Learning

Authors: Bao Nguyen, Lorenzo Sani, Xinchi Qiu, Pietro Liò, Nicholas D. Lane

Comments: 25 pages, 12 figures, 7 tables, pre-print under review

Subjects: Machine Learning (cs.LG)
[25] arXiv:2405.20879 [pdf, other]: Title: Flow matching achieves minimax optimal convergence

Authors: Kenji Fukumizu, Taiji Suzuki, Noboru Isobe, Kazusato Oko, Masanori Koyama

Subjects: Machine Learning (cs.LG)
[26] arXiv:2405.20860 [pdf, other]: Title: Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation

Authors: Shangding Gu, Laixi Shi, Yuhao Ding, Alois Knoll, Costas Spanos, Adam Wierman, Ming Jin

Subjects: Machine Learning (cs.LG)
[27] arXiv:2405.20838 [pdf, other]: Title: einspace: Searching for Neural Architectures from Fundamental Operations

Authors: Linus Ericsson, Miguel Espinosa, Chenhongyi Yang, Antreas Antoniou, Amos Storkey, Shay B. Cohen, Steven McDonagh, Elliot J. Crowley

Comments: Project page at this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[28] arXiv:2405.20835 [pdf, other]: Title: Outliers and Calibration Sets have Diminishing Effect on Quantization of Modern LLMs

Authors: Davide Paglieri, Saurabh Dash, Tim Rocktäschel, Jack Parker-Holder

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[29] arXiv:2405.20824 [pdf, ps, other]: Title: Online Convex Optimisation: The Optimal Switching Regret for all Segmentations Simultaneously

Authors: Stephen Pasteris, Chris Hicks, Vasilios Mavroudis, Mark Herbster

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[30] arXiv:2405.20821 [pdf, other]: Title: Pursuing Overall Welfare in Federated Learning through Sequential Decision Making

Authors: Seok-Ju Hahn, Gi-Soo Kim, Junghye Lee

Comments: Accepted at ICML 2024

Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (stat.ML)
[31] arXiv:2405.20800 [pdf, other]: Title: Shape Constraints in Symbolic Regression using Penalized Least Squares

Authors: Viktor Martinek, Julia Reuter, Ophelia Frotscher, Sanaz Mostaghim, Markus Richter, Roland Herzog

Subjects: Machine Learning (cs.LG); Symbolic Computation (cs.SC)
[32] arXiv:2405.20794 [pdf, ps, other]: Title: Model Interpretation and Explainability: Towards Creating Transparency in Prediction Models

Authors: Donald Kridel, Jacob Dineen, Daniel Dolk, David Castillo

Subjects: Machine Learning (cs.LG)
[33] arXiv:2405.20790 [pdf, other]: Title: Intersectional Unfairness Discovery

Authors: Gezheng Xu, Qi Chen, Charles Ling, Boyu Wang, Changjian Shui

Comments: ICML-2024 Camera-ready

Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[34] arXiv:2405.20772 [pdf, ps, other]: Title: Reinforcement Learning for Sociohydrology

Authors: Tirthankar Roy, Shivendra Srivastava, Beichen Zhang

Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[35] arXiv:2405.20763 [pdf, other]: Title: Improving Generalization and Convergence by Enhancing Implicit Regularization

Authors: Mingze Wang, Haotian He, Jinbo Wang, Zilin Wang, Guanhua Huang, Feiyu Xiong, Zhiyu Li, Weinan E, Lei Wu

Comments: 35 pages

Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[36] arXiv:2405.20761 [pdf, other]: Title: Share Your Secrets for Privacy! Confidential Forecasting with Vertical Federated Learning

Authors: Aditya Shankar, Lydia Y. Chen, Jérémie Decouchant, Dimitra Gkorou, Rihan Hai

Comments: Submitted to the 27TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2024)

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Distributed, Parallel, and Cluster Computing (cs.DC)
[37] arXiv:2405.20759 [pdf, other]: Title: Information Theoretic Text-to-Image Alignment

Authors: Chao Wang, Giulio Franzese, Alessandro Finamore, Massimo Gallo, Pietro Michiardi

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[38] arXiv:2405.20738 [pdf, other]: Title: Federated Random Forest for Partially Overlapping Clinical Data

Authors: Youngjun Park, Cord Eric Schmidt, Benedikt Marcel Batton, Anne-Christin Hauschild

Subjects: Machine Learning (cs.LG)
[39] arXiv:2405.20724 [pdf, other]: Title: Learning on Large Graphs using Intersecting Communities

Authors: Ben Finkelshtein, İsmail İlkan Ceylan, Michael Bronstein, Ron Levie

Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
[40] arXiv:2405.20692 [pdf, other]: Title: In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought

Authors: Sili Huang, Jifeng Hu, Hechang Chen, Lichao Sun, Bo Yang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[41] arXiv:2405.20690 [pdf, other]: Title: Unleashing the Potential of Diffusion Models for Incomplete Data Imputation

Authors: Hengrui Zhang, Liancheng Fang, Philip S. Yu

Subjects: Machine Learning (cs.LG)
[42] arXiv:2405.20685 [pdf, other]: Title: Enhancing Counterfactual Image Generation Using Mahalanobis Distance with Distribution Preferences in Feature Space

Authors: Yukai Zhang, Ao Xu, Zihao Li, Tieru Wu

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[43] arXiv:2405.20678 [pdf, ps, other]: Title: No-Regret Learning for Fair Multi-Agent Social Welfare Optimization

Authors: Mengxiao Zhang, Ramiro Deo-Campo Vuong, Haipeng Luo

Subjects: Machine Learning (cs.LG); Computer Science and Game Theory (cs.GT); Multiagent Systems (cs.MA); Machine Learning (stat.ML)
[44] arXiv:2405.20677 [pdf, other]: Title: Provably Efficient Interactive-Grounded Learning with Personalized Reward

Authors: Mengxiao Zhang, Yuheng Zhang, Haipeng Luo, Paul Mineiro

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[45] arXiv:2405.20671 [pdf, other]: Title: Position Coupling: Leveraging Task Structure for Improved Length Generalization of Transformers

Authors: Hanseul Cho, Jaeyoung Cha, Pranjal Awasthi, Srinadh Bhojanapalli, Anupam Gupta, Chulhee Yun

Comments: 73 pages, 20 figures, 90 tables

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[46] arXiv:2405.20664 [pdf, other]: Title: Weak Robust Compatibility Between Learning Algorithms and Counterfactual Explanation Generation Algorithms

Authors: Ao Xu, Tieru Wu

Subjects: Machine Learning (cs.LG)
[47] arXiv:2405.20652 [pdf, other]: Title: Sign is Not a Remedy: Multiset-to-Multiset Message Passing for Learning on Heterophilic Graphs

Authors: Langzhang Liang, Sunwoo Kim, Kijung Shin, Zenglin Xu, Shirui Pan, Yuan Qi

Comments: Published as a conference paper at ICML 2024

Subjects: Machine Learning (cs.LG)
[48] arXiv:2405.20642 [pdf, other]: Title: Principal-Agent Multitasking: the Uniformity of Optimal Contracts and its Efficient Learning via Instrumental Regression

Authors: Shiliang Zuo

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[49] arXiv:2405.20640 [pdf, other]: Title: Heterophilous Distribution Propagation for Graph Neural Networks

Authors: Zhuonan Zheng, Sheng Zhou, Hongjia Xu, Ming Gu, Yilun Xu, Ao Li, Yuhong Li, Jingjun Gu, Jiajun Bu

Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[50] arXiv:2405.20630 [pdf, other]: Title: Stochastic Optimal Control for Diffusion Bridges in Function Spaces

Authors: Byoungwoo Park, Jungwon Choi, Sungbin Lim, Juho Lee

Subjects: Machine Learning (cs.LG)

[ total of 1194 entries: 1-50 | 51-100 | 101-150 | 151-200 | ... | 1151-1194 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2406, contact, help (Access key information)

> cs > cs.LG

Machine Learning

Authors and titles for recent submissions

Mon, 3 Jun 2024 (showing first 50 of 166 entries)