We gratefully acknowledge support from
the Simons Foundation and member institutions.

Machine Learning

Authors and titles for recent submissions

[ total of 1194 entries: 1-50 | 51-100 | 101-150 | 151-200 | ... | 1151-1194 ]
[ showing 50 entries per page: fewer | more | all ]

Mon, 3 Jun 2024 (showing first 50 of 166 entries)

[1]  arXiv:2405.21064 [pdf, other]
Title: Recurrent neural networks: vanishing and exploding gradients are not the end of the story
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC)
[2]  arXiv:2405.21063 [pdf, other]
Title: Neural Network Verification with Branch-and-Bound for General Nonlinearities
Comments: Preprint
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[3]  arXiv:2405.21061 [pdf, other]
Title: Graph External Attention Enhanced Transformer
Comments: In Proceedings of ICML 2024
Subjects: Machine Learning (cs.LG)
[4]  arXiv:2405.21060 [pdf, other]
Title: Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality
Authors: Tri Dao, Albert Gu
Comments: ICML 2024
Subjects: Machine Learning (cs.LG)
[5]  arXiv:2405.21046 [pdf, other]
Title: Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (stat.ML)
[6]  arXiv:2405.21045 [pdf, ps, other]
Title: An Attention-Based Multi-Context Convolutional Encoder-Decoder Neural Network for Work Zone Traffic Impact Prediction
Subjects: Machine Learning (cs.LG)
[7]  arXiv:2405.21043 [pdf, other]
Title: Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation
Journal-ref: Proceedings of the 41 st International Conference on Machine Learning, 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[8]  arXiv:2405.21042 [pdf, other]
Title: Comparing information content of representation spaces for disentanglement with VAE ensembles
Comments: Code: this https URL
Subjects: Machine Learning (cs.LG)
[9]  arXiv:2405.21036 [pdf, ps, other]
Title: A-PETE: Adaptive Prototype Explanations of Tree Ensembles
Subjects: Machine Learning (cs.LG)
[10]  arXiv:2405.21021 [pdf, other]
Title: Beyond Conventional Parametric Modeling: Data-Driven Framework for Estimation and Prediction of Time Activity Curves in Dynamic PET Imaging
Subjects: Machine Learning (cs.LG); Image and Video Processing (eess.IV); Dynamical Systems (math.DS)
[11]  arXiv:2405.21018 [pdf, other]
Title: Improved Techniques for Optimization-Based Jailbreaking on Large Language Models
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[12]  arXiv:2405.21012 [pdf, other]
Title: G-Transformer for Conditional Average Potential Outcome Estimation over Time
Subjects: Machine Learning (cs.LG); Methodology (stat.ME)
[13]  arXiv:2405.21003 [pdf, other]
Title: Explaining Predictions by Characteristic Rules
Comments: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2022
Journal-ref: In: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2022. Lecture Notes in Computer Science(), vol 13713. Springer, Cham (2023)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[14]  arXiv:2405.20988 [pdf, other]
Title: Communication-Efficient Distributed Deep Learning via Federated Dynamic Averaging
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
[15]  arXiv:2405.20986 [pdf, other]
Title: Uncertainty Quantification for Bird's Eye View Semantic Segmentation: Methods and Benchmarks
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[16]  arXiv:2405.20984 [pdf, other]
Title: Bayesian Design Principles for Offline-to-Online Reinforcement Learning
Comments: Forty-first International Conference on Machine Learning (ICML), 2024
Subjects: Machine Learning (cs.LG)
[17]  arXiv:2405.20973 [pdf, other]
Title: LCQ: Low-Rank Codebook based Quantization for Large Language Models
Authors: Wen-Pu Cai, Wu-Jun Li
Comments: 10 pages, 5 figures
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[18]  arXiv:2405.20971 [pdf, other]
Title: Amortizing intractable inference in diffusion models for vision, language, and control
Comments: Code: this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[19]  arXiv:2405.20954 [pdf, other]
Title: Aligning Multiclass Neural Network Classifier Criterion with Task Performance via $F_β$-Score
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[20]  arXiv:2405.20935 [pdf, other]
Title: Effective Interplay between Sparsity and Quantization: From Theory to Practice
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[21]  arXiv:2405.20933 [pdf, ps, other]
Title: Concentration Bounds for Optimized Certainty Equivalent Risk Estimation
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[22]  arXiv:2405.20915 [pdf, other]
Title: Fast yet Safe: Early-Exiting with Risk Control
Comments: 25 pages, 11 figures, 4 tables (incl. appendix)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[23]  arXiv:2405.20905 [pdf, other]
Title: VENI, VINDy, VICI: a variational reduced-order modeling framework with uncertainty quantification
Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE); Dynamical Systems (math.DS)
[24]  arXiv:2405.20882 [pdf, other]
Title: Sheaf HyperNetworks for Personalized Federated Learning
Comments: 25 pages, 12 figures, 7 tables, pre-print under review
Subjects: Machine Learning (cs.LG)
[25]  arXiv:2405.20879 [pdf, other]
Title: Flow matching achieves minimax optimal convergence
Subjects: Machine Learning (cs.LG)
[26]  arXiv:2405.20860 [pdf, other]
Title: Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation
Subjects: Machine Learning (cs.LG)
[27]  arXiv:2405.20838 [pdf, other]
Title: einspace: Searching for Neural Architectures from Fundamental Operations
Comments: Project page at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[28]  arXiv:2405.20835 [pdf, other]
Title: Outliers and Calibration Sets have Diminishing Effect on Quantization of Modern LLMs
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[29]  arXiv:2405.20824 [pdf, ps, other]
Title: Online Convex Optimisation: The Optimal Switching Regret for all Segmentations Simultaneously
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[30]  arXiv:2405.20821 [pdf, other]
Title: Pursuing Overall Welfare in Federated Learning through Sequential Decision Making
Comments: Accepted at ICML 2024
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (stat.ML)
[31]  arXiv:2405.20800 [pdf, other]
Title: Shape Constraints in Symbolic Regression using Penalized Least Squares
Subjects: Machine Learning (cs.LG); Symbolic Computation (cs.SC)
[32]  arXiv:2405.20794 [pdf, ps, other]
Title: Model Interpretation and Explainability: Towards Creating Transparency in Prediction Models
Subjects: Machine Learning (cs.LG)
[33]  arXiv:2405.20790 [pdf, other]
Title: Intersectional Unfairness Discovery
Comments: ICML-2024 Camera-ready
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[34]  arXiv:2405.20772 [pdf, ps, other]
Title: Reinforcement Learning for Sociohydrology
Subjects: Machine Learning (cs.LG); Computers and Society (cs.CY)
[35]  arXiv:2405.20763 [pdf, other]
Title: Improving Generalization and Convergence by Enhancing Implicit Regularization
Comments: 35 pages
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
[36]  arXiv:2405.20761 [pdf, other]
Title: Share Your Secrets for Privacy! Confidential Forecasting with Vertical Federated Learning
Comments: Submitted to the 27TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2024)
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Distributed, Parallel, and Cluster Computing (cs.DC)
[37]  arXiv:2405.20759 [pdf, other]
Title: Information Theoretic Text-to-Image Alignment
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[38]  arXiv:2405.20738 [pdf, other]
Title: Federated Random Forest for Partially Overlapping Clinical Data
Subjects: Machine Learning (cs.LG)
[39]  arXiv:2405.20724 [pdf, other]
Title: Learning on Large Graphs using Intersecting Communities
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
[40]  arXiv:2405.20692 [pdf, other]
Title: In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[41]  arXiv:2405.20690 [pdf, other]
Title: Unleashing the Potential of Diffusion Models for Incomplete Data Imputation
Subjects: Machine Learning (cs.LG)
[42]  arXiv:2405.20685 [pdf, other]
Title: Enhancing Counterfactual Image Generation Using Mahalanobis Distance with Distribution Preferences in Feature Space
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[43]  arXiv:2405.20678 [pdf, ps, other]
Title: No-Regret Learning for Fair Multi-Agent Social Welfare Optimization
Subjects: Machine Learning (cs.LG); Computer Science and Game Theory (cs.GT); Multiagent Systems (cs.MA); Machine Learning (stat.ML)
[44]  arXiv:2405.20677 [pdf, other]
Title: Provably Efficient Interactive-Grounded Learning with Personalized Reward
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[45]  arXiv:2405.20671 [pdf, other]
Title: Position Coupling: Leveraging Task Structure for Improved Length Generalization of Transformers
Comments: 73 pages, 20 figures, 90 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[46]  arXiv:2405.20664 [pdf, other]
Title: Weak Robust Compatibility Between Learning Algorithms and Counterfactual Explanation Generation Algorithms
Authors: Ao Xu, Tieru Wu
Subjects: Machine Learning (cs.LG)
[47]  arXiv:2405.20652 [pdf, other]
Title: Sign is Not a Remedy: Multiset-to-Multiset Message Passing for Learning on Heterophilic Graphs
Comments: Published as a conference paper at ICML 2024
Subjects: Machine Learning (cs.LG)
[48]  arXiv:2405.20642 [pdf, other]
Title: Principal-Agent Multitasking: the Uniformity of Optimal Contracts and its Efficient Learning via Instrumental Regression
Authors: Shiliang Zuo
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[49]  arXiv:2405.20640 [pdf, other]
Title: Heterophilous Distribution Propagation for Graph Neural Networks
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[50]  arXiv:2405.20630 [pdf, other]
Title: Stochastic Optimal Control for Diffusion Bridges in Function Spaces
Subjects: Machine Learning (cs.LG)
[ total of 1194 entries: 1-50 | 51-100 | 101-150 | 151-200 | ... | 1151-1194 ]
[ showing 50 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2406, contact, help  (Access key information)