Notice
Random Tessellation Forests
- document 1 document 2 document 3
- niveau 1 niveau 2 niveau 3
Descriptif
Random forests are a popular class of algorithms used for regression and classification. The original algorithm introduced by Breiman in 2001 and many of its variants are ensembles of randomized decision trees built from axis-aligned partitions of the feature space. One such variant, called Mondrian random forests, were proposed to handle the online setting and are the first class of random forests for which minimax rates were obtained in arbitrary dimension. However, the restriction to axis-aligned splits fails to capture dependencies between features, and oblique splits have shown improved empirical performance for many tasks. By viewing the Mondrian as a special case of the stable under iterated (STIT) process in stochastic geometry, we resolve some open questions about the generalization of split directions. In particular, we utilize the theory of stationary random tessellations to show that STIT random forests achieve minimax rates for Lipschitz and C^2 functions. This work opens many new questions at the intersection of stochastic geometry and machine learning. Based on joint work with Ngoc Tran.
Thème
Dans la même collection
-
Sofic entropy of processes on infinite random trees
BordenaveCharlesThis is a joint work with Agnes Backhausz et Balasz Szegedy. We define a natural notion of micro-state entropy...
-
The bias of fluid approximation: Poisson equation, averaging methods and two-timescale processes
GastNicolasFluid approximation often provide a good tool to study a stochastic process.
-
On the dependence structure of negatively dependent measures
Barzegar TouchaniMiladA major breakthrough in the theory of negatively dependent – i.e., repulsive – probability measure...
-
A Notion of Entropy for Sparse Marked Graphs and its Applications in Graphical Data Compression
DelgoshaPayamMany modern data sources arising from social networks, biological data, etc. are best viewed...
-
Propagation of chaos and Poisson Hypothesis for replica mean-field models of intensity-based neural…
DavydovMichelNeural computations arising from myriads of interactions between spiking neurons can be modeled as network dynamics with punctuate interactions.
-
Back and forth between the beta distribution and edge stochastic domination in ERAPs
D'AchilleMatteoI will survey recent and less recent results on the phase diagram of Euclidean Random Assignment Problems (ERAPs)...
-
Fluctuations of random convex interfaces
CalkaPierreWe consider the convex hull of a point set constituted with independent and uniformly distributed points in a smooth convex body K of R^d.
-
The unreasonable effectiveness of determinantal processes
GhoshSubhroshekharIn 1960, Wigner published an article famously titled “The Unreasonable Effectiveness of Mathematics in the Natural Sciences.
-
Structural properties of a conditioned random walk on the integer lattice with local constraints
FossSergueiWe consider a random walk on a one/multidimensional integer lattice with random bounds on local times, conditioned on the event that it hits a high level before its death.
-
Forward and backward limits
ThorissonHermannWe start by considering irreducible aperiodic positive recurrent Markov chains, and the proof of the main limit theorem...
-
-
Perfect simulation of interacting point processes with infinite interaction range
LöcherbachEvaI will present a (partly) new method for perfectly sampling from the stationary regime of point processes having an infinite number of interacting components.
Avec les mêmes intervenants et intervenantes
-
Optimal Convex and Nonconvex Regularizers for a Data Source
O'ReillyElizaRegularization is a widespread technique used in statistical estimation problems that helps to capture low dimensional structure in the data and improve signal recovery.
Sur le même thème
-
Bruit, erreur, anomalie et incertitude dans les données-PUDD
RossiFabriceLes données collectées sont systématiquement soumises à des perturbations de diverses natures, depuis le bruit de mesure de capteurs jusqu’aux erreurs de saisie.
-
Combinatorial maps in high genus
LoufBaptisteCombinatorial maps are a model of discrete geometry: they are surfaces made by gluing polygons along their sides, or equivalently, graphs drawn on surfaces. In this talk, I'll focus on the study of
-
Do there exist expanders with non-negative curvature ?
SalezJustinIn this talk I will briefly recall the framework of local weak limits of finite graphs introduced by I. Benjamini and O. Schramm
-
Tail bounds for detection times in mobile hyperbolic graphs
MitscheDieterMotivated by Krioukov et al.'s model of random hyperbolic graphs for real-world networks, and inspired by the analysis of a dynamic model of graphs in Euclidean space by Peres et al., we introduce a
-
Sofic entropy of processes on infinite random trees
BordenaveCharlesThis is a joint work with Agnes Backhausz et Balasz Szegedy. We define a natural notion of micro-state entropy...
-
Online matching for the multiclass Stochastic Block Model
SOPRANO LOTONahuelA matching in a graph is a set of edges that do not share endpoints. Developing algorithms that find large matchings is an important problem. An algorithm is said to be online if it has to construct
-
Critical cluster cascades
KirchnerMatthiasWe consider a sequence of Poisson cluster point processes...
-
Point processes on higher rank symmetric spaces and their cost
MellickSamuelCost is a natural invariant associated to group actions and invariant point processes on symmetric spaces (such as Euclidean space and hyperbolic space). Informally, it measures how difficult it is to
-
Parallel server systems in extended heavy traffic
CastielEyalThe standard setting for studying parallel server systems (PSS) at the diffusion scale is based on the heavy traffic condition (HTC)...
-
The Maximal Agreement Subtree problem for random trees
BudzinskiThomasConsider two binary trees whose leaves are labelled from 1 to n.
-
An Improved Lower Bound on the Largest Common Subtree of Random Leaf-Labeled Binary Trees
KhezeliAliIt is known that the size of the largest common subtree...
-
Reversible Markov decision processes
AnantharamVenkatA Markov decision process is called reversible if for every stationary Markov control strategy the resulting Markov chain is reversible.