Aprendizaje estadístico Gábor Lugosi (Universidad Pompeu Fabra) Kernel methods in a regularization framework Yoonkyung Lee (Ohio State University)
Texto completo
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12) .
(13)
(14) .
(15) . . ! "#. ! "#. ! "#. ! "#. ! "#. )*
(16) "#. )*
(17) "+,#. . $%. ' ''. (
(18)
(19) "'#. (
(20)
(21) "'#. ' 4. )*
(22) "#. -+
(23) ./0
(24) "+#.
(25) 1"+3#. 44 4, ,, ,3 33 3<. 5 678"+'# 6:;"+4#. (
(26)
(27) "'#. << <=. 6*
(28) "+#. ==. . 7 .
(29)
(30)
(31) .
(32) 1"+<#. +
(33)
(34) > !.
(35) . Aprendizaje estadístico Gábor Lugosi (Universidad Pompeu Fabra) En este curso se repasarán algunos conceptos de aprendizaje estadístico. Se investigarán los métodos de la minimización empírica del riesgo, la minimización estructural de riesgo, y algunos algoritmos de minimización de funciones convexas de riesgo, como "boosting". Entre las herramientas matemáticas esenciales se discutirán algunas desigualdades de concentración, las nociones de la dimensión Vapnik-Chervonenkis, promedios de Rademacher, etc. En la segunda parte del curso se discutirán algunas ideas básicas de la predicción online de sucesiones "individuales". Este modelo de aprendizaje está estrechamente relacionado con modelos de aprendizaje en la teoría de juegos. Se presentarán algunos resultados sobre algoritmos aleatorizados de predicción.. C2:. Kernel methods in a regularization framework Yoonkyung Lee (Ohio State University)
(36)
(37)
(38)
(39) .
(40)
(41)
(42)
(43)
(44)
(45)
(46)
(47)
(48)
(49)
(50)
(51)
(52)
(53)
(54)
(55)
(56)
(57)
(58)
(59)
(60)
(61)
(62)
(63)
(64)
(65)
(66)
(67) !"#$!
(68) . .
(69)
(70) .
(71) %
(72) &
(73)
(74)
(75)
(76)
(77)
(78)
(79)
(80)
(81)
(82)
(83)
(84)
(85)
(86)
(87)
(88)
(89) '
(90) %
(91) &
(92)
(93) ( %
(94) &
(95)
(96)
(97)
(98)
(99) (.
(100)
(101)
(102)
(103)
(104)
(105)
(106) ).
(107)
(108)
(109)
(110)
(111) (
(112) *
(113)
(114)
(115)
(116)
(117)
(118) (( (( (
(119)
(120) . .
(121)
(122)
(123)
(124)
(125)
(126)
(127)
(128) . '
(129) + , )
(130)
(131)
(132)
(133)
(134)
(135)
(136)
(137)
(138)
(139)
(140) -. .
(141) !!
(142) "
(143)
(144) "#
(145)
(146) !
(147) . %
(148) /
(149) 0.
(150) '
(151) -)
(152)
(153) 1
(154) !
(155) 2
(156)
(157) -
(158)
(159)
(160) /
(161)
(162) -
(163) 0
(164)
(165)
(166) 3
(167)
(168) %
(169) /
(170)
(171)
(172)
(173) - /
(174)
(175)
(176) - /
(177) 0
(178)
(179) /-
(180)
(181)
(182) 0.
(183)
(184) -
(185)
(186) 5
(187) - /
(188)
(189)
(190) -
(191) 6
(192) -
(193)
(194) /
(195) . . . $%&! '
(196) '"
(197)
(198) "
(199) #
(200) . !2 7
(201)
(202)
(203)
(204)
(205) 8
(206)
(207)
(208)
(209)
(210)
(211)
(212)
(213)
(214)
(215)
(216)
(217)
(218)
(219)
(220) 9
(221)
(222) : .
(223) . (
(224) '
(225) )
(226)
(227) ! *
(228) "
(229) "&
(230) ,
(231) $ %&'(
(232) . %/ - -
(233)
(234)
(235) ;
(236)
(237) - ; +
(238)
(239)
(240)
(241) <
(242)
(243)
(244)
(245) 0
(246) '
(247)
(248) /
(249) > ?
(250) <
(251) @
(252)
(253)
(254) . . '"%
(255)
(256) !!
(257) .
(258)
(259)
(260)
(261) .
(262) (
(263)
(264)
(265)
(266)
(267)
(268) ( 5 (
(269)
(270)
(271)
(272) .
(273) ( (
(274)
(275) 6 (
(276)
(277)
(278)
(279)
(280)
(281)
(282)
(283)
(284) 6 (
(285)
(286)
(287)
(288)
(289)
(290)
(291)
(292)
(293) (((
(294)
(295)
(296) A
(297) A
(298)
(299)
(300) .
(301) P6:. Adaptive estimation of linear functionals by model selection ˜ Carenne LUDENA Instituto Venezolano de Investigaciones Cient´ıficas, Venezuela e-mail: [email protected]. Abstract Consider the following model : Y (t) =< s, t > + √ L (t) , for all t ∈ H, n where H is an Hilbert space with inner product <, > and L is a centered Gaussian process over H such that for all t, u ∈ H, Cov(L(t), L(u)) =< t, u >. Model selection (MS) refers to the problem of selecting a model m leading to a least squares estimator sˆm with close to minimal risk value among a certain collection of possible models Sm ⊂ H. For each m the quadratic risk is Es − sˆm 2 = sm − s2 + dm,n where dm,n is a variance term independent of s , and sm is the projection of s over Sm . Thus the ”correct” choice of model m follows by estimating the bias term in the above bound, which can be done easily, and minimizing the sum. If, however, we consider the problem of estimating T (s), where T is some linear functional, the bias term cannot be simplified nor estimated directly and a new model selection procedure must be developed. We show the procedure is adaptive, up to a constant, and give non asymptotic inequalities for the quadratic risk. An application using a MRA for the problem of estimating a signal or its rth derivative at a given point is developed and minimax rates are seen to hold uniformly over Besov spaces. Simulations are included to illustrate the procedure’s performance.. 1.
(302) P7:. A statistical view at ill posed inverse problems and regularization methods ˜ Carenne LUDENA Instituto Venezolano de Investigaciones Cient´ıficas, Venezuela e-mail: [email protected] We are interested in recovering an unobservable signal x0 based on observations y(ti ) = A(x0 )(ti ) + εi , where A : X → Y is a compact operator with unbounded inverse, X, Y are Hilbert spaces and ti , i = 1, . . . , n is a fixed observation scheme. Although A(x0 ) ∈ Y , we cannot observe this function directly and inference on the unknown function x0 ∈ X will depend on the observed data y(ti ), i = 1, . . . , n. The ill-posedeness of the problem requires the introduction of some kind of regularization procedure, indexed by some regularization parameter α that measures the closeness of the regularized and the original (unregularized) inverse problem. Rules (and algorithms) for the choice of these regularization parameters, with or without prior information on the smoothness properties of the solution are a central issue. Our main interest will be the adaptive choice of α. For this it is necessary to introduce an appropriate penalty function. As examples of regularization procedures we consider linear model selection, Tikhonov methods and iterative procedures such as steepest descent methods. Based on concentration inequalities techniques we derive non asymptotic optimal upper bounds of the mean square error for each regularization method.. 1.
(303) Programa • Lunes 22 – 9:15– 9:30 Inscripci´on – 9:30–11:00 Aprendizaje estad´ıstico Gabor Lugosi – 11:00–11:30 Caf´e y galletas – 11:30–13:00 Kernel methods in a regularization framework Yoonkyung Lee – 13:00–14:00 Desigualdades e identidades de correlaci´on Victor P´ erez–Abreu – 14:00–16:00 Comida – 16:00–17:15 M´etodos Bayesianos para procesamiento de im´agenes Jos´ e Luis Marroqu´ın – 17:15–18:15 Quadratic Markovian probability fields for image binary segmentation. Mariano J. J. Rivera – 19:30–22:00 Taquiza en el CIMATEL • Martes 23 – 9:30–11:00 Aprendizaje estad´ıstico Gabor Lugosi – 11:00–11:30 Caf´e y galletas.
(304) – 11:30–13:00 Kernel methods in a regularization framework Yoonkyung Lee – 13:00–14:00 Adaptive estimation of linear functionals by model selection Carenne Lude˜ na – 14:00–16:00 Comida – 16:00–17:15 Algunas reflexiones sobre tipolog´ıa de modelos con motivo de nichos ecol´ogicos. Miguel Nakamura – 17:15–18:30 A statistical view of ill posed inverse problems and regularization methods. Carenne Lude˜ na • Mi´ercoles 24 – 9:30–11:00 Aprendizaje estad´ıstico Gabor Lugosi – 11:00–11:30 Caf´e y galletas – 11:30–14:00 Regularization and boosting Saharon Rosset – 14:00–16:00 Comida – 16:00–. Tarde libre.. • Jueves 25 – 9:30–11:00 Aprendizaje estad´ıstico Gabor Lugosi – 11:00–11:30 Caf´e y galletas – 11:30–14:00 Regularization and boosting Saharon Rosset – 14:00–16:00 Comida – 16:00–17:30 Kernel methods in a regularization framework Yoonkyung Lee – 17:30–18:00 Estudios de postgrado en Probabilidad y Estad´ıstica en CIMAT. Elo´ısa D´ıaz–Franc´ es.
(305) • Viernes 26 – 9:30–11:00 Aprendizaje estad´ıstico Gabor Lugosi – 11:00–11:30 Caf´e y galletas – 11:30–14:00 Data analytics for marketing decision support Saharon Rosset – 14:00–14:10 Clausura..
(306)
Documento similar
The main contribution of this paper is a new approach for the computation of the steady state flow of mono T-semiflow reducible nets. A tight bound is obtained solving a
Theorem 7 (Existence, uniqueness and bounds for Problem (11)) Assume that the hypotheses of Theorem 5 hold, consider some M > K, with K defined by Equation (44)), and let T be
Three different theoretical methods have been used: (i) molecular dynamics (MD) simulations in the framework of the density functional theory (DFT) and the density functional
In this work we deal with the problem of the trajectories corresponding to a functional whose Lagragian density is linear in the torsion function of the particle worldline trajectory
On the other hand at Alalakh Level VII and at Mari, which are the sites in the Semitic world with the closest affinities to Minoan Crete, the language used was Old Babylonian,
All these models are trained by minimizing an error term plus a regularization term, and thus they fit nicely in the domain of PMs, as the structure of the problem can be exploited
From the simulated t ¯ t sample, the total signal acceptance, including geometric acceptance and event re- construction and selection efficiencies, is found to be (23. The expected
What is perhaps most striking from a historical point of view is the university’s lengthy history as an exclusively male community.. The question of gender obviously has a major role