Share to: share facebook share twitter share wa share telegram print page

Posterior predictive distribution

In Bayesian statistics, the posterior predictive distribution is the distribution of possible unobserved values conditional on the observed values.[1][2]

Given a set of N i.i.d. observations , a new value will be drawn from a distribution that depends on a parameter , where is the parameter space.

It may seem tempting to plug in a single best estimate for , but this ignores uncertainty about , and because a source of uncertainty is ignored, the predictive distribution will be too narrow. Put another way, predictions of extreme values of will have a lower probability than if the uncertainty in the parameters as given by their posterior distribution is accounted for.

A posterior predictive distribution accounts for uncertainty about . The posterior distribution of possible values depends on :

And the posterior predictive distribution of given is calculated by marginalizing the distribution of given over the posterior distribution of given :

Because it accounts for uncertainty about , the posterior predictive distribution will in general be wider than a predictive distribution which plugs in a single best estimate for .

Prior vs. posterior predictive distribution

The prior predictive distribution, in a Bayesian context, is the distribution of a data point marginalized over its prior distribution . That is, if and , then the prior predictive distribution is the corresponding distribution , where

This is similar to the posterior predictive distribution except that the marginalization (or equivalently, expectation) is taken with respect to the prior distribution instead of the posterior distribution.

Furthermore, if the prior distribution is a conjugate prior, then the posterior predictive distribution will belong to the same family of distributions as the prior predictive distribution. This is easy to see. If the prior distribution is conjugate, then

i.e. the posterior distribution also belongs to but simply with a different parameter instead of the original parameter Then,

Hence, the posterior predictive distribution follows the same distribution H as the prior predictive distribution, but with the posterior values of the hyperparameters substituted for the prior ones.

The prior predictive distribution is in the form of a compound distribution, and in fact is often used to define a compound distribution, because of the lack of any complicating factors such as the dependence on the data and the issue of conjugacy. For example, the Student's t-distribution can be defined as the prior predictive distribution of a normal distribution with known mean μ but unknown variance σx2, with a conjugate prior scaled-inverse-chi-squared distribution placed on σx2, with hyperparameters ν and σ2. The resulting compound distribution is indeed a non-standardized Student's t-distribution, and follows one of the two most common parameterizations of this distribution. Then, the corresponding posterior predictive distribution would again be Student's t, with the updated hyperparameters that appear in the posterior distribution also directly appearing in the posterior predictive distribution.

In some cases the appropriate compound distribution is defined using a different parameterization than the one that would be most natural for the predictive distributions in the current problem at hand. Often this results because the prior distribution used to define the compound distribution is different from the one used in the current problem. For example, as indicated above, the Student's t-distribution was defined in terms of a scaled-inverse-chi-squared distribution placed on the variance. However, it is more common to use an inverse gamma distribution as the conjugate prior in this situation. The two are in fact equivalent except for parameterization; hence, the Student's t-distribution can still be used for either predictive distribution, but the hyperparameters must be reparameterized before being plugged in.

In exponential families

Most, but not all, common families of distributions are exponential families. Exponential families have a large number of useful properties. One of these is that all members have conjugate prior distributions — whereas very few other distributions have conjugate priors.

Prior predictive distribution in exponential families

Another useful property is that the probability density function of the compound distribution corresponding to the prior predictive distribution of an exponential family distribution marginalized over its conjugate prior distribution can be determined analytically. Assume that is a member of the exponential family with parameter that is parametrized according to the natural parameter , and is distributed as

while is the appropriate conjugate prior, distributed as

Then the prior predictive distribution (the result of compounding with ) is

The last line follows from the previous one by recognizing that the function inside the integral is the density function of a random variable distributed as , excluding the normalizing function . Hence the result of the integration will be the reciprocal of the normalizing function.

The above result is independent of choice of parametrization of , as none of , and appears. ( is a function of the parameter and hence will assume different forms depending on choice of parametrization.) For standard choices of and , it is often easier to work directly with the usual parameters rather than rewrite in terms of the natural parameters.

The reason the integral is tractable is that it involves computing the normalization constant of a density defined by the product of a prior distribution and a likelihood. When the two are conjugate, the product is a posterior distribution, and by assumption, the normalization constant of this distribution is known. As shown above, the density function of the compound distribution follows a particular form, consisting of the product of the function that forms part of the density function for , with the quotient of two forms of the normalization "constant" for , one derived from a prior distribution and the other from a posterior distribution. The beta-binomial distribution is a good example of how this process works.

Despite the analytical tractability of such distributions, they are in themselves usually not members of the exponential family. For example, the three-parameter Student's t distribution, beta-binomial distribution and Dirichlet-multinomial distribution are all predictive distributions of exponential-family distributions (the normal distribution, binomial distribution and multinomial distributions, respectively), but none are members of the exponential family. This can be seen above due to the presence of functional dependence on . In an exponential-family distribution, it must be possible to separate the entire density function into multiplicative factors of three types: (1) factors containing only variables, (2) factors containing only parameters, and (3) factors whose logarithm factorizes between variables and parameters. The presence of makes this impossible unless the "normalizing" function either ignores the corresponding argument entirely or uses it only in the exponent of an expression.

Posterior predictive distribution in exponential families

When a conjugate prior is being used, the posterior predictive distribution belongs to the same family as the prior predictive distribution, and is determined simply by plugging the updated hyperparameters for the posterior distribution of the parameter(s) into the formula for the prior predictive distribution. Using the general form of the posterior update equations for exponential-family distributions (see the appropriate section in the exponential family article), we can write out an explicit formula for the posterior predictive distribution:

where

This shows that the posterior predictive distribution of a series of observations, in the case where the observations follow an exponential family with the appropriate conjugate prior, has the same probability density as the compound distribution, with parameters as specified above. The observations themselves enter only in the form

This is termed the sufficient statistic of the observations, because it tells us everything we need to know about the observations in order to compute a posterior or posterior predictive distribution based on them (or, for that matter, anything else based on the likelihood of the observations, such as the marginal likelihood).

Joint predictive distribution, marginal likelihood

It is also possible to consider the result of compounding a joint distribution over a fixed number of independent identically distributed samples with a prior distribution over a shared parameter. In a Bayesian setting, this comes up in various contexts: computing the prior or posterior predictive distribution of multiple new observations, and computing the marginal likelihood of observed data (the denominator in Bayes' law). When the distribution of the samples is from the exponential family and the prior distribution is conjugate, the resulting compound distribution will be tractable and follow a similar form to the expression above. It is easy to show, in fact, that the joint compound distribution of a set for observations is

This result and the above result for a single compound distribution extend trivially to the case of a distribution over a vector-valued observation, such as a multivariate Gaussian distribution.

Relation to Gibbs sampling

Collapsing out a node in a collapsed Gibbs sampler is equivalent to compounding. As a result, when a set of independent identically distributed (i.i.d.) nodes all depend on the same prior node, and that node is collapsed out, the resulting conditional probability of one node given the others as well as the parents of the collapsed-out node (but not conditioning on any other nodes, e.g. any child nodes) is the same as the posterior predictive distribution of all the remaining i.i.d. nodes (or more correctly, formerly i.i.d. nodes, since collapsing introduces dependencies among the nodes). That is, it is generally possible to implement collapsing out of a node simply by attaching all parents of the node directly to all children, and replacing the former conditional probability distribution associated with each child with the corresponding posterior predictive distribution for the child conditioned on its parents and the other formerly i.i.d. nodes that were also children of the removed node. For an example, for more specific discussion and for some cautions about certain tricky issues, see the Dirichlet-multinomial distribution article.

See also

References

  1. ^ "Posterior Predictive Distribution". SAS. Retrieved 19 July 2014.
  2. ^ Gelman, Andrew; Carlin, John B.; Stern, Hal S.; Dunson, David B.; Vehtari, Aki; Rubin, Donald B. (2013). Bayesian Data Analysis (Third ed.). Chapman and Hall/CRC. p. 7. ISBN 978-1-4398-4095-5.

Further reading

  • Ntzoufras, Ioannis (2009). "The Predictive Distribution and Model Checking". Bayesian Modeling Using WinBUGS. Wiley. ISBN 978-0-470-14114-4.

Read other articles:

DreamWorks DragonsGenrePetualanganDrama komediFantasiAnimasiBerdasarkanHow to Train Your DragonPengembangLinda TeverbaughMike TeverbaughPengisi suaraJay BaruchelChris EdgerlyAmerica FerreraChristopher Mintz-PlasseJulie MarcusT.J. MillerNolan NorthZack PearlmanAndree VermuelenPenggubah lagu temaJohn Paesano[1]John Powell (tema)Negara asalAmerika SerikatBahasa asliBahasa InggrisJmlh. musim8Jmlh. episode118 (daftar episode)ProduksiProduser eksekutifLinda TeverbaughMike TeverbaughProduser...

 

Cục Công binhViệt Nam Cộng hòaPhù hiệuHoạt động1951 - 1975Quốc gia Việt Nam Cộng hòaPhục vụ Quân lực Việt Nam Cộng hòaPhân loạiĐơn vị Yểm trợBộ phận củaTổng cục Tiếp vận Bộ Tổng Tham mưuKhẩu hiệuCứu quốc' - Kiến quốcCác tư lệnhChỉ huynổi tiếng- Phạm Đăng Lân- Nguyễn Chấn- Nguyễn Văn Chức Thánh tổ: An Dương Vương. Công binh được gọi là một ngành trong Quân lực Vi�...

 

Kabupaten Changhua Kota Changhua, ibu kota Kabupaten Changhua Kantor Bupati Changhua DPRD Kapubaten Changhua Wei Ming-ku, Bupati Kabupaten Changhua. Kabupaten Changhua (Hanzi: 彰化縣) adalah sebuah kabupaten di Taiwan, Republik Tiongkok. Kabupaten ini terletak di Taiwan bagian tengah, secara geografis berbatasan dengan: Utara: Kota Taichung Timur: Kabupaten Nantou Selatan: Kabupaten Yunlin Barat: Selat Taiwan Kabupaten ini beribu kota di Kota Changhua. Wikimedia Commons memiliki media meng...

إيبان مايو كلينيك (بالإسبانية: Iban Mayo)‏  معلومات شخصية اسم الولادة (بالإسبانية: Iban Mayo Díez)‏  الميلاد 19 أغسطس 1977 (العمر 46 سنة)إيغوري، إسبانيا الطول 1.76 م (5 قدم 9 بوصة)* الجنسية  إسبانيا الوزن 65 كـغ (143 رطل؛ 10.2 ستون) الحياة العملية الدور دراج الفرق جيوكس تي إم س

 

Weenzen Flecken Duingen Wappen von Weenzen Koordinaten: 52° 1′ N, 9° 40′ O52.0216666666679.6725230Koordinaten: 52° 1′ 18″ N, 9° 40′ 21″ O Höhe: 230 m ü. NHN Fläche: 4,16 km² Einwohner: 362 (30. Nov. 2016)[1] Bevölkerungsdichte: 87 Einwohner/km² Eingemeindung: 1. November 2016 Postleitzahl: 31089 Vorwahl: 05185 Weenzen (Niedersachsen) Lage von Weenzen in Niedersachsen Weenzen im L...

 

Sri Lankan musician (born 1966) Jagath Wickramasinghe ජගත් වික්‍රමසිංහBorn (1966-04-10) 10 April 1966 (age 57)Kolonnawa, Sri LankaNationalitySri LankanEducationAnanda College Anuradapura Central CollegeSpouseGayani Prasangika Kohona (m. 2000)Children3ParentsJinadasa Wickramasinghe (father)Aryawathi Chandralatha (mother)Musical careerGenresPopsoulrhythm and bluesIndian classical musicInstrument(s)Vocals, guitar, tabla, sitar, violin and fluteYears active1983�...

Culture in India This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.Find sources: Indian tea culture – news · newspapers · books · scholar · JSTOR (October 2010) (Learn how and when to remove this template message) Cherry Resort inside Temi Tea Garden, Namchi, Sikkim Tea Garden on way to Devikulam, Kerala. Darjeeling tea pla...

 

Partido Izquierda Ciudadana de Chile Personen Partijleider Francisco Parraguez Leiva Zetels Kamer van Afgevaardigden 1 / 120 Senaat 0 / 38 Geschiedenis Opgericht 31 juli 1971 (als Izquierda Cristiana) Algemene gegevens Actief in Chili Hoofdkantoor Santo Domingo 1083, of 505,Santiago Richting Links Ideologie Democratisch socialismeChristelijk linksChristelijk socialismeHumanismeSocialisme van de 21e eeuwBolivarisme Kleuren ​ Paars Coalitie Concertación (1988-1993)Movimiento de Izquierda...

 

The examples and perspective in this article may not represent a worldwide view of the subject. You may improve this article, discuss the issue on the talk page, or create a new article, as appropriate. (March 2016) (Learn how and when to remove this template message) Celebration held on Easter Monday in Poland and Central EuropeSee also: Holi Soaking a Polish girl on śmigus-dyngus Dyngus in Wilamowice, southern Poland, where the men wander around the town in colourful handmade costumes in s...

Protein-coding gene in the species Homo sapiens NFIL3IdentifiersAliasesNFIL3, E4BP4, IL3BP1, NF-IL3A, NFIL3A, nuclear factor, interleukin 3 regulatedExternal IDsOMIM: 605327 MGI: 109495 HomoloGene: 3928 GeneCards: NFIL3 Gene location (Human)Chr.Chromosome 9 (human)[1]Band9q22.31Start91,409,045 bp[1]End91,423,832 bp[1]Gene location (Mouse)Chr.Chromosome 13 (mouse)[2]Band13 B1|13 27.68 cMStart53,121,245 bp[2]End53,135,109 bp[2]RNA expression ...

 

PT MD EntertainmentKantor pusat MD di JakartaNama dagangMD EntertainmentJenisPerseroan terbatasIndustriPerfilmanDidirikan2002; 20 tahun lalu (2002) di Jakarta, IndonesiaPendiri Dhamoo Punjabi Manoj Punjabi KantorpusatJl. Setiabudi Selatan No. 7, Jakarta Selatan, IndonesiaWilayah operasiIndonesiaTokohkunci Dhamoo Punjabi Manoj Punjabi Shania Punjabi Situs webmdentertainment.com PT MD Entertainment adalah rumah produksi yang didirikan pada tahun 2002 di Jakarta. MD Entertainment menjadi pe...

 

2024 Philadelphia PhilliesLeagueNational LeagueDivisionEastBallparkCitizens Bank ParkCityPhiladelphiaOwnersJohn S. MiddletonPresident of baseball operationsDave DombrowskiManagersRob ThomsonTelevisionNBC Sports PhiladelphiaNBC Sports Philadelphia +NBC Philadelphia (Tom McCarthy, John Kruk, Ben Davis, Mike Schmidt, Jimmy Rollins, Rubén Amaro Jr.)RadioPhillies Radio Network WIP SportsRadio 94.1 FM (English)(Scott Franzke, Larry Andersen, Kevin Stocker)WTTM (Spanish)(Danny Martinez, Bill K...

2011 video game 2011 video gameGal GunDeveloper(s)Inti CreatesPublisher(s)Alchemist (2011–12)JP/AS: Inti Creates (2021)NA/EU: PQube (2021)SeriesGal GunPlatform(s)Xbox 360PlayStation 3Nintendo SwitchWindowsReleaseXbox 360JP: January 27, 2011PlayStation 3JP: February 23, 2012Nintendo SwitchJP/AS: January 28, 2021NA/EU: February 12, 2021WindowsWW: February 12, 2021Genre(s)Bishōjo, rail shooterMode(s)Single-player Gal Gun[a], stylized as Gal★Gun, is a Japanese bishōjo rail shooter g...

 

2008 comedy film by Steven Brill Drillbit TaylorTheatrical release posterDirected bySteven BrillScreenplay by Kristofor Brown Seth Rogen Story by Edmond Dantès Kristofor Brown Seth Rogen Produced by Judd Apatow Susan Arnold Donna Arkoff Roth Starring Owen Wilson Leslie Mann Danny McBride Josh Peck CinematographyFred MurphyEdited by Brady Heck Thomas J. Nordberg Music byChristophe BeckProductioncompanyApatow ProductionsDistributed byParamount PicturesRelease date March 21, 2008 ...

 

Palace in Latvia Krāslava Old PalaceLatgale, Latvia Coat of Arms of KrāslavaKrāslava Old PalaceCoordinates55°54′07″N 27°08′55″E / 55.90194°N 27.14861°E / 55.90194; 27.14861TypePalaceSite informationConditionruins [1] Krāslava Old Palace ( Latvian: Krāslavas vecā pils, German: Kreslawka, Polish: Kraslaw ) was a Krāslava Manor house in the historical region of Latgale, in eastern Latvia. Only barely visible ruins remaining. The building housed ...

مرتفعات قره باغ   الاسم الرسمي (بالأذرية: Dağlıq Qarabağ)‏    الإحداثيات 39°48′55″N 46°45′07″E / 39.815278°N 46.751944°E / 39.815278; 46.751944  تقسيم إداري  البلد الاتحاد السوفيتي (1924–1991) أذربيجان (2020–) جمهورية آرتساخ (1991–2023)  خصائص جغرافية  المساحة 4402.979787 كيلومتر مربع  ...

 

American basketball player and businessman (born 1970) For the Oklahoma City Thunder sportscaster, see Brian Davis (sportscaster). Brian DavisDavis with Duke in 1988Personal informationBorn (1970-06-21) June 21, 1970 (age 53)Atlantic City, New JerseyNationalityAmericanListed height6 ft 7 in (2.01 m)Listed weight200 lb (91 kg)Career informationHigh schoolBladensburg (Bladensburg, Maryland)CollegeDuke (1988–1992)NBA draft1992: 2nd round, 48th overall pickSelected...

 

Street in North Oxford, England For other places with the same name, see Staverton. View east along Staverton Road from the junction with Woodstock Road Staverton Road is a residential road in North Oxford, England. Location The road runs east–west with a gentle curve halfway along.[1] At the western end of the road is a junction with Woodstock Road (A4144) and at the eastern end is a junction with Banbury Road (A4165), the two major arterial roads out of Oxford to the north. Opposi...

У этого термина существуют и другие значения, см. Менелай (значения). Менелай Александрийский Дата рождения около 70[1] Место рождения Александрия, Египет, Римская империя Дата смерти около 140[1] Место смерти Рим, Римская империя Научная сфера астрономия  Медиафай...

 

Djohar Arifin HusinAnggota Dewan Perwakilan Rakyat Republik IndonesiaPetahanaMulai menjabat 1 Oktober 2019PresidenJoko WidodoDaerah pemilihanSumatera Utara IIIKetua Umum PSSI 14Masa jabatan9 Juli 2011 – 18 April 2015PendahuluAgum Gumelar(Ketua Komite Normalisasi PSSI)PenggantiLa Nyalla Mattalitti Informasi pribadiLahir13 September 1950 (umur 73) Tanjung Pura, Langkat, Sumatera Utara, IndonesiaPartai politikPartai GerindraSuami/istriMarina HutabaratAnakSiti Faty Rahmari...

 
Kembali kehalaman sebelumnya