
© 2001 Kluwer Academic Publishers. Manufactured in The Ne(10)

Published: 2021-06-07   Source: unknown

Abstract. A data cube is a popular organization for summary data. A cube is simply a multidimensional structure that contains in each cell an aggregate value, i.e., the result of applying an aggregate function to an underlying relation. In practical situat

264 BARBARÁ AND WU

effects, all the possible $(k-1)$-factor effects, and so on up to the 1-factor effects and the mean ($\gamma$). For example, $\gamma_i^{A}$ is a one-factor effect; $\gamma_{ij}^{AB}$ is a two-factor effect, which shows the dependency within the distributions of the associated attributes $A, B$; and $\gamma_{ijk}^{ABC}$ is a three-factor effect, which shows the dependency within the distributions of the associated attributes $A, B, C$. Note that the size of the possible high-factor effects can be very large (for $\gamma_{ijk}^{ABC}$, the size is $I \times J \times K$).

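$$
\begin{aligned}
\log y_{ijkl} = {} & \gamma + \gamma_i^{A} + \gamma_j^{B} + \gamma_k^{C} + \gamma_l^{D} + \gamma_{ij}^{AB} + \gamma_{ik}^{AC} + \gamma_{il}^{AD} + \gamma_{jk}^{BC} \\
& + \gamma_{jl}^{BD} + \gamma_{kl}^{CD} + \gamma_{ijk}^{ABC} + \gamma_{ijl}^{ABD} + \gamma_{ikl}^{ACD} + \gamma_{jkl}^{BCD} + \gamma_{ijkl}^{ABCD}
\end{aligned}
\tag{2}
$$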
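To make the structure of Eq. (2) concrete, the following minimal sketch enumerates the terms of a saturated model over four attributes and computes the size of a single effect. The function names `saturated_terms` and `effect_size` and the dimension sizes are hypothetical, chosen only for illustration; they do not come from the paper.

```python
from itertools import combinations
from math import prod


def saturated_terms(dims):
    """Return the attribute subsets that index each term of a saturated model.

    The empty string stands for the overall mean (gamma with no superscript).
    Terms are listed from lowest order to highest order.
    """
    return ["".join(subset)
            for order in range(len(dims) + 1)
            for subset in combinations(dims, order)]


def effect_size(term, sizes):
    """Size of one effect: the product of the sizes of its dimensions."""
    return prod(sizes[d] for d in term)


# Hypothetical dimension sizes (I, J, K, L), used only as an example.
sizes = {"A": 3, "B": 4, "C": 5, "D": 2}

terms = saturated_terms("ABCD")
print(len(terms))                 # count of terms on the right-hand side of Eq. (2)
print(terms)
print(effect_size("ABC", sizes))  # I * J * K for the three-factor effect ABC
```

For four attributes this lists 1 + 4 + 6 + 4 + 1 = 16 terms, matching the right-hand side of Eq. (2), and with the assumed sizes the ABC effect has 3 × 4 × 5 = 60 entries.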

There are also some linear constraints to overcome the overparameterization, as shown in Eq. (3).

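$$
\begin{aligned}
& \gamma_{.}^{A} = \gamma_{.}^{B} = \gamma_{.}^{C} = \gamma_{.}^{D} = 0 \\
& \gamma_{i.}^{AB} = \gamma_{.j}^{AB} = \gamma_{i.}^{AC} = \gamma_{.k}^{AC} = \cdots = \gamma_{.l}^{CD} = 0 \\
& \quad \vdots \\
& \gamma_{ijk.}^{ABCD} = \gamma_{ij.l}^{ABCD} = \gamma_{i.kl}^{ABCD} = \gamma_{.jkl}^{ABCD} = 0
\end{aligned}
\tag{3}
$$

where a dot "." means that the parameter has been summed over the index (for example, $\gamma_{i.}^{AB} = \sum_{j=0}^{J-1} \gamma_{ij}^{AB}$). In short, the constraints specify that the loglinear parameters sum to 0 over all indices.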

There are two approaches to estimate the model coefficients. One is the iterative proportional fitting method, based on solving the corresponding likelihood equations. This method can always get the precise solutions, but it needs many iterative steps over the data.
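As an illustration of why iterative proportional fitting needs repeated passes over the data, here is a minimal sketch of the standard procedure on a small 3-dimensional table, fitting the model that keeps all two-factor effects (AB, AC, BC) and drops the three-factor effect. This is a generic textbook formulation, not necessarily the variant the authors used; the function name `ipf_no_three_factor`, the iteration count, and the sample counts are assumptions, and the code assumes every observed two-way margin is strictly positive.

```python
from itertools import product


def ipf_no_three_factor(y, iters=100):
    """Fit the AB + AC + BC loglinear model to a 3-D table of positive counts.

    y is a nested list indexed as y[i][j][k]. Returns a dict that maps each
    cell (i, j, k) to its fitted value.
    """
    I, J, K = len(y), len(y[0]), len(y[0][0])
    cells = list(product(range(I), range(J), range(K)))
    obs = {c: float(y[c[0]][c[1]][c[2]]) for c in cells}
    fit = {c: 1.0 for c in cells}              # start from a uniform table

    for _ in range(iters):                     # each iteration is one sweep over all margins
        for a, b in ((0, 1), (0, 2), (1, 2)):  # AB, AC, BC margins in turn
            obs_m, fit_m = {}, {}
            for c in cells:                    # accumulate observed and fitted margins
                key = (c[a], c[b])
                obs_m[key] = obs_m.get(key, 0.0) + obs[c]
                fit_m[key] = fit_m.get(key, 0.0) + fit[c]
            for c in cells:                    # rescale so the fitted margin matches
                key = (c[a], c[b])
                fit[c] *= obs_m[key] / fit_m[key]
    return fit


# Hypothetical 2 x 2 x 2 table of counts, for illustration only.
y = [[[10, 20], [30, 40]],
     [[15, 25], [35, 5]]]
fit = ipf_no_three_factor(y)
for cell in sorted(fit):
    print(cell, round(fit[cell], 3))
```

Each rescaling step matches one set of margins exactly but disturbs the others, which is why the sweep has to be repeated until the fitted table stops changing.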

The other method is to compute the coefficients from the values directly. The coefficients corresponding to any group-by G are obtained by subtracting from the average $l$ value at group-by G all the coefficients from higher level group-by-s. For instance, Eq. (4) shows the parameters in a 4-dimensional table.

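$$
\begin{aligned}
\gamma &= l_{....} \\
\gamma_i^{A} &= l_{i...} - \gamma \\
& \;\;\vdots \\
\gamma_{ij}^{AB} &= l_{ij..} - \gamma_i^{A} - \gamma_j^{B} - \gamma \\
\gamma_{ijk}^{ABC} &= l_{ijk.} - \gamma_{ij}^{AB} - \gamma_{ik}^{AC} - \gamma_{jk}^{BC} - \gamma_i^{A} - \gamma_j^{B} - \gamma_k^{C} - \gamma \\
& \;\;\vdots
\end{aligned}
\tag{4}
$$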

where $l_{....}$ is the grand mean or average (note that a "·" denotes an aggregation along that dimension) and $l_{i...}$ is the mean over all values along the $i$th member of dimension $A$. From $l_{....}$ and $l_{i...}$, $\gamma_i^{A}$ denotes how much the average of the values along the $i$th member of dimension $A$ differs from the overall average.
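The mean-subtraction rule of Eq. (4) can be sketched on a 2-dimensional table, where only $\gamma$, $\gamma_i^{A}$, $\gamma_j^{B}$ and $\gamma_{ij}^{AB}$ exist. The sketch below assumes that $l$ denotes the logarithm of the cell value and that all cells are strictly positive; the function name `direct_coefficients` and the sample table are hypothetical.

```python
from math import log


def direct_coefficients(y):
    """Loglinear coefficients of a 2-D table of positive values, following Eq. (4).

    Returns (g, gA, gB, gAB): the overall mean, the two one-factor effects,
    and the two-factor effect.
    """
    I, J = len(y), len(y[0])
    l = [[log(v) for v in row] for row in y]            # l_ij = log y_ij
    g = sum(sum(row) for row in l) / (I * J)            # gamma = l_..
    gA = [sum(l[i]) / J - g for i in range(I)]          # gamma_i^A = l_i. - gamma
    gB = [sum(l[i][j] for i in range(I)) / I - g        # gamma_j^B = l_.j - gamma
          for j in range(J)]
    gAB = [[l[i][j] - gA[i] - gB[j] - g for j in range(J)]
           for i in range(I)]                           # what remains after lower-order terms
    return g, gA, gB, gAB


# Hypothetical 2 x 3 table of cell values, for illustration only.
y = [[4.0, 8.0, 2.0],
     [6.0, 3.0, 9.0]]
g, gA, gB, gAB = direct_coefficients(y)

print(sum(gA), sum(gB))            # both are zero up to rounding, as Eq. (3) requires
print([sum(row) for row in gAB])   # each row of gamma^AB also sums to zero up to rounding
```

Because every coefficient is a difference of means, a single aggregation pass per group-by is enough, and adding $\gamma + \gamma_i^{A} + \gamma_j^{B} + \gamma_{ij}^{AB}$ recovers $\log y_{ij}$ for the 2-dimensional case.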

Sarawagi et al. (1998) present fast computation techniques that make this approach feasible for large sets, and although this method does not give precise solutions, its faster
