• No se han encontrado resultados

A simple and fast representation space for classifying complex time series

N/A
N/A
Protected

Academic year: 2020

Share "A simple and fast representation space for classifying complex time series"

Copied!
8
0
0

Texto completo

(1)

A

simple

and

fast

representation

space

for

classifying

complex

time

series

Luciano Zunino

a,b,

,

Felipe Olivares

c

,

Aurelio

F. Bariviera

d

,

Osvaldo

A. Rosso

e,f,g

aCentrodeInvestigacionesÓpticas(CONICETLaPlataCIC),C.C.3,1897 Gonnet,Argentina

bDepartamentodeCienciasBásicas,FacultaddeIngeniería,UniversidadNacionaldeLaPlata(UNLP),1900LaPlata,Argentina cInstitutodeFísica,PontificiaUniversidadCatólicadeValparaíso(PUCV),23-40025 Valparaíso,Chile

dDepartmentofBusiness,UniversitatRoviraiVirgili,Av. Universitat 1,43204Reus,Spain

eInstitutodeFísica,UniversidadeFederaldeAlagoas(UFAL),BR104Nortekm97,57072-970,Maceió,Alagoas,Brazil

fInstitutoTecnológicodeBuenosAires(ITBA)andCONICET,C1106ACD,Av.EduardoMadero399,CiudadAutónomadeBuenosAires,Argentina

gComplexSystemsGroup,FacultaddeIngenieríayCienciasAplicadas,UniversidaddelosAndes,Av.Mons.ÁlvarodelPortillo12.455,LasCondes,Santiago,Chile

a

b

s

t

r

a

c

t

Keywords: Timeseriesanalysis Abbevalue Turningpoints Financialdata

Electroencephalogramdata Heartratevariability

Inthecontextoftimeseriesanalysisconsiderableefforthasbeendirectedtowardstheimplementation of efficient discriminatingstatistical quantifiers. Very recently, asimple and fastrepresentationspace hasbeenintroduced,namelythenumberofturningpointsversustheAbbevalue.Itisabletoseparate timeseriesfromstationaryandnon-stationaryprocesseswithlong-rangedependences.Inthisworkwe showthatthisbidimensionalapproachisusefulfordistinguishingcomplextimeseries:differentsetsof financialandphysiologicaldataareefficientlydiscriminated.Additionally,amultiscalegeneralizationthat takesintoaccountthemultipletimescalesofteninvolvedincomplexsystemshasbeenalsoproposed. Thismultiscaleanalysisisessentialtoreachahigherdiscriminativepowerbetweenphysiologicaltime seriesinhealthanddisease.

1. Introduction

Typically, time series of measured variables are employed to analyzethe dynamical behavior of complex systems. These tem-poral recordsneed tobe suitably characterizedin orderto reach a more reliable comprehension of the underlying nature of the phenomenon of interest. Obviously, this understanding is essen-tialformodelingandforecastingpurposes.Inparticular,numerous algorithmsforquantifyingthedisorderandcomplexityoftime se-riesgeneratedfromnonlineardynamicalsystemshavebeen devel-oped.Withoutbeingexhaustive,wecanmentionLempel–Ziv com-plexity[1],correlation dimension[2,3],Lyapunovexponents[4,5], Kolmogorov[6],approximate[7],sample [8]andpermutation [9]

entropies,fractal [10] and multifractal[11] measures, and statis-ticalcomplexity [12]. Moreover, combinations of these measures havebeenalsoproposedespeciallyfordiscriminatingand

classify-*

Correspondingauthorat:CentrodeInvestigacionesÓpticas(CONICETLaPlata– CIC),C.C.3,1897 Gonnet,Argentina.

E-mailaddresses:[email protected](L. Zunino),[email protected]

ing dynamical systems.The usefulnessofthesemultidimensional schemeshasbeen confirmedforheterogeneous goalssuch asthe distinction betweennoise andchaos [13,14], the characterization ofalanguagecorpus[15],thequantificationoffinancialmarket ef-ficiency[16,17],the automaticdetectionofepilepticseizure from electroencephalograms [18], the discrimination of songs in mas-sive databases [19], and the classification of cardiac signals [10, 20,21]andtexture images[22], pointingout onlya few ofmany applications.Despitetheexistingcontributions,characterizing the underlyingdynamicsofcomplexsystemfromtimeseriesisstilla challengingproblemofcurrentresearch.

Tarnopolskihasveryrecentlyintroducedarepresentationspace byplottingtwostatisticalfeaturesassociatedwithtimeseries:the Abbevalue andthenumberofturningpoints [23].Numerical re-alizationsofstationaryandnon-stationarylong-rangedependence stochastic processes are successfully discriminated in this plane. Moreprecisely,fractionalBrownianmotion(fBm),fractional Gaus-siannoise(fGn),anddifferentiatedfGn(dfGn)werefoundtoform distinct branchesintheproposed space. Inthiswork, we goone step further by showing that this bidimensional scheme can be used as a discriminator of dynamics. Analysis of financial and physiological time series have been included for illustrating the

(2)

robustness of the technique when dealing with real time series. Amultiscalegeneralization,inspiredbythemultiscaleentropy al-gorithmproposed by Costa etal.[24],is introducedforunveiling hiddeninformationoverdifferentlevelsoftemporalresolution of the original signal. The higher discriminativepower atparticular timescalesobservedinthephysiologicalapplicationsconfirmsthe advantages of implementingthe proposed multiscale analysis. As it will be shown below, our results demonstrate that the Abbe valueandthenumberofturningpointsaretwodistinctivefeatures for identifying differences in complex systems dynamics. Conse-quently,thelocationintherepresentationspace,thatresultsfrom computingsimultaneously both quantifiers, deserves special con-siderationfortimeseriesclassificationpurposes.

The remainder of this paper is structured as follows. In Sec-tion2,theTarnopolski’sdiagramtogetherwiththeproposed mul-tiscalerecipeandacoupleofbenchmark testsare discussed.The performanceofthemethodasadiagnostictoolisanalyzedin Sec-tion3throughseveralreal-worldapplications.Finally,inSection4, themainresultsandconclusionsofthisworkaresummarized.

2. Methods

2.1. Tarnopolskiplane

Givingatime series

{

xi

}

i=n 1,theAbbevalue,denoted

A

inthis

paper,isdefinedashalfoftheratioofthemeansquaresuccessive differencetothevariance,

A

=

n

2

(

n

1

)

n−1

i=1

(

xi+1

xi

)

2

n

i=1

(

xi

− ¯

x

)

2

(1)

with

¯

xthemeanof

{

xi

}

[25–27].TheAbbestatisticquantifiesthe

smoothness of a time series: it is close to zero for time series displayingahighdegree ofsmoothnesswhileit tends toone for whitenoise[23].Accordingtoourknowledge,veryfewworkshave implementedthismeasureforpracticalapplications.Withinthese few exceptions,theAbbe value andanotherrelatedmeasure,the excess Abbevalue, have beensuccessfullyapplied in stellar vari-abilitystudiesforidentifyingtransientsinlarge-scalesurveys[28]. A turning point in a time series is observed when the mid-dle value xi of a sequence of three consecutive observations is

lowerorhigherthantheothertwovalues,xi−1 andxi+1,that

sur-roundit[29].Equalvalues,i.e. xj

=

xkfor j

=

k,areneglected.This

assumption isjustified whenever

{

xi

}

ni=1 hasa continuous

distri-bution.From an arbitrarytime series theprobability of findinga turningpoint, denoted by

T

,can be empiricallyestimatedby its relative frequency.Inparticular,

T

isasymptotically equal to2

/

3 for random time series. It is important to stress here that esti-matingtheturningpointsprobabilityisequivalenttocalculatethe zerocrossingrate(ZCR)ofthedifferentiated timeseries.ZCR has beenpreviouslyimplementedfordiverseapplications,e.g.the de-tection of voiced andunvoiced sounds in speech signals[9] and theautomaticdiagnosisoftonic-clonicepilepticseizures[30].The probability of finding a turning point is also linked withordinal patterns.Indeed,estimating

T

isequivalent tocalculatethe rela-tivefrequencyoffourofthesixpossiblemotifswhenembedding dimension D

=

3 isconsidered(pleaseseepermutation indices2, 3,4and5inFig. 2aofRef.[21]).

Tarnopolski introduced a model representation spaceby plot-tingthe fractionof turning points ofa time series versus its as-sociatedAbbe value.This

T

vs

A

diagramisabletodiscriminate fBm, fGn anddfGn (please see Fig. 5 of Ref. [23]). Moreover, an invertible relationship is found between

A

and the Hurst expo-nent H. This functional form has been then used for estimating the Hurst exponentof several real world data. Briefly, the Hurst exponent H is a scaling exponent that measures the long-range

dependence intime series.Furtherdetails about H can be found inRefs.[31,32]. ForillustratingtheabilityoftheTarnopolskiplane tocharacterizelong-rangedependenceintimeseries,wehave an-alyzed the location of generic 1

/

noisesin thisbidimensional scheme.InFig. 1a), wedepictthepositionofcolorednoiseswith

α

rangingfrom

1to3instepsofsize0.1.Averageandstandard deviation(SD)(displayedaserrorbars)ofestimated

A

and

T

val-uesforonehundredindependentrealizationsoflengthn

=

214for each

α

exponenthavebeenplotted. TheFourierFilteringMethod (FFM)hasbeenimplementedinMatlab forgeneratingthese long-range power-law correlated time series. In the FFM, the Fourier components of an uncorrelated sequence of Gaussian-distributed random numbers are filtered with a suitable power-law filter in order to introduce correlationsamong the variables. We address the readerto Refs. [33,34] for moredetails about thisalgorithm. Some examples of these artificial long-range correlated time se-riesareshowninFig. 1b).Itcanbeconcludedthatcolorednoises with

α

between

1 and 1 are more noisy andbetter discrimi-natedbytheAbbevalue.Whereas,whenthepower-lawexponent isbetween1and3,the fractionofturningpoints ismore appro-priate fordistinguishing betweenthem. We have also confirmed that avery similar evolutionintheTarnopolski plane isfollowed bylonger1

/

artificialtimeseries(n

=

100

,

000).Asexpected,in thiscase,shorterSDerrorbarsareobtained.

2.2. Multiscaleanalysis

Itiswidelyrecognizedthattimeseriesarisingfromsome rep-resentative variable of nonlinear complex systems have a multi-scale nature, i.e. the observed dynamicsis often strongly depen-dent on the resolution scale used to sample the signal. For il-lustrating this multiscale phenomenon, we consider the analysis of time series derived fromnonlinear dynamics ina numerically controlled situation.More precisely, we estimate

A

and

T

from realizationsofthex-variableoftheLorenzsystem:

˙

x

=

σ

(

y

x

),

y

˙

=

x

(

ρ

z

)

y

,

˙

z

=

xy

β

z

.

(2)

Following the example included in Ref. [23], time series of length n

=

214data points weregeneratedwithinitial conditions

(

x0

,

y0

,

z0

)

=

(

1

,

5

,

10

)

, and standard parameters

σ

=

10,

ρ

=

28 and

β

=

8

/

3 for which thesystemexhibits chaotic behavior. The time series were numerically integrated by using the Matlab’s

ode45 function, that implements fourth and fifth order Runge– Kutta numerical integration algorithms, with an integration step equalto0.001.Samplingperiods

δ

t rangingfrom0.001to1witha

stepequal to0.001are considered.Weanalyzed timeserieswith

n

=

214 datapoints foreach

δ

t.Thefirst 105 iterations were

dis-cardedtoavoidpossibletransients.Theevolutionofthelocationin theTarnopolskiplaneoftheseonethousandnumericalrealizations oflengthn

=

214withdifferenttemporalresolutionsisdepictedin Fig. 2.It is worth remarkinghere that avery similar behavior is obtainedbyanalyzinglongernumericalrealizations(n

=

100

,

000). Ontheonehand,forlowvaluesof

δ

t,anartificialregularbehavior

(3)

Fig. 1.a)LocationintheTarnopolskirepresentationspaceofgeneric1/fαnoiseswithα ∈ {−1, 0.9, ..., 3}.Onehundrednumericalrealizationsoflengthn=214 data pointsforeachαexponentweregeneratedwithFFM.Meanandstandarddeviation(displayedaserrorbars)ofestimatedvaluesforbothquantifiers,AandT,overthese onehundredsimulationsareshown.Positionsofquantifiersmovegraduallyfromtherightuppercornertotheleftbottomcornerasαincreases.Horizontalandvertical dashedlinesindicatethetheoreticalvalueofquantifiersforwhitenoisestochasticprocesses.b)Someexamplesofthenumericallygeneratedlong-rangepower-lawlinearly correlatedtimeseries.Only103datapointsareplottedforabettervisualization.

Multiscale entropy [24] and scale-dependent Lyapunov expo-nents [35] are two generalized quantifiers introduced precisely withtheaimofcharacterizing thedifferentsignalbehaviorsona widerangeofscales simultaneously.Bearinginmindtheinherent scale-dependentnature ofthephysiologicalsystems wewill ana-lyzeinthenextsection,amultiscaletoolisproposedby construct-ingcoarse-grainedtimeseriesatmultipletemporalscalesand esti-matingthestatisticalquantifiers

A

and

T

foreachofthese trans-formedtimeseries.Coarse-grainedsequencesarereconstructedby followingthesameprocedureintroducedinRef.[24].Thatis,the original record of length n is divided into non-overlapping seg-ments of length

τ

, and the mean value is calculated for each segmentgeneratingsmoothingsequences

{

j

}

oflength

n

/

τ

,

j

=

1

τ

i=(j−1+1

xi

,

1

j

n

/

τ

(3)

with

n

/

τ

thelargestintegernotgreaterthann

/

τ

.Theanalysisof thelocationofbothquantifiers,

A

and

T

,intheTarnopolskiplane asa function of the temporal scale factor

τ

allows detection of intrinsiccomplexstructuresacross multipletemporal resolutions.

Furthermore,theidentificationofoptimaltimescalesforthe clas-sification ofcomplexsystemscould beachievedbyimplementing thisapproach.Forcheckingtheusefulnessofthismultiscalerecipe, we have analyzed an oversampled long numerical realization of thex-variableoftheLorenzsystemwiththesameparametersand initial conditionsdetailedpreviously.Thechosen samplingperiod was

δ

t

=

0

.

001 and the time series length n

=

224. A multiscale

analysis withtemporal scale factors 1

τ

1000 hasbeen per-formed.Evolutionofthelocationofquantifiersforthesetemporal scalesisdepictedinFig. 3(bluecurve).Theevolutionobtainedfor theoriginal analysisfordifferentsamplingperioddeveloped pre-viously is also plotted (black curve) for the sake of comparison. Overall, the results are qualitatively similar. The observed quan-titative differences are attributedto the non-overlapping moving averagefilter(Eq.(3))proposedforconstructingthecoarse-grained timeseries.

3. Real-worldapplications

(4)

Fig. 2.LocationintheTarnopolskiplaneofonethousandnumericalrealizationsof thex-variableoftheLorenzsystem(σ=10,ρ=28,

β

=8/3,and

(

x0, y0, z0) = (1, 5, 10))withdifferent samplingperiods(δt∈ {0.001, 0.002, ..., 1}).Timeseries

oflengthn=214 datapointsweregeneratedwithinitialconditions

(

x0, y0, z0) = (1, 5, 10).Positionsofquantifiersmovegraduallyfromthe leftbottomcornerto therightuppercorneras

δ

t increases.Simulationsofthex-variableforthe two

extremesamplingperiods,i.e.δt=0.001 and

δ

t=1,areshowninthelowerand

upperinsets,respectively.Asimilarevolutionhasbeenconfirmedforlonger nu-mericalrealizations(n=100,000).

Fig. 3.EvolutionoflocationintheTarnopolskiplaneforanoversampled(δt=0.001)

and long(n=224)numerical realizationofthe x-variable ofthe Lorenz system (σ=10,ρ=28,

β

=8/3,and

(

x0, y0, z0) =(1, 5, 10))byimplementinga multi-scaleanalysiswithtemporalscalefactor τ rangingfrom1to1000(bluecurve). Positionsofquantifiersmovegraduallyfromtheleftbottomcornertotheright up-percornerasτ increases.Theoriginalevolutionobtainedfordifferentsampling periodsisalsodisplayed(blackcurve)foreasycomparison.(Forinterpretationof thereferencestocolorinthisfigurelegend,thereaderisreferredtotheweb ver-sionofthisarticle.)

thesereal signalsare often contaminated by noise and other ar-tifacts.Thus, theextractionofmeaningfulinformationfromthem isusuallyachallengingtask.Takingthisintoaccount,nextwewill test theefficiencyof theTarnopolski representationspacein real situations.

3.1. Efficiencyofdeveloped,emergingandfrontierstockmarkets

It is well-known that a stock market is considered efficient whenever its prices follow a random walk. That is, the incre-ments ofthe prices should be independentand obey a Gaussian distribution.However, it is alsowidely acceptedthat this isonly an idealized first approximation, and deviations from thiswhite noise model, violating either the independence or Gaussian as-sumptions,have beenfound inmany empiricalstudies since the

revolutionary paperof BenoitMandelbrot [36]. In particular, cor-relatedmarkets open thedoorto arbitrageopportunitiesbecause past prices can help topredict future prices.Indeed, it hasbeen shown that emerging markets have greater correlations than de-velopedones,suggestingmorepredictability[37,38].Moreover,the Hurst exponent has been widely proposed to quantify the stock marketefficiency[39–43].

GiventhelinkfoundbetweentheHurstexponentandthetwo statisticalquantifiersthatdefinetheTarnopolskiplane,wepropose to use thisrepresentationspace todistinguish the stage ofstock marketdevelopment.Withthisaiminmind,weanalyzetheprice returns offorty-eightstock market indicesfordifferentcountries. Codesandnames oftheseindices,collected fromtheDatastream platform ( http://financial.thomsonreuters.com/en/products/tools- applications/trading-investment-tools/datastream-macroeconomic-analysis.html), are detailedin Table 1. Daily prices beginning on January3,2000andendingonMay27,2016areconsidered(4,280 observations).FollowingtheclassificationprovidedbytheMorgan StanleyCapitalIndex(MSCI)methodology(https :/ /www.msci .com), thereare twentydeveloped,seventeenemergingandeleven fron-tierstockmarkets.

Locations inthe

T

vs

A

diagram ofthe dailypricereturnsof the forty-eight stock market indices are plotted in Fig. 4. Mean and standard deviation (displayed as error bars) of one thou-sand numericalindependentrealizationsoffGnwithHurst expo-nents H

∈ {

0

.

05

,

0

.

10

,

...,

0

.

95

}

andthesamelengthofthereturns (n

=

4

,

279) are alsoshown. These simulations were obtainedby consecutivedifferencesoffBmgeneratedviatheMATLABfunction

wfbm. On the one hand, it is observed that positions of indices associated withdevelopedcountries(blue circles) are,inaverage, closer to the idealefficiency point, i.e.

A

1 and

T

2

/

3, that correspondstowhitenoise.Particularly,pricereturnsofthe Amer-icanstockmarketindex(Standard&Poor’s500)show an antiper-sistent behavior and, according to its location in the Tarnopolski plane, could be modeled asa fGn with H

=

0

.

45. On the other hand,pricereturnindicesofstockmarketfromemergingcountries (green triangles) havelowerestimatedvaluesforbothquantifiers confirming the presence of persistency in their dynamics. There are twoexceptions,namelyThailand(

A

0

.

987,

T

0

.

661)and Turkey(

A

1

.

006,

T

0

.

662),locatedwithintheregionof max-imum efficiency.Finally, pricereturnindicesof frontiercountries (redsquares)canbe consideredasthemostinefficientsincethey are more distant with respect to the white noise location. The presence of strong long-range correlations in their dynamics is themainreasonofthisinefficientlocation.Argentina(

A

0

.

945,

(5)

Table1

CodesandnamesofthestockmarketindicesdownloadedfromDatastream.Developed,emergingandfrontierstock mar-kets(classifiedfollowingtheMSCImethodology)aredenotedasD,EandF,respectively.

Country Datastream code Index MSCI classification

1. Netherlands AMSTEOE AEX INDEX (AEX) D 2. Jordan AMMANFM AMMAN SE FINANCIAL MARKET F 3. Argentina ARGMERV ARGENTINA MERVAL F 4. Greece GRAGENL ATHEX COMPOSITE E 5. Austria ATXINDX ATX – AUSTRIAN TRADED INDEX D 6. Thailand BNGKSET BANGKOK S.E.T. E

7. Belgium BGBEL20 BEL 20 D

8. Turkey TRKISTB BIST NATIONAL 100 E 9. Hungary BUXINDX BUDAPEST (BUX) E 10. Chile IGPAGEN CHILE SANTIAGO SE GENERAL (IGPA) E 11. Sri Lanka SRALLSH COLOMBO SE ALL SHARE F 12. Croatia CTCROBE CROATIA CROBEX F 13. Germany DAXINDX DAX 30 PERFORMANCE D 14. Egypt EGHFINC EGYPT HERMES FINANCIAL E

15. France FRCAC40 FRANCE CAC 40 D

16. United Kingdom FTSE100 FTSE 100 D 17. Malaysia FBMKLCI FTSE BURSA MALAYSIA KLCI E

18. Italy FTSEMIB FTSE MIB INDEX D

19. South Africa JSEOVER FTSE/JSE ALL SHARE E

20. Hong Kong HNGKNGI HANG SENG D

21. Hong Kong HKHCHAF HANG SENG CHINA AFFILIATED CORP D 22. Hong Kong HKHCHIE HANG SENG CHINA ENTERPRISES D

23. Spain IBEX35I IBEX 35 D

24. Indonesia JAKCOMP IDX COMPOSITE E 25. Ireland ISEQUIT IRELAND SE OVERALL (ISEQ) D

26. Israel ISTA100 ISRAEL TA 100 D

27. Pakistan PKSE100 KARACHI SE 100 F 28. Kenya NSEINDX KENYA NAIROBI SE (NSE20) F 29. Korea KORCOMP KOREA SE COMPOSITE (KOSPI) E

30. Lebanon LBBLOMI LEBANON BLOM F

31. Germany MDAXIDX MDAX FRANKFURT D 32. Mexico MXIPC35 MEXICO IPC (BOLSA) E 33. Oman OMANMSM AN MUSCAT SECURITIES MKT. F 34. Denmark COSEASH OMX COPENHAGEN (OMXC) D 35. Finland HEXINDX OMX HELSINKI (OMXH) D 36. Sweden SWSEALI OMX STOCKHOLM (OMXS) D 37. Estonia ESTALSE OMX TALLINN (OMXT) F 38. Philippine PSECOMP PHILIPPINE SE I(PSEi) E 39. Czech Republic CZPXIDX PRAGUE SE PX E 40. Romania RMBETRL ROMANIA BET (L) F 41. Russia RSMICEX RUSSIAN MICEX INDEX E 42. USA S&PCOMP S&P 500 COMPOSITE D 43. China CHSASHR SHANGHAI SE A SHARE E 44. China CHZBSHR SHENZHEN SE B SHARE E 45. Switzerland SWISSMI SWISS MARKET (SMI) D 46. Taiwan TAIWGHT TAIWAN SE WEIGHED TAIEX E

47. Japan TOKYOSE TOPIX D

48. Tunisia TUTUNIN TUNISIA TUNINDEX F

May27,2016).ComparisonoftheperformanceoftheTarnopolski representationspacewithother implementedtools for character-izingfinancialdataefficiency,suchasentropy-related methodolo-gies[44–47] andtheEfficiencyIndex[48–50],is beyondthe pur-poseofthepresentworkandwillbestudiedinafutureresearch.

3.2.Electroencephalogramsfromhealthyandepilepticpatients

Discriminationofbrainelectricalactivityfromdifferentregions and from different physiological and pathological brain states is obviouslyaveryrelevantissuesincethisinformationcouldbe po-tentiallyuseful formedical diagnosis. Motivatedby thisfact, we analyzefivedifferentsetsofelectroencephalogram(EEG)time se-riesfordifferentgroupsandrecordingregions:surface(scalp)EEG recordingsfromfivehealthyvolunteersinanawakestatewitheyes open(Set A) andclosed(Set B), intracranial EEGrecordingsfrom five epilepsy patients during the seizure free interval from out-side(Set C) andfromwithin (Set D)the seizure generatingarea, andintracranialEEGrecordingsofepilepticseizures(SetE).These

artifact-free records are available under www.meb.unibonn.de/ epileptologie/science/physik/eegdata.html.Thesamplingrateofthe datawas173.61Hz.OnehundredsinglechannelEEGsegmentsof 23.6 s ofduration(4,097datapoints)foreachoneofthefivesets ofdatahavebeenconsidered.Thesesegmentswereselectedfrom continuousmultichannelEEGrecordingsaftervisualinspectionfor artifacts(e.g.muscleactivityoreyemovements).Additionally,only segments that satisfy a weak stationarity criterion were chosen. Further details about the recording technique of theseEEG data canbefoundintheoriginalpaperbyAndrzejak etal.[51].

(6)

Fig. 4.Top:Position ofthepricereturnsofthe forty-eightstockmarket indices intheTarnopolskiplane.Developed,emergingandfrontiercountriesareidentified bybluecircles,greentrianglesandredsquares,respectively.LocationoffGnwith HurstexponentsH∈ {0.05, 0.10, ..., 0.95}andthesamelengthofthepricereturns (n=4,279)hasbeenalsoincluded.Beingmoreprecise,meanandstandard devia-tion(displayedaserrorbars)ofonethousandindependentrealizationsaredepicted inblackcolor.AcontinuousblackcurvejoiningthemeanvaluesforfGnwith differ-entHisincludedforvisualreference.Estimatedvaluesforbothquantifiersdecrease astheHurstexponentoffGnincreases.Bottom:Enlargementforabetterviewof locationsassociatedwithstockmarketindices.(Forinterpretationofthereferences tocolorinthisfigurelegend,thereaderisreferredtothewebversionofthis arti-cle.)

confirmedthat animproved classificationis reachedfor interme-diate resolution scales. In such a case not only healthy (Sets A and B)andpathologicalgroups(SetsC,DandE)arediscriminated, but also differences between interictal (Sets C and D) and ictal activities (Set E) are achieved. Regrettably, interictal epileptiform activities fromtheepileptogenic zone(Set D) andthose found at recordingsitesdistantfromtheepileptogenic zone(SetC)cannot be separated. Last but not least, for theseintermediate temporal scales,thetwohealthygroupsareclearlydistinguished.This differ-entiationbetweentheconditionsofeyesclosedandeyesopenhas notbeenreachedbypreviousimplementedalgorithmsmuchmore complicated, both computationally and conceptually [18,51–53]. Weconsiderthisaveryinteresting findingsinceitiswell-known that alpha waves in a frequency rangeof 8–13 Hz are predomi-nant in relaxed healthy subjects with eyes closed while broader frequencycharacteristicsareobtainedforopeneyes[51].

3.3. Physiologicalandpathologicalbeat-to-beatintervals

Inthis application, heart ratevariability(HRV) ofhealthy and pathological subjects have been analyzed. HRV time series are derived from electrocardiogram signals by measuring

consecu-Fig. 5.LocationofthefivesetsofEEGdataintheTarnopolskiplanefordifferent temporalscalefactors:a)τ=1,b)τ=4,c)τ=7,andd)τ=10.Meanand stan-darddeviation(displayedaserrorbars)ofbothquantifiers,AandT,fortheone hundredsinglechannelEEGsegmentsassociatedwitheachsetareplotted.Groups aredifferentiatedbycolor:SetAinblack,SetBingray,SetCinblue,SetDincyan, andSetEinred.(Forinterpretationofthereferencestocolorinthisfigurelegend, thereaderisreferredtothewebversionofthisarticle.)

Fig. 6.LocationofthethreegroupsofHRVdataintheTarnopolskiplanefor differ-enttemporalscalefactors:a)τ=1,b)τ=5,c)τ=10,d)τ=15,e)τ=20,and f)τ=30.Meanandstandarddeviation(displayedaserrorbars)ofbothquantifiers, AandT,forthefiveBBItimeseriesassociatedwitheachsetareplotted.Groups aredifferentiatedbycolor:NSRinblack,CHFinblue,andAFinred.(For interpre-tationofthereferencestocolorinthisfigurelegend,thereaderisreferredtothe webversionofthisarticle.)

(7)

for

τ

∈ {

1

,

5

,

10

,

15

,

20

,

30

}

. By comparing the location of these groupsin the Tarnopolski plane for the different scale factor, an improvedclassificationisvisuallyconcludedforintermediate tem-poralscales. Beingmoreprecise, thebest separationofthegiven datasetsof thethree groupsisachieved for

τ

=

5.Higher values forbothquantifiersareobtainedfortheAFgroup.Thatis, dynam-icsinHRVofAFsubjectsaremoreirregularandnoisythanthose relatedtoNSRandCHFsubjects,whichisinagreementwith previ-ousfindings[24,54–56].Moreover,itshouldbealsostressed,that for intermediate scale factors (

τ

∈ {

5

,

10

,

15

,

20

}

) cardiovascular dynamicsfromCHFpatientshave,inaverage,lowerestimated val-uesfor

A

and

T

thanthose obtainedfromhealthysubjects.This factis reflectinga more regulardynamics fortheCHF pathologi-cal state,which isconsistent withresultsderived earlierthrough fractalanalysis[10,54,55].

We demonstrate that the Tarnopolski diagram is able to effi-cientlyseparatethethreedifferentgroupsatparticulartimescales, evidencing once againthe usefulness of the proposed multiscale analysis. We should emphasize here that this is a very simple qualitativeanalysis, andstudies withlarger datasetsare required toconfirmthereliabilityoftheproposedmultiscalebidimensional schemeascardiacbiomarker. Particularly, itis worth notinghere thatlocationsofthethreeHRVgroupsdonotseemtofollowthe transition from more regular to more noisy dynamics as

τ

in-creases.Thisisacuriousfindingevenmoreifwetakeintoaccount that resultsobtainedfromEEG time series(please see Fig. 5) do followtheexpectedbehavior.Thereasonsbehindsuchunexpected behaviorarenotclearforusandwillbeinvestigatedthrough fur-therresearchbyanalyzinglargerbeat-to-beatintervalsdatasets.

4. Conclusions

Theperformanceofabidimensionalscheme,obtainedby plot-tingthefractionofturningpointsversustheAbbevalueassociated withatimeseries,hasbeentestedforclassificationpurposes. Sev-eralpracticalapplicationsallowustoassessitsreliability.A multi-scale algorithm is also introduced for capturing relevant aspects of the complex dynamics at different temporal resolutions. We demonstrate that this new multiscale description is required to accuratelydistinguishbetweenphysiologicaltime seriesinhealth anddisease. We guess that the outstanding classification results shownby theTarnopolskiplaneinthepracticalapplicationshave their rootintheexcellent capabilityforseparatingcolored noises (please see the high discriminative power of colored noisesand fGnstochastic processes inFig. 1 andFig. 4, respectively). Obvi-ously, theoreticalarguments to supportthis heuristicobservation arenecessaryandwillbethemainaimofafuturestudy.We con-siderthatresultsobtainedarequiteencouragingandjustifyfurther analysisinother researchfieldsfortestingthepotentialityofthis multiscalebidimensionalapproachasadiscriminativetool.

Acknowledgements

We thank three anonymous reviewers for their careful read-ing of our manuscript and their many insightful comments and suggestions,which greatly helped to improvethe manuscript. LZ and OAR acknowledge financial support from Consejo Nacional deInvestigaciones CientíficasyTécnicas(CONICET),Argentina. FO thankssupportfromPontificiaUniversidadCatólicadeValparaíso.

References

[1] A.Lempel,J.Ziv,Onthecomplexityoffinitesequences,IEEETrans.Inf.Theory 22 (1)(1976)75–81,http://dx.doi.org/10.1109/TIT.1976.1055501.

[2] P.Grassberger,I. Procaccia,Characterization ofstrangeattractors,Phys. Rev. Lett.50 (5)(1983)346–349,http://dx.doi.org/10.1103/PhysRevLett.50.346.

[3] P.Grassberger,Doclimaticattractorsexist?,Nature323(1986)609–612,http:// dx.doi.org/10.1038/323609a0.

[4] A.Wolf,J.B.Swift,H.L.Swinney,J.A.Vastano,DeterminingLyapunovexponents fromatimeseries,PhysicaD16 (3)(1985)285–317,http://dx.doi.org/10.1016/ 0167-2789(85)90011-9.

[5] M.T.Rosenstein,J.J. Collins,C.J.De Luca,A practicalmethodfor calculating largest Lyapunovexponentsfromsmalldatasets,PhysicaD65 (1–2)(1993) 117–134,http://dx.doi.org/10.1016/0167-2789(93)90009-P.

[6] P.Grassberger, I. Procaccia, Estimation ofthe Kolmogorov entropy from a chaotic signal,Phys.Rev.A28 (4)(1983)2591(R),http://dx.doi.org/10.1103/ PhysRevA.28.2591.

[7] S.M.Pincus,Approximateentropyasameasureofsystemcomplexity,Proc. Natl. Acad.Sci.USA88 (6)(1991)2297–2301,http://dx.doi.org/10.1073/pnas. 88.6.2297.

[8]J.S. Richman,J.R.Moorman,Physiologicaltime-seriesanalysisusing approxi-mateentropyandsampleentropy,Am.J.Physiol.,HeartCirc.Physiol.278 (6) (2000)H2039–H2049.

[9] C.Bandt,B.Pompe, Permutationentropy:anaturalcomplexitymeasurefor timeseries,Phys.Rev.Lett.88 (17)(2002)174102,http://dx.doi.org/10.1103/ PhysRevLett.88.174102.

[10] C.-K.Peng, S. Havlin,H.E.Stanley,A.L.Goldberger,Quantificationofscaling exponentsandcrossoverphenomenainnonstationaryheartbeattimeseries, Chaos5 (1)(1995)82–87,http://dx.doi.org/10.1063/1.166141.

[11] P.C. Ivanov, L.A.N.Amaral, A.L. Goldberger,S. Havlin,M.G. Rosenblum, Z.R. Struzik,H.E.Stanley,Multifractalityinhumanheartbeatdynamics,Nature399 (1999)461–465,http://dx.doi.org/10.1038/20924.

[12] R. López-Ruiz, H.L. Mancini, X. Calbet, A statistical measure of com-plexity, Phys. Lett. A 209 (5–6) (1995) 321–326, http://dx.doi.org/10.1016/ 0375-9601(95)00867-5.

[13] O.A.Rosso,H.A.Larrondo,M.T.Martin,A.Plastino,M.A.Fuentes, Distinguish-ingnoisefromchaos,Phys.Rev.Lett.99 (15)(2007)154102,http://dx.doi.org/ 10.1103/PhysRevLett.99.154102.

[14] F.Olivares,A.Plastino,O.A.Rosso,Contrastingchaoswithnoisevialocal ver-susglobalinformationquantifiers,Phys.Lett.A376 (19)(2012)1577–1583, http://dx.doi.org/10.1016/j.physleta.2012.03.039.

[15] O.A.Rosso,H. Craig,P.Moscato,ShakespeareandotherEnglishRenaissance authorsascharacterizedbyInformationTheorycomplexityquantifiers,Physica A388 (6)(2009)916–926,http://dx.doi.org/10.1016/j.physa.2008.11.018. [16] L. Zunino,M.Zanin,B.M. Tabak,D.G.Pérez,O.A.Rosso,Complexity-entropy

causality plane: a useful approach to quantify the stock market ineffi-ciency,PhysicaA389 (9)(2010)1891–1901,http://dx.doi.org/10.1016/j.physa. 2010.01.007.

[17] L.Zunino,A.F.Bariviera,M.B.Guercio,L.B.Martinez,O.A.Rosso,Onthe effi-ciencyofsovereignbondmarkets,Physica391 (18)(2012)4342–4349,http:// dx.doi.org/10.1016/j.physa.2012.04.009.

[18] J.B. Gao, J. Hu, W.W. Tung, Facilitating joint chaos and fractal analysis of biosignalsthroughnonlinearadaptivefiltering,PLoSONE6 (9)(2011)e24331, http://dx.doi.org/10.1371/journal.pone.0024331.

[19] H.V.Ribeiro,L.Zunino,R.S.Mendes,E.K.Lenzi,Complexity-entropycausality plane:ausefulapproachfordistinguishing songs,PhysicaA391 (7) (2012) 2421–2428,http://dx.doi.org/10.1016/j.physa.2011.12.009.

[20] N.Wessel,M.Riedl,J.Kurths,Isthenormalheartrate“chaotic”dueto respi-ration?,Chaos19 (2)(2009)028508,http://dx.doi.org/10.1063/1.3133128. [21] U.Parlitz,S. Berg,S.Luther, A.Schirdewan, J.Kurths,N. Wessel,Classifying

cardiacbiosignalsusingordinalpatternstatisticsandsymbolicdynamics, Com-put.Biol.Med.42 (3)(2012)319–327,http://dx.doi.org/10.1016/j.compbiomed. 2011.03.017.

[22] L.Zunino,H.V.Ribeiro,Discriminatingimagetextureswiththemultiscale two-dimensional complexity-entropy causality plane,Chaos Solitons Fractals 91 (2016)679–688,http://dx.doi.org/10.1016/j.chaos.2016.09.005.

[23] M.Tarnopolski,OntherelationshipbetweentheHurstexponent,theratioof themeansquaresuccessivedifferencetothevariance,andthenumberof turn-ing points,PhysicaA 461(2016) 662–673,http://dx.doi.org/10.1016/j.physa. 2016.06.004.

[24] M.Costa,A.L.Goldberger,C.-K.Peng,Multiscaleentropyanalysisofcomplex physiologictimeseries,Phys.Rev.Lett.89 (6)(2002)068102,http://dx.doi.org/ 10.1103/PhysRevLett.89.068102.

[25] J.vonNeumann,Themeansquaresuccessivedifference,Ann.Math.Stat.12 (2) (1941)153–162,http://dx.doi.org/10.1214/aoms/1177731746.

[26] J. von Neumann, Distribution of the ratio of the mean square successive difference to thevariance, Ann. Math. Stat.12 (4) (1941) 367–395,http:// dx.doi.org/10.1214/aoms/1177731677.

[27] M.G.Kendall,Studiesinthehistoryofprobabilityandstatistics.XXVI:thework ofErnstAbbe,Biometrika 58 (2)(1971)369–373,http://dx.doi.org/10.2307/ 2334525.

[28] N.Mowlavi,Searchingtransientsinlarge-scalesurveys–amethodbasedon theAbbevalue,Astron.Astrophys.568(2014)A78,http://dx.doi.org/10.1051/ 0004-6361/201322648.

(8)

1028 L. Zunino et al. / Physics Letters A 381 (2017) 1021–1028

[30] I.Conradsen,S.Beniczky,K.Hoppe,P.Wolf,H.B.D.Sorensen,Automated al-gorithm for generalizedtonic-clonic epilepticseizure onsetdetection based onsEMGzero-crossingrate,IEEETrans.Biomed.Eng.59 (2)(2012)579–585, http://dx.doi.org/10.1109/TBME.2011.2178094.

[31]H.E.Hurst,Thelong-termstoragecapacityofreservoirs,Trans.Am.Soc.Civ. Eng.116(1951)770–799.

[32] B.B.Mandelbrot,J.W.vanNess,FractionalBrownianmotions,fractionalnoises andapplications,SIAMRev.10 (4)(1968)422–437,http://dx.doi.org/10.1137/ 1010093.

[33] H.A.Makse,S.Havlin,M.Schwartz,H.E.Stanley,Methodforgenerating long-rangecorrelationsfor largesystems,Phys. Rev.E 53 (5)(1996) 5445–5449, http://dx.doi.org/10.1103/PhysRevE.53.5445.

[34] M.Gómez-Extremera,P.Carpena,P.C.Ivanov,P.A.Bernaola-Galván,Magnitude andsignoflong-rangecorrelatedtimeseries:decompositionandsurrogate sig-nalgeneration,Phys. Rev.E 93 (4)(2016)042201, http://dx.doi.org/10.1103/ PhysRevE.93.042201.

[35] J.B.Gao,J.Hu,W.W.Tung,Y.H.Cao,Distinguishingchaosfromnoiseby scale-dependent Lyapunov exponent, Phys. Rev. E 74 (6) (2006) 066204, http:// dx.doi.org/10.1103/PhysRevE.74.066204.

[36]B.Mandelbrot,Thevariationofcertainspeculativeprices,J.Bus.36 (4)(1963) 394–419.

[37] M.Be¸ben,A.Orłowski, Correlationinfinancialtime series:established ver-susemergingmarkets,Eur.Phys.J.B20 (4)(2001)527–530,http://dx.doi.org/ 10.1007/s100510170233.

[38] T.DiMatteo,T.Aste,M.M.Dacorogna,Scalingbehaviorsindifferently devel-opedmarkets,PhysicaA324 (1–2)(2003)183–188,http://dx.doi.org/10.1016/ S0378-4371(02)01996-9.

[39] D.O.Cajueiro,B.M.Tabak,TheHurstexponentovertime:testingtheassertion thatemergingmarketsarebecomingmoreefficient,PhysicaA336 (3–4)(2004) 521–537,http://dx.doi.org/10.1016/j.physa.2003.12.031.

[40] D.O. Cajueiro, B.M. Tabak, Ranking efficiency for emerging markets,Chaos SolitonsFractals22 (2)(2004)349–352,http://dx.doi.org/10.1016/j.chaos.2004. 02.005.

[41] D.O.Cajueiro, B.M.Tabak,RankingefficiencyforemergingmarketsII,Chaos SolitonsFractals23 (2)(2005)671–675,http://dx.doi.org/10.1016/j.chaos.2004. 05.009.

[42] L. Zunino,B.M. Tabak,D.G.Pérez, M. Garavaglia,O.A.Rosso, Inefficiencyin Latin-Americanmarketindices,Eur.Phys.J.B60 (1)(2007)111–121,http:// dx.doi.org/10.1140/epjb/e2007-00316-y.

[43] C.Eom,S.Choi,G.Oh,W.-S.Jung,Hurst exponentandpredictionbasedon weak-formefficientmarkethypothesisofstockmarkets,PhysicaA387 (18) (2008)4630–4636,http://dx.doi.org/10.1016/j.physa.2008.03.035.

[44] E.Martina,E.Rodriguez,R.Escarela-Perez,J.Alvarez-Ramirez,Multiscale en-tropyanalysisofcrudeoilpricedynamics,EnergyEcon.33 (5)(2011)936–947, http://dx.doi.org/10.1016/j.eneco.2011.03.012.

[45] J.Alvarez-Ramirez,E.Rodriguez,J.Alvarez,Amultiscaleentropyapproachfor marketefficiency, Int.Rev.Financ. Anal.21(2012) 64–69,http://dx.doi.org/ 10.1016/j.irfa.2011.12.001.

[46] A.Ortiz-Cruz,E.Rodriguez,C.Ibarra-Valdez,J.Alvarez-Ramirez,Efficiencyof crudeoilmarkets:evidencesfrominformationalentropyanalysis,Energy Pol-icy41(2012)365–373,http://dx.doi.org/10.1016/j.enpol.2011.10.057. [47] L.Zunino,A.F.Bariviera,M.B.Guercio,L.B.Martinez,O.A.Rosso,Monitoringthe

informationalefficiencyofEuropeancorporatebondmarketswithdynamical permutationmin-entropy,PhysicaA456(2016)1–9,http://dx.doi.org/10.1016/ j.physa.2016.03.007.

[48] L.Kristoufek,M.Vosvrda,Measuringcapitalmarketefficiency:globaland lo-calcorrelationsstructure,PhysicaA392 (1)(2013)184–193,http://dx.doi.org/ 10.1016/j.physa.2012.08.003.

[49] L.Kristoufek,M.Vosvrda,Measuringcapitalmarketefficiency:long-term mem-ory,fractaldimensionandapproximateentropy,Eur.Phys.J.B87 (7)(2014) 162,http://dx.doi.org/10.1140/epjb/e2014-50113-6.

[50] L. Kristoufek,M. Vosvrda,Commodityfutures andmarket efficiency,Energy Econ.42(2014)50–57,http://dx.doi.org/10.1016/j.eneco.2013.12.001. [51] R.G.Andrzejak,K.Lehnertz,F.Mormann,C.Rieke,P.David,C.E.Elger,

Indi-cationsofnonlinear deterministicand finitedimensionalstructures intime seriesofbrainelectricalactivity:dependenceonrecordingregionandbrain state,Phys. Rev.E 64 (6)(2001)061907, http://dx.doi.org/10.1103/PhysRevE. 64.061907.

[52] H. Adeli,S. Ghosh-Dastidar, N. Dadmehr,A wavelet-chaos methodology for analysisofEEGsandEEGsubbandstodetectseizureandepilepsy,IEEETrans. Biomed. Eng. 54 (2) (2007) 205–211, http://dx.doi.org/10.1109/TBME.2006. 886855.

[53] L.Zunino,F.Olivares,O.A.Rosso,Permutationmin-entropy:animproved quan-tifierforunveilingsubtletemporalcorrelations,Europhys.Lett.109 (1)(2015) 10005,http://dx.doi.org/10.1209/0295-5075/109/10005.

[54] A.L.Goldberger,L.A.N.Amaral,J.M.Hausdorff,P.C.Ivanov,C.-K.Peng,H.E. Stan-ley,Fractaldynamicsinphysiology:alterationswithdiseaseandaging,Proc. Natl.Acad.Sci.USA99 (Suppl.1)(2002)2466–2472,http://dx.doi.org/10.1073/ pnas.012579499.

[55] J.Alvarez-Ramirez,E.Rodriguez,J.C.Echeverría,Delaysinthehumanheartbeat dynamics,Chaos19 (2)(2009)028502,http://dx.doi.org/10.1063/1.3152005. [56] J. Hu, J. Gao, W.W. Tung, Characterizing heart rate variability by

Referencias

Documento similar

Number of samples (N), range, mean, standard deviation (SD), and coefficient of variation (CV) of the quality parameters for the different training and

75  Table 6.6 Summary of the mean contact angle value for water as a measurement liquid and its standard deviation measured by the DSA-100 for the superhydrophobic sample

Figure 1 presents the results for the mean error of the estimated distance after 2000 iterations for each of the three methods (the first one, the simple triangulation

In this guide for teachers and education staff we unpack the concept of loneliness; what it is, the different types of loneliness, and explore some ways to support ourselves

For a comparison with the mean values which are obtained by averaging over one period of the frequency u, the minimum and maximum values (P2) „and (P2);„are shown in Fig. In

Arithmetic mean, RMSE and standard deviation associated with the error of the calculation of CCT, Ra and Rf for 2·10 4 random SPDs when sensor measurements are calculated

The values in the table 2 are average values of the surface roughness parameters and their standard deviation measured for hornbeam wood in three different altitude classes..

Rain gauge and ground radar measurements are limited to land, and the retrieval of quantitative precipitation estimates from satellite has several problems including the indirectness