• No se han encontrado resultados

Algorithm 9 Energy estimation procedure for individual frequencies

1: procedure EstimateValues(X,χ, L, n, kb 0, k1, δ, p)

2: B ← 1200δ k0k1 3: F ← 10 log1δ

4: G ← (n, B, F )-flat filter . See Definition 2.1

5: F ← {f ∈ [n] : round(kf

1) ∈ L}

6: T ← 10 log2p

7: fort ∈ {1, . . . , T } do

8: ∆ ← uniform random sample from [kn

1]

9: σ ← uniform random sample from odd numbers in [kn

1]

10: U ← HashToBins(X,b χ, G, n, B, σ, ∆)b . of(f ), h(f ) in Definition 4.2

11: Wf(t) ← bG−1o

f(f )Ubh(f )ω−σ∆f for eachf ∈ F

12: Wf ← Mediant(x(t)f ) for each f ∈ F . Separately for the real and imaginary parts

13: returnW

Once we have located the blocks, we need to estimate the frequency values with them. The function EstimateValues in Algorithm 9 performs this task for us via basic hashing techniques.

The following lemma characterizes the guarantee on the output.

Lemma 5.2 (Re-stated from Section 5.1) For any integers(n, k0, k1), list of block indices L, param-etersδ ∈ n1,201

and p ∈ (0, 1/2), and signals X ∈ Cn andχ ∈ Cb n with k bX −χkb 2poly(n)1 kχkb 2, the outputW of the function EstimateValues(X,χ, L, n, kb 0, k1, δ, p) has the following property:

X

f ∈S

j∈LIj

|Wf − ( bX −χ)b f|2 ≤ δ|L|

3k0k bX −χkb 22

with probability at least 1 − p, where Ij is the j-th block. Moreover, the sample complexity is O(k0δk1 log1plog1δ), and if kχkb 0 = O(k0k1), then the runtime is O(k0δk1log1plog1δlog n+k1·|L| log1p).

Proof. LetX0 = X − χ, and let bU be the output of HashToBins and bU its exact counterpart. We start by calculatingWf(t) for an arbitraryf ∈ F:

Wf(t) = bG−1o (σ, ∆) implicitly depending on t.

Bounding err(t)f and errf(t)f : Using Lemma G.1 in Appendix G, we have wherec is used in HashToBins, and c0 is the exponent in the poly(n) notation of the assumption k bX −χkb 2poly(n)1 kχkb 2.

In order to characterize |Wf(t)− bXf0|2, we use the following:

|errff|2+ 2|err(t)f | · |errf(t)f | ≤ 4n2(−c+c0)k bX0k22+ 4n−c+c0k bX0k2· |err(t)f |. (117) which follows directly from (116).

Characterizing |Wf(t)− bXf0|2: We have from (114), (115), and (117) that where the last line follows by writing E

|err(t)f |

≤ q

E

|err(t)f |2

via Jensen’s inequality, and then applying (115).

Taking the median: Recall that Wf is the median of T independent random variables, with the median taken separately for the real and imaginary parts. Since |W |2 = |Re(W )|2+ |Im(W )|2, we find that (119) holds true whenWf(t)− bXf0 is replaced by its real or imaginary part. Hence, with probability at least1 − T /2T  1

6t

T /2

, we have |Re(Wf(t)− bXf0)|2 < 160tB k bX0k22, and analogously for the imaginary part. Combining these and applying the union bound, we obtain

P

where we first applied T /2T 

≤ 2T, and then the choice T = 10 log2p from Algorithm 9 and the choice ofp ≤ 1/2.

We now bound the error as follows:

|Wf − bXf0|2 ≤ 320

We write the expected value of the second term as Eh |Wf− bXf0|2−320 where we applied the change of variablev = 1 + 320 u

B k bX0k22. By incorporating (120) into this integral,

where the second line is by explicitly evaluating the integral (withT > 2), and the third by T /2−1 ≥ 4 (cf., Algorithm 9).

Summing (121) over F = ∪j∈LIj, we find that the total error is upper bounded as follows:

X

with probability at least1 − p, by Markov’s inequality. The lemma now follows by the choice B =

1200

δ k0k1 in Algorithm 9.

Sample complexity and runtime: To calculate the sample complexity, note that the only opera-tion in the algorithm that takes samples is the call to HashToBins. By Lemma 4.6, and the choices B = 1200k0δk1 andF = 10 log1δ, the sample complexity isO(k0δk1 log1δ) per hashing performed. Since we run the hashing in a loop10 log2p times, this amounts to a total ofO(k0δk1log1δlogp1).

The runtime depends on two operations. The first one is calling HashToBins, whose analysis follows similarly to the aforementioned sample complexity analysis using the assumption kχkb 0 = O(k0k1), but with an extra log n factor compared to the sample complexity, as per Lemma 4.6.

The other operation is computation ofWf(t), which takes unit time for eachf ∈ F. Since the size of |F |= k1· |L|, running it in a loop costs O(k1· |L| logp1). Computing the median is done in linear time which consequently results in |F |T = O(k1· |L| log1p).

References

[AGS03] A. Akavia, S. Goldwasser, and S. Safra, “Proving hard-core predicates using list decoding,”

IEEE Symp. Found. Comp. Sci. (FOCS), vol. 44, pp. 146–159, 2003.

[Aka10] A. Akavia, “Deterministic sparse Fourier approximation via fooling arithmetic progres-sions,” Conf. Learn. Theory (COLT), pp. 381–393, 2010.

[Bac10] F. R. Bach, “Structured sparsity-inducing norms through submodular functions,” in Adv.

Neur. Inf. Proc. Sys. (NIPS), 2010, pp. 118–126.

[BBC14] B. Bah, L. Baldassarre, and V. Cevher, “Model-based sketching and recovery with ex-panders,” in ACM-SIAM Symp. Disc. Alg. (SODA), 2014.

[BBC+16] L. Baldassarre, N. Bhan, V. Cevher, A. Kyrillidis, and S. Satpathi, “Group-sparse model selection: Hardness and relaxations,” IEEE Trans. Inf. Theory, vol. 62, no. 11, pp. 6508–

6534, November 2016.

[BCDH10] R. Baraniuk, V. Cevher, M. Duarte, and C. Hegde, “Model-based compressive sensing,”

IEEE Trans. Inf. Theory, vol. 56, no. 4, pp. 1982–2001, April 2010.

[BCG+12] P. Boufounos, V. Cevher, A. C. Gilbert, Y. Li, and M. J. Strauss, “What’s the frequency, Kenneth?: Sublinear Fourier sampling off the grid,” RANDOM/APPROX, 2012.

[BCW10] R. Baraniuk, V. Cevher, and M. B. Wakin, “Low-dimensional models for dimensionality reduction and signal recovery: A geometric perspective,” Proc. IEEE, vol. 98, no. 6, pp.

959–971, 2010.

[CIHB09] V. Cevher, P. Indyk, C. Hegde, and R. Baraniuk, “Recovery of clustered sparse signals from compressive measurements,” in Int. Conf. Samp. Theory Apps. (SAMPTA), 2009.

[CT06a] E. Candes and T. Tao, “Near-optimal signal recovery from random projections: Universal encoding strategies,” IEEE Trans. Inf. Theory, vol. 52, no. 12, pp. 5406–5425, 2006.

[CT06b] T. M. Cover and J. A. Thomas, Elements of Information Theory. John Wiley & Sons, Inc., 2006.

[DIPW10] K. Do Ba, P. Indyk, E. Price, and D. P. Woodruff, “Lower bounds for sparse recovery,”

ACM-SIAM Symp. Disc. Alg. (SODA), 2010.

[EHC15] M. El Halabi and V. Cevher, “A totally unimodular view of structured sparsity,” in Int.

Conf. Art. Intel. Stats. (AISTATS), May 2015.

[FR13] S. Foucart and H. Rauhut, A Mathematical Introduction to Compressive Sensing.

Springer New York, 2013.

[GGI+02] A. Gilbert, S. Guha, P. Indyk, M. Muthukrishnan, and M. Strauss, “Near-optimal sparse Fourier representations via sampling,” ACM Symp. Theory Comp. (STOC), 2002.

[GHI+13] B. Ghazi, H. Hassanieh, P. Indyk, D. Katabi, E. Price, and L. Shi, “Sample-optimal average-case sparse fourier transform in two dimensions,” in Allerton Conf. Comm., Con-trol, and Comp., 2013.

[GIIS14] A. Gilbert, P. Indyk, M. Iwen, and L. Schmidt, “Recent developments in the sparse Fourier transform: A compressed Fourier transform for big data,” IEEE Sig. Proc. Mag., vol. 31, no. 5, pp. 91–100, 2014.

[GL89] O. Goldreich and L. Levin, “A hard-corepredicate for allone-way functions,” ACM Symp.

Theory Comp. (STOC), pp. 25–32, 1989.

[GMS05] A. Gilbert, M. Muthukrishnan, and M. Strauss, “Improved time bounds for near-optimal space Fourier representations,” SPIE Conference, Wavelets, 2005.

[GST08] A. Gilbert, M. J. Strauss, and J. A. Tropp, “A tutorial on fast Fourier sampling,” IEEE Sig. Proc. Mag., vol. 25, no. 2, pp. 57–66, 2008.

[HAKI12] H. Hassanieh, F. Adib, D. Katabi, and P. Indyk, “Faster GPS via the sparse Fourier transform,” MOBICOM, 2012.

[HIKP12a] H. Hassanieh, P. Indyk, D. Katabi, and E. Price, “Near-optimal algorithm for sparse Fourier transform,” ACM Symp. Theory Comp. (STOC), 2012.

[HIKP12b] H. Hassanieh, P. Indyk, D. Katabi, and E. Price, “Simple and practical algorithm for sparse Fourier transform,” ACM-SIAM Symp. Disc. Alg. (SODA), 2012.

[HIKP12c] H. Hassanieh, P. Indyk, D. Katabi, and E. Price, “Nearly optimal sparse Fourier trans-form,” in Proc. ACM Symp. Theory Comp. (STOC). ACM, 2012, pp. 563–578.

[HKPV13] S. Heider, S. Kunis, D. Potts, and M. Veit, “A sparse Prony FFT,” Samp. Theory Apps.

(SAMPTA), 2013.

[HMM93] T. Hagerup, K. Mehlhorn, and J. I. Munro, “Maintaining discrete probability distributions optimally,” in Int. Coll.. Auto., Lang., and Prog. (ICALP), 1993, pp. 253–264.

[IK14] P. Indyk and M. Kapralov, “Sample-optimal Fourier sampling in any fixed dimension,”

IEEE Symp. Found. Comp. Sci. (FOCS), 2014.

[IKP14] P. Indyk, M. Kapralov, and E. Price, “(Nearly) sample-optimal sparse Fourier transform,”

ACM-SIAM Symp. Disc. Alg. (SODA), 2014.

[IR13] P. Indyk and I. Razenshteyn, “On model-based RIP-1 matrices,” in Int. Coll. Auto., Lang., and Prog. (ICALP), 2013.

[Iwe10] M. A. Iwen, “Combinatorial sublinear-time Fourier algorithms,” Found. Comp. Math., vol. 10, pp. 303–338, 2010.

[Kap16] M. Kapralov, “Sparse Fourier transform in any constant dimension with nearly-optimal sample complexity in sublinear time,” ACM Symp. Theory Comp. (STOC), 2016.

[KM91] E. Kushilevitz and Y. Mansour, “Learning decision trees using the Fourier spectrum,”

ACM Symp. Theory Comp. (STOC), 1991.

[LWC12] D. Lawlor, Y. Wang, and A. Christlieb, “Adaptive sub-linear time Fourier algorithms,”

2012, http://arxiv.org/abs/1207.6368.

[Man92] Y. Mansour, “Randomized interpolation and approximation of sparse polynomials,” Int.

Coll. Auto., Lang., and Prog. (ICALP), 1992.

[PR13] S. Pawar and K. Ramchandran, “Computing a k-sparse n-length discrete Fourier trans-form using at most4k samples and O(k log k) complexity,” IEEE Int. Symp. Inf. Theory (ISIT), 2013.

[PS15] E. Price and Z. Song, “A robust sparse Fourier transform in the continuous setting,” IEEE Symp. Found. Comp. Sci. (FOCS), 2015.

[PW11] E. Price and D. P. Woodruff, “(1 + )-approximate sparse recovery,” IEEE Symp. Found.

Comp. Sci. (FOCS), 2011.

[RV08] M. Rudelson and R. Vershynin, “On sparse reconstruction from Fourier and Gaussian measurements,” Comm. Pure Appl. Math, vol. 61, no. 8, pp. 1025–1171, 2008.

Documento similar