Algorithm 9 Energy estimation procedure for individual frequencies
1: procedure EstimateValues(X,χ, L, n, kb 0, k1, δ, p)
2: B ← 1200δ k0k1 3: F ← 10 log1δ
4: G ← (n, B, F )-flat filter . See Definition 2.1
5: F ← {f ∈ [n] : round(kf
1) ∈ L}
6: T ← 10 log2p
7: fort ∈ {1, . . . , T } do
8: ∆ ← uniform random sample from [kn
1]
9: σ ← uniform random sample from odd numbers in [kn
1]
10: U ← HashToBins(X,b χ, G, n, B, σ, ∆)b . of(f ), h(f ) in Definition 4.2
11: Wf(t) ← bG−1o
f(f )Ubh(f )ω−σ∆f for eachf ∈ F
12: Wf ← Mediant(x(t)f ) for each f ∈ F . Separately for the real and imaginary parts
13: returnW
Once we have located the blocks, we need to estimate the frequency values with them. The function EstimateValues in Algorithm 9 performs this task for us via basic hashing techniques.
The following lemma characterizes the guarantee on the output.
Lemma 5.2 (Re-stated from Section 5.1) For any integers(n, k0, k1), list of block indices L, param-etersδ ∈ n1,201
and p ∈ (0, 1/2), and signals X ∈ Cn andχ ∈ Cb n with k bX −χkb 2≥ poly(n)1 kχkb 2, the outputW of the function EstimateValues(X,χ, L, n, kb 0, k1, δ, p) has the following property:
X
f ∈S
j∈LIj
|Wf − ( bX −χ)b f|2 ≤ δ|L|
3k0k bX −χkb 22
with probability at least 1 − p, where Ij is the j-th block. Moreover, the sample complexity is O(k0δk1 log1plog1δ), and if kχkb 0 = O(k0k1), then the runtime is O(k0δk1log1plog1δlog n+k1·|L| log1p).
Proof. LetX0 = X − χ, and let bU be the output of HashToBins and bU∗ its exact counterpart. We start by calculatingWf(t) for an arbitraryf ∈ F:
Wf(t) = bG−1o (σ, ∆) implicitly depending on t.
Bounding err(t)f and errf(t)f : Using Lemma G.1 in Appendix G, we have wherec is used in HashToBins, and c0 is the exponent in the poly(n) notation of the assumption k bX −χkb 2 ≥ poly(n)1 kχkb 2.
In order to characterize |Wf(t)− bXf0|2, we use the following:
|errff|2+ 2|err(t)f | · |errf(t)f | ≤ 4n2(−c+c0)k bX0k22+ 4n−c+c0k bX0k2· |err(t)f |. (117) which follows directly from (116).
Characterizing |Wf(t)− bXf0|2: We have from (114), (115), and (117) that where the last line follows by writing E
|err(t)f |
≤ q
E
|err(t)f |2
via Jensen’s inequality, and then applying (115).
Taking the median: Recall that Wf is the median of T independent random variables, with the median taken separately for the real and imaginary parts. Since |W |2 = |Re(W )|2+ |Im(W )|2, we find that (119) holds true whenWf(t)− bXf0 is replaced by its real or imaginary part. Hence, with probability at least1 − T /2T 1
6t
T /2
, we have |Re(Wf(t)− bXf0)|2 < 160tB k bX0k22, and analogously for the imaginary part. Combining these and applying the union bound, we obtain
P
where we first applied T /2T
≤ 2T, and then the choice T = 10 log2p from Algorithm 9 and the choice ofp ≤ 1/2.
We now bound the error as follows:
|Wf − bXf0|2 ≤ 320
We write the expected value of the second term as Eh|Wf− bXf0|2−320 where we applied the change of variablev = 1 + 320 u
B k bX0k22. By incorporating (120) into this integral,
where the second line is by explicitly evaluating the integral (withT > 2), and the third by T /2−1 ≥ 4 (cf., Algorithm 9).
Summing (121) over F = ∪j∈LIj, we find that the total error is upper bounded as follows:
X
with probability at least1 − p, by Markov’s inequality. The lemma now follows by the choice B =
1200
δ k0k1 in Algorithm 9.
Sample complexity and runtime: To calculate the sample complexity, note that the only opera-tion in the algorithm that takes samples is the call to HashToBins. By Lemma 4.6, and the choices B = 1200k0δk1 andF = 10 log1δ, the sample complexity isO(k0δk1 log1δ) per hashing performed. Since we run the hashing in a loop10 log2p times, this amounts to a total ofO(k0δk1log1δlogp1).
The runtime depends on two operations. The first one is calling HashToBins, whose analysis follows similarly to the aforementioned sample complexity analysis using the assumption kχkb 0 = O(k0k1), but with an extra log n factor compared to the sample complexity, as per Lemma 4.6.
The other operation is computation ofWf(t), which takes unit time for eachf ∈ F. Since the size of |F |= k1· |L|, running it in a loop costs O(k1· |L| logp1). Computing the median is done in linear time which consequently results in |F |T = O(k1· |L| log1p).
References
[AGS03] A. Akavia, S. Goldwasser, and S. Safra, “Proving hard-core predicates using list decoding,”
IEEE Symp. Found. Comp. Sci. (FOCS), vol. 44, pp. 146–159, 2003.
[Aka10] A. Akavia, “Deterministic sparse Fourier approximation via fooling arithmetic progres-sions,” Conf. Learn. Theory (COLT), pp. 381–393, 2010.
[Bac10] F. R. Bach, “Structured sparsity-inducing norms through submodular functions,” in Adv.
Neur. Inf. Proc. Sys. (NIPS), 2010, pp. 118–126.
[BBC14] B. Bah, L. Baldassarre, and V. Cevher, “Model-based sketching and recovery with ex-panders,” in ACM-SIAM Symp. Disc. Alg. (SODA), 2014.
[BBC+16] L. Baldassarre, N. Bhan, V. Cevher, A. Kyrillidis, and S. Satpathi, “Group-sparse model selection: Hardness and relaxations,” IEEE Trans. Inf. Theory, vol. 62, no. 11, pp. 6508–
6534, November 2016.
[BCDH10] R. Baraniuk, V. Cevher, M. Duarte, and C. Hegde, “Model-based compressive sensing,”
IEEE Trans. Inf. Theory, vol. 56, no. 4, pp. 1982–2001, April 2010.
[BCG+12] P. Boufounos, V. Cevher, A. C. Gilbert, Y. Li, and M. J. Strauss, “What’s the frequency, Kenneth?: Sublinear Fourier sampling off the grid,” RANDOM/APPROX, 2012.
[BCW10] R. Baraniuk, V. Cevher, and M. B. Wakin, “Low-dimensional models for dimensionality reduction and signal recovery: A geometric perspective,” Proc. IEEE, vol. 98, no. 6, pp.
959–971, 2010.
[CIHB09] V. Cevher, P. Indyk, C. Hegde, and R. Baraniuk, “Recovery of clustered sparse signals from compressive measurements,” in Int. Conf. Samp. Theory Apps. (SAMPTA), 2009.
[CT06a] E. Candes and T. Tao, “Near-optimal signal recovery from random projections: Universal encoding strategies,” IEEE Trans. Inf. Theory, vol. 52, no. 12, pp. 5406–5425, 2006.
[CT06b] T. M. Cover and J. A. Thomas, Elements of Information Theory. John Wiley & Sons, Inc., 2006.
[DIPW10] K. Do Ba, P. Indyk, E. Price, and D. P. Woodruff, “Lower bounds for sparse recovery,”
ACM-SIAM Symp. Disc. Alg. (SODA), 2010.
[EHC15] M. El Halabi and V. Cevher, “A totally unimodular view of structured sparsity,” in Int.
Conf. Art. Intel. Stats. (AISTATS), May 2015.
[FR13] S. Foucart and H. Rauhut, A Mathematical Introduction to Compressive Sensing.
Springer New York, 2013.
[GGI+02] A. Gilbert, S. Guha, P. Indyk, M. Muthukrishnan, and M. Strauss, “Near-optimal sparse Fourier representations via sampling,” ACM Symp. Theory Comp. (STOC), 2002.
[GHI+13] B. Ghazi, H. Hassanieh, P. Indyk, D. Katabi, E. Price, and L. Shi, “Sample-optimal average-case sparse fourier transform in two dimensions,” in Allerton Conf. Comm., Con-trol, and Comp., 2013.
[GIIS14] A. Gilbert, P. Indyk, M. Iwen, and L. Schmidt, “Recent developments in the sparse Fourier transform: A compressed Fourier transform for big data,” IEEE Sig. Proc. Mag., vol. 31, no. 5, pp. 91–100, 2014.
[GL89] O. Goldreich and L. Levin, “A hard-corepredicate for allone-way functions,” ACM Symp.
Theory Comp. (STOC), pp. 25–32, 1989.
[GMS05] A. Gilbert, M. Muthukrishnan, and M. Strauss, “Improved time bounds for near-optimal space Fourier representations,” SPIE Conference, Wavelets, 2005.
[GST08] A. Gilbert, M. J. Strauss, and J. A. Tropp, “A tutorial on fast Fourier sampling,” IEEE Sig. Proc. Mag., vol. 25, no. 2, pp. 57–66, 2008.
[HAKI12] H. Hassanieh, F. Adib, D. Katabi, and P. Indyk, “Faster GPS via the sparse Fourier transform,” MOBICOM, 2012.
[HIKP12a] H. Hassanieh, P. Indyk, D. Katabi, and E. Price, “Near-optimal algorithm for sparse Fourier transform,” ACM Symp. Theory Comp. (STOC), 2012.
[HIKP12b] H. Hassanieh, P. Indyk, D. Katabi, and E. Price, “Simple and practical algorithm for sparse Fourier transform,” ACM-SIAM Symp. Disc. Alg. (SODA), 2012.
[HIKP12c] H. Hassanieh, P. Indyk, D. Katabi, and E. Price, “Nearly optimal sparse Fourier trans-form,” in Proc. ACM Symp. Theory Comp. (STOC). ACM, 2012, pp. 563–578.
[HKPV13] S. Heider, S. Kunis, D. Potts, and M. Veit, “A sparse Prony FFT,” Samp. Theory Apps.
(SAMPTA), 2013.
[HMM93] T. Hagerup, K. Mehlhorn, and J. I. Munro, “Maintaining discrete probability distributions optimally,” in Int. Coll.. Auto., Lang., and Prog. (ICALP), 1993, pp. 253–264.
[IK14] P. Indyk and M. Kapralov, “Sample-optimal Fourier sampling in any fixed dimension,”
IEEE Symp. Found. Comp. Sci. (FOCS), 2014.
[IKP14] P. Indyk, M. Kapralov, and E. Price, “(Nearly) sample-optimal sparse Fourier transform,”
ACM-SIAM Symp. Disc. Alg. (SODA), 2014.
[IR13] P. Indyk and I. Razenshteyn, “On model-based RIP-1 matrices,” in Int. Coll. Auto., Lang., and Prog. (ICALP), 2013.
[Iwe10] M. A. Iwen, “Combinatorial sublinear-time Fourier algorithms,” Found. Comp. Math., vol. 10, pp. 303–338, 2010.
[Kap16] M. Kapralov, “Sparse Fourier transform in any constant dimension with nearly-optimal sample complexity in sublinear time,” ACM Symp. Theory Comp. (STOC), 2016.
[KM91] E. Kushilevitz and Y. Mansour, “Learning decision trees using the Fourier spectrum,”
ACM Symp. Theory Comp. (STOC), 1991.
[LWC12] D. Lawlor, Y. Wang, and A. Christlieb, “Adaptive sub-linear time Fourier algorithms,”
2012, http://arxiv.org/abs/1207.6368.
[Man92] Y. Mansour, “Randomized interpolation and approximation of sparse polynomials,” Int.
Coll. Auto., Lang., and Prog. (ICALP), 1992.
[PR13] S. Pawar and K. Ramchandran, “Computing a k-sparse n-length discrete Fourier trans-form using at most4k samples and O(k log k) complexity,” IEEE Int. Symp. Inf. Theory (ISIT), 2013.
[PS15] E. Price and Z. Song, “A robust sparse Fourier transform in the continuous setting,” IEEE Symp. Found. Comp. Sci. (FOCS), 2015.
[PW11] E. Price and D. P. Woodruff, “(1 + )-approximate sparse recovery,” IEEE Symp. Found.
Comp. Sci. (FOCS), 2011.
[RV08] M. Rudelson and R. Vershynin, “On sparse reconstruction from Fourier and Gaussian measurements,” Comm. Pure Appl. Math, vol. 61, no. 8, pp. 1025–1171, 2008.