iccv2019论文全集9436-band-limited-gaussian-processes-the-sinc-kernel_第1页
iccv2019论文全集9436-band-limited-gaussian-processes-the-sinc-kernel_第2页
iccv2019论文全集9436-band-limited-gaussian-processes-the-sinc-kernel_第3页
iccv2019论文全集9436-band-limited-gaussian-processes-the-sinc-kernel_第4页
iccv2019论文全集9436-band-limited-gaussian-processes-the-sinc-kernel_第5页
已阅读5页,还剩6页未读 继续免费阅读

下载本文档

版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领

文档简介

1、Band-Limited Gaussian Processes: The Sinc Kernel Felipe Tobar Center for Mathematical Modeling Universidad de Chile ftobardim.uchile.cl Abstract We propose a novel class of Gaussian processes (GPs) whose spectra have compact support, meaning that their sample trajectories are almost-surely band limi

2、ted. As a complement to the growing literature on spectral design of covariance kernels, the core of our proposal is to model power spectral densities through a rectangular function, which results in a kernel based on the sinc function with straightforward extensions to non-centred (around zero freq

3、uency) and frequency-varying cases. In addition to its use in regression, the relationship between the sinc kernel and the classic theory is illuminated, in particular, the Shannon-Nyquist theorem is interpreted as posterior reconstruction under the proposed kernel. Additionally, we show that the si

4、nc kernel is instrumental in two fundamental signal processing applications: fi rst, in stereo amplitude modulation, where the non-centred sinc kernel arises naturally. Second, for band-pass fi ltering, where the proposed kernel allows for a Bayesian treatment that is robust to observation noise and

5、 missing data. The developed theory is complemented with illustrative graphic examples and validated experimentally using real-world data. 1Introduction 1.1Spectral representation and Gaussian processes The spectral representation of time series is both meaningful and practical in a plethora of scie

6、n- tifi c domains. From seismology to medical imagining, and from astronomy to audio processing, understanding which fraction of the energy in a time series is contained on a specifi c frequency band is key for, e.g., detecting critical events, reconstruction, and denoising. The literature on spectr

7、al estimation 13,24 enjoys of a long-standing reputation with proven success in real-world applica- tions in discrete-time signal processing and related fi elds. For unevenly-sampled noise-corrupted observations, Bayesian approaches to spectral representation emerged in the late 1980s and early 1990

8、s 4,11,8 , thus reformulating spectral analysis as an inference problem which benefi ts from the machinery of Bayesian probability theory 12. In parallel to the advances of spectral analysis, the interface between probability, statistics and machine learning (ML) witnessed the development of Gaussia

9、n processes (GP, 21), a nonparametric generative model for time series with unparalleled modelling abilities and unique conjugacy properties for Bayesian inference. GPs are the de facto model in the ML community to learn (continuous- time) time series in the presence of unevenly-sampled observations

10、 corrupted by noise. Recent GP models rely on Bochner theorem 2, which indicates that the covariance kernel and power spectral density (PSD) of a stationary stochastic process are Fourier pairs, to construct kernels by direct parametrisation of PSDs to then express the kernel via the inverse Fourier

11、 transform. The precursor of this concept in ML is the spectral-mixture kernel (SM, 32), which models PSDs as Gaussian 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada. RBFs, and its multivariate extensions 29,19. Accordingly, spectral-based sparse GP approx

12、imations 15, 10, 5, 14 also provide improved computational effi ciency. 1.2Contribution and organisation A fundamental object across the signal processing toolkit is the normalised sinc function, defi ned by sinc(x) = sinx x .(1) Its importance stems from its role as the optimal basis for reconstruc

13、tion (in the Shannon-Whittaker sense 31) and the fact that its Fourier transform is the rectangle function, which has compact support. Our hypothesis is that the symbiosis between spectral estimation and GPs can greatly benefi t from the properties of kernels inspired the sinc function, yet this has

14、 not been studied in the context of GPs. In a nutshell we propose to parametrise the PSD by a (non-centred) rectangular function, thus yielding kernels defi ned by a sinc function times a cosine, resembling the SM kernel 32 with the main distinction that the proposed PSD has compact, rather than inf

15、i nite, support. The next section introduces the proposed sinc kernel, its centred/non-centred/frequency-varying versions as well as its connections to sum-of-infi nite-sinusoids models. Section 3 interprets posterior reconstruction using the sinc kernel from the Shannon-Nyquist perspective. Then, S

16、ections 4 and 5 revise the role of the sinc kernel in two signal processing applications: stereo demodulation and band-pass fi ltering. Lastly, Section 6 validates the proposed kernel through numerical experiments with real-world signals and Section 7 presents the future research steps and main conc

17、lusions. 2Compact spectral support via the sinc kernel The Bochner theorem 2 establishes the connection between a (stationary) positive defi nite kernelK and a density S via the Fourier transform F , that is, K(t) = F1S()(t),(2) where the functionS : Rn7 R+is Lebesgue integrable. This result allows

18、us to design a valid positive defi nite functionKby simply choosing a positive functionS(a much easier task), to then anti-Fourier transform it according to eq.(2). This is of particular importance in GPs, where we can identifyKas the covariance kernel andSthe spectral density, therefore, the design

19、 of the GP can be performed in the spectral domain rather than the temporal/spatial one. Though temporal construction is the classical alternative, spectral-based approaches to covariance design have become popular for both scalar and vector-valued processes 32,29,19, and even for nonstationary 22 a

20、nd nonparametric 27, 26 cases. We next focus on GPs that are bandlimited, or in other words, that have a spectral density with compact support based on the sinc kernel. 2.1Construction from the inverse Fourier transform of rectangular spectrum Let us denote the rectangular function given by rect() d

21、ef = 1| 0;0,2 0, and note that the rectangles are allowed to overlap if 20. We can then calculate the kernel associated with the PSD given byS() = simrect0,2()using the standard properties of the Fourier transform, in particular, by identifying the symmetric rectangle function in eq.(4)as a convolut

22、ion between a (centred) rectangle and two Dirac delta functions on 0,0 . We defi ne this kernel as follows. Defi nition 1(The Sinc Kernel).The stationary covariance kernel resulting from the inverse Fourier transform of the symmetric rectangle function in eq. (4) given by SK(t) def = 2sinc(t)cos(20t

23、)(5) is referred to asthe sinc kernelof frequency0 0, bandwidth 0and magnitude2 0. The expressionsinc(t) = sint t is known as the the normalised sinc function, and when0= 0we refer to the above expression as the centred sinc kernel. Being positive defi nite by construction, the sinc kernel can be us

24、ed within a GP for training, inference and prediction. Thus, we implemented a GP with the sinc kernel (henceforthGP-sinc) for the interpolation/extrapolation of a heart-rate time series from the MIT-BIH database 7. Using one third of the data, training the GP-sinc (plus noise variance) was achieved

25、by maximum likelihood, were both the BFGS 33 and Powell 20 optimisers yielded similar results. Fig. 1 shows the leant PSD and kernel alongside the periodogram for comparison, and a sample path for temporal reconstruction and forecasting. We highlight that the sinc function implemented in Python used

26、 in this optimisation was numerically stable for both optimisers and multiple initial conditioned considered. 0.000.010.020.030.040.05 frequency Hz 0 2000 4000 6000 power spectral density pgram learnt psd 1000100 time seconds 0 10 20 30 learnt covariance kernel 02004006008001000 time seconds 10 0 10

27、 observations, ground truth and GP-sinc reconstruction ground truthobservationsGP-sinc Figure 1: Implementation of the sinc kernel on a heart-rate time series. Notice that (i) the learnt kernel shares the same support as the periodogram, (ii) the error bars in the reconstruction are tight, and (iii)

28、 the harmonic content in the forecasting part is consistent with the ground truth. 2.2 Construction from a mixture of infi nite sinusoids Constructing kernels for GP models as a sum of infi nite components is known to aid the interpretation of its hyperparameters 21 . For the sinc kernel, let us con

29、sider an infi nite sum of sines and cosines with frequencies uniformly drawn between0 2 and0+ 2, and with random magnitudes given i.i.d. according to (),() N(0,2). That is, f(t) = Z 0+ 2 0 2 ()sin(2t) + ()cos(2t)d.(6) The kernel corresponding to this zero-mean GP can be calculated using basic proper

30、ties of the Fourier transform, trigonometric identities and the independence of the components magnitudes. This kernel is stationary and given by the sinc kernel defi ned in eq. (5): K(t,t0) = Ef(t)f(t0) = 2sinc(t t0)cos(20(t t0) = SK(t t0).(7) The interpretation of this construction is that the pat

31、hs of a GP-sinc can be understood as having frequency components that are equally present in the range between0 2 and0+ 2. On the contrary, frequency components outside this range have zero probability to appear in the GP-sinc sample paths. In this sense, we say that the sample trajectories of a GP

32、with sinc kernel are almost surely band-limited, where the band is given by ? 0 2,0 + 2 ?. 3 2.3Frequency-varying spectrum The proposed sinc kernel only caters for PSDs that are constant in their (compact) support due to the rectangular model. We extend this construction to band-limited processes wi

33、th a PSD that is a non-constant function of the frequency. This is equivalent to modelling the PSD as S() = simrect0,2()(),(8) where the symmetric rectangle gives the support to the PSD and the functioncontrols the frequency dependency. Notice that the only relevant part ofis that in the support ofs

34、imrect0,2(), furthermore, we assume thatis non-negative, symmetric and continuous almost everywhere (the need for this will be clear shortly). From eq. (8), the proposed sinc kernel can be generalised for the frequency-varying case as GSK(t) def = F1 ?simrect 0,2()() ? = SK(t) ? K(t),(9) referred to

35、 asgeneralised sinc kernel, and whereK(t) = F1() is a positive defi nite function due to (i) the Bochner theorem and (ii) the fact that () is symmetric and nonnegative. The convolution in the above equation can be computed analytically only in a few cases, most notably whenK(t)is either a cosine or

36、another sinc function, two rather limited scenarios. In the general case, we can exploit the fi nite-support property of the PSD in eq.(8)and express it as a sum ofN N narrower disjoint rectangles of width N to defi ne an N-th order approximation of GSK(t) through GSK(t) = N X i=1 F1 n simrect(i) 0

37、, N,2 ()() o N X i=1 ? (i) 0 ? F1 n simrect(i) 0 , N,2 () o (10) = sinc Nt N X i=1 ? (i) 0 ? cos ? 2(i) 0 ? def = GSKN(t) where(i) 0 = 0 N+12i 2N , and the approximation in eq.(10)follows the assumption that() can be approximated by ? (i) 0 ? within (i) 0 2N, (i) 0 + 2N supported by the following re

38、mark. Remark 2.Observe that the expression in eq.(10)can be understood as a Riemann sum using the mid-point value. Therefore, convergence ofGSKN(t)toGSK(t)asN goes to infi nite is guaranteed provided that()is Riemman-integrable, or, equivalently,()is continuous almost everywhere. This is a sound req

39、uirement in our case, since it is a necessity to compute inverse Fourier transforms. 3Relationship to Nyquist frequency and perfect reconstruction The NyquistShannon sampling theorem specifi es a suffi cient condition for perfect, i.e., zero error, reconstruction of band-limited continuous-time sign

40、als using a fi nite number of samples 23,17. Since (i) GPs models are intrinsically related to reconstruction, and (ii) the proposed sinc kernel ensures band-limited trajectories almost surely, we now study the reconstruction property the GP-sinc from a classical signal processing perspective. Let u

41、s focus on the baseband case (0= 0), in which case we obtain thecentredsinc kernel given by SK(t) = 2sinc(t).(11) For a centred GP-sinc,f(t) GP(0,sinc2,0,), the Nyquist frequency is given by the width of its PSD, that is,. The following Proposition establishes the interpretation of Nyquist perfect r

42、econstruction from the perspective of a vanishing posterior variance for a centred GP-sinc. Proposition 3.The posterior distribution of a GP with centred sinc kernel concentrates on the WhittakerShannon interpolation formula 23,31 with zero variance when the observations are noiseless and uniformly-

43、spaced at the Nyquist frequency 17. 4 Proof. Let us fi rst considern Nobservations taken at the Nyquist frequency with timestn= t1,.,tn and values yn= y1,.,yn. With this notation, the posterior GP-sinc is given by p(f(t)|yn) = GP(SK(t,tn)1yn,SK(t,t0) SK(t,tn)1SK(t0,tn),(12) where = SK(tn,tn)is the c

44、ovariance of the observations andSK(t,t)denotes the vector of covariances with the term SK(t,ti) = SK(t ti) in the ith entry. A key step in the proof is to note that the covariance matrixis diagonal. This is because the difference between any two observations times,ti,tj, is a multiple of the invers

45、e Nyquist frequency 1, and the sinc kernel vanishes at all those multiples except fori = j; see eq.(11). Therefore, replacing the inverse matrix1= 2Inand the centred sinc kernel in eq.(11)into eq.(12)allows us to write the posterior mean and variance (choosing t = t0above) respectively as Ef(t)|yn =

46、 n X i=1 yisinc(t ti),Vf(t)|yn = 2 1 n X i=1 sinc2(t ti) ! . (13) For the fi rst part of the proof, we can just applylimnto the posterior mean and readily identify the Shannon-Whittaker interpolation formula: a convolution between the sinc function and the observations. To show that the posterior va

47、riance vanishes asn , we proceed by showing that the Fourier transform of the sum of square sinc functions in eq.(13)converges to a Dirac delta (at zero) of unit magnitude instead, as these are equivalent statements. Denote bytri()the triangular function and observe that F ( X i=1 sinc2(t ti) ) = F

48、?sinc2(t)? F ( X i=1 ti ) conv. def. this is needed in the GP literature. This is because a direct consequence of Proposition 3 is a quantifi cation of the number of observations needed to have zero posterior variance (or reconstruction error). This is instrumental to design sparse GPs where the num

49、ber of inducing variables is chosen with a sound metric in mind: proximity to the Nyquist frequency. Finally, extending the above result to the non-baseband case can be achieved through frequency modulation, the focus of the next section. 4Stereo amplitude modulation with GP-sinc We can investigate

50、the relationship between trajectories of GPs both for non-centredeq.(5) and centredeq.(11) sinc kernels using a latent factor model. Specifi cally, let us consider two i.i.d. GP-sinc processesx1,x2 GP(0,2sinc(t)with centred sinc kernel and construct the factor model x(t) = x1cos(20t) + x2sin(20t).(1

51、5) Observe that, due to independence and linearity, the processxin eq.(15)is a GP with zero mean and covariance given by a non-centred sinc kernel2 Kx(t,t0) = Ex(t)x(t0) = 2sinc(t t0)cos(20(t t0) = SK(t t0).(16) This result can also be motivated by the following decomposition of the sinc kernel: SK(

52、t t0) = ?cos2 0t sin20t ?2 sinc(t t0)0 02sinc(t t0) ?cos2 0t0 sin20t0 ? .(17) 2This follows directly from the identity cos(1 2) = cos(1)cos(2) + sin(1)sin(2)choosing i= 20tifor i = 1,2 5 The above matrix can be interpreted as the covariance of a multioutput GP 16,1, where the two channelsx1,x2are in

53、dependent due to the block-diagonal structure. Then, the trajectories of the non-centred sinc kernel can be simulated by sampling the two channels in this MOGP, multiply one of them by a sine and the other one by a cosine, to fi nally sum them together. The outlined relationship between centred and

54、non-centred sinc trajectories is of particular interest in stereo modulation/demodulation 18 applications from a Bayesian nonparametric perspective. This is because we can identify the two independent draws from the centred sinc kernel as lower frequency signals containing information (such as stere

55、o audio, bivariate sensors, or two-subject sensors) and the deterministic higher frequency sine and cosine signals as a carrier. In this setting, since the paths of a GP-sinc are equal (in probability) to those of the factor model presented in eq.(15), we can consider the GP-sinc as a generative mod

56、el for stereo amplitude modulation. Recall that the very objective in stereo demodulation is to recover the latent information signals, henceforth referred to as channels, at the receivers end from (possibly corrupted) observations. In this regard, the sinc kernel represents a unique contribution, s

57、ince Bayesian signal recovery under noisy/missing observations is naturally handled by GP models. In simple terms, for a stereo modulated signal with carrier frequency 0and bandwidth , the posterior over channels xii=1,2 wrt an observation x (of the modulated signal) is jointly Gaussian and given by

58、 p(xi(t)|x) = GP(K xi,x(t) 1x,Kx i(t t 0) K xi,x(t) 1Kx,x i(t 0), (18) where = SK(t,t)+I2 noiseis the covariance of the observations,Kxi(tt 0)is the prior covariance of channelxi(t), andKxi,x(t)is the covariance between observationsxand channelxi(t)given by Kxi,x(t) = Exi(t)x(t) = 2sinc(t t)cos(20t),(19) where we have used the same notation as eq. (12). Figure2showsanillustrativeimplementationof GP-sincdemodulation, where theassociatedchannels were recovered form non-uniform observations of a sinc-G

温馨提示

  • 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
  • 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
  • 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
  • 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
  • 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
  • 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
  • 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。

评论

0/150

提交评论