iccv2019论文全集8807-push-pull-feedback-implements-hierarchical-ination-retri-efficiently

上传人：我*** IP属地：北京上传时间：2020-06-17 格式：PDF 页数：10 大小：8.42MB 积分：12 举报 版权申诉

iccv2019论文全集8807-push-pull-feedback-implements-hierarchical-ination-retri-efficiently_第2页

iccv2019论文全集8807-push-pull-feedback-implements-hierarchical-ination-retri-efficiently_第3页

iccv2019论文全集8807-push-pull-feedback-implements-hierarchical-ination-retri-efficiently_第4页

iccv2019论文全集8807-push-pull-feedback-implements-hierarchical-ination-retri-efficiently_第5页

已阅读5页，还剩5页未读，继续免费阅读

版权说明：本文档由用户提供并上传，收益归属内容提供方，若内容存在侵权，请进行举报或认领

文档简介

1、Push-pull Feedback Implements Hierarchical Information Retrieval Effi ciently Xiao Liu1Xiaolong Zou1Zilong Ji2Gengshuo Tian3 Yuanyuan Mi4Tiejun Huang1K. Y. Michael Wong5 Si Wu1 1School of Electronics Engineering and the interaction between the two pathways 33rd Conference on Neural Information Proce

2、ssing Systems (NeurIPS 2019), Vancouver, Canada. accomplishes the recognition task. Based on a similar idea, the theory of predictive coding proposes that the feedback from the higher cortex predicts the output of the lower cortex 22. Although the importance of feedback has been widely recognized in

3、 the fi eld, computational models elucidating how it works exactly remain poorly developed. Interestingly, the experiment data has unveiled a salient characteristic of feedback in the visual system 8,6. Fig.1A displays the neural population activities in V1 when a monkey was performing a contour int

4、egration task 8. In response to the visual stimulus, the neural activity in V1 increased at the early phase, displaying the push characteristic; and decreased at the late phase, displaying the pull characteristic. Multi-unit recording revealed that in the pull phase, there was strong negative feedba

5、ck from the higher cortex (V4) 6; while in the push phase, although the contributions of the feedforward input and feedback were mixed, causality analysis confi rmed that there indeed existed a feedback component 7. ABC Figure 1: A. The push-pull phenomenon of neural population activity. Data was re

6、corded at V1 of an awake monkey. The visual stimulus was a contour embedded in noises, which was onset att = 0. The blue curve shows the change of neural response over time, which increased at the early phase and decreased at the late phase, when the monkey recognized the contour. The red curve repr

7、esents the condition when the monkey did not recognize the contour, and the green curve the condition of no visual stimulus. Adopted from 8. B. A three-layer network storing hierarchical categories of objects, denoted, from bottom to top, as child, parent, and grandparent patterns, respectively. Bet

8、ween layers, neurons are connected by both feedforward and feedback connections. C. The branching tree displaying the categorical relationships between hierarchical memory patterns. The categorization of objects is based on their similarity/dissimilarity either at the image level or in the semantic

9、sense, and it is organized hierarchically, in the sense that objects belonging to the same category are more similar than those belonging to different ones. The experimental data has revealed that the brain encodes these similarities using overlapping neural representations, with the greater the sim

10、ilarity, the larger the correlation between neural responses 26. However, it is well known that neural networks have diffi culties of storing and retrieving correlated memory patterns; a small amount of correlation in the Hopfi eld model will deteriorate memory retrieval dramatically 9. Notably, thi

11、s instability is intrinsic to a neural network, as it utilizes synapses to carry out both information encoding and retrieving: the synaptic strengths are affected by the correlations of stored patterns, which in turn interfere with the retrieval of a stored pattern. Thus, a dilemma is raised to neur

12、al coding: on one hand, to encode the categorical relationships between objects, a neural system needs to exploit correlated neural representations; on the other hand, to retrieve information reliably, these patterns correlations are harmful. How to achieve reliable hierarchical information retrieva

13、l in a neural network remains unresolved in the fi eld 2, 13. In the present study, motivated by the push-pull phenomenon in neural data, we investigate the role of feedback in hierarchical information retrieval. Specifi cally, we consider that a neural system employs a rough-to-fi ne retrieval proc

14、edure, in which higher categorical representations of objects are retrieved fi rst, since they are less correlated than lower categorical ones and hence have better retrieval accuracy; subsequently, through feedback, the retrieved higher categorical information facilitates the retrieval of lower cat

15、egorical representations. We elucidate that the optimal feedback should be dynamical, varying from positive (push) to negative (pull) over time, and they suppress the interferences due to pattern correlations from different and the same categories, respectively. Using synthetic and real data, we dem

16、onstrate that the push-pull feedback implements hierarchical information retrieval effi ciently. 2 2A model for Hierarchical Information Representation To elucidate the role of feedback clearly, we consider a simple model for hierarchical information representation. The model is kept simple to illus

17、trate insights derived from the role played by different sources of interferences during different stages of dynamical retrieval. Specifi cally, the model consists of three layers which store three-level hierarchical memory patterns. For convenience, we call the three layers, from bottom to top, chi

18、ld, parent, and grandparent layers, respectively, to refl ect their ascending category relationships (Fig.1B). Neurons in the same layer are connected recurrently with each other to function as an associative memory. Between layers, neurons communicate via feedforward and feedback connections. Denot

19、e the state of neuroniin layerlat timetasxl i(t)for i = 1,.,N, which takes value of1, the symmetric recurrent connections from neuronjtoiin layerlasWl ij, the feedforward connections from neuronjof layerlto neuroniin layerl + 1as W l+1,l ij and the feedback connections from neuronjof layerl + 1to ne

20、uroniof layerlasW l,l+1 ij . The neuronal dynamics follows the Hopfi eld model 10, which is written as xli(t + 1) = sign ?hl i(t) ? ,(1) wheresign(h) = 1forh 0andsign(h) = 1otherwise.hl i(t)is the total input received by the neuron, which is given by (only the result for layer1is shown, and the resu

21、lts for other layers are similar), h1 i(t) = X j W1 ijx 1 j(t) + X j W 1,2 ij x2 j(t). (2) We generate synthetic patterns to study information retrieval, which are denoted as:for = 1,.,Prepresents the grandparent patterns,for = 1,.,Pthe parent patterns of grandparent, and,for = 1,.,Pthe child patter

22、ns of parent(,), whereP,P, andPare denoted as the number of grandparent patterns, the number of parent patterns belonging to the same grandparent, and the number of child patterns belonging to the same parent, respectively. These hierarchical memory patterns are constructed as follows 1 (Fig.1C). Fi

23、rst, grandparent patterns are statistically independent of each other. The value of each element in a grandparent pattern is drawn from the distribution P( i) = 1 2( i + 1) + 1 2( i 1),(3) where(x) = 1forx = 0and(x) = 0otherwise. Each element of a grandparent has equal probabilities of taking a valu

24、e of 1 or 1. Secondly, for each grandparent pattern, its descending parent patterns are drawn from the distribution P(, i )=(1 + b2 2 )(, i i) + ( 1 b2 2 )(, i + i), (4) where0 b2 0.5 to have the same value as the corresponding element of the grandparent. This establishes the relationship between a

25、grandparent and the parent patterns. Thirdly, for each parent pattern, its descending child patterns are drawn from the distribution P(, i )=(1 + b1 2 )(, i , i ) + (1 b1 2 )(, i + , i ),(5) where 0 0. Other correlation relationships can be obtained similarly (see Sec.1 in Supplementary Information

26、(SI). Each layer of the network behaves as an associative memory. Using the standard Hebbian learning rule, the recurrent connections between neurons in the same layer are constructed to beW1 ij = P , , i , j /N,W2 ij = P , , i , j /N, andW3 ij = P ij/N. The feedforward connections from lower to hig

27、her layers are set to beW 2,1 ij = P , , i , j /N, andW 3,2 ij = P , i , j /N. It is easy to understand the effect of feedforward connections. For example, if layer 1is at the state of the memory pattern0,0,0, then the feedforward input to layer2is given by P jW 2,1 ij , j 0,0 i , which contributes

28、to improving the retrieval of the parent pattern0,0at layer 2. The form of feedback connections is the focus of this study and will be introduced later. To quantify the retrieval performance, we defi ne a macroscopic variablem(t), measuring the overlap between the neural statex(t)and a memory patter

29、n, which is calculated to be 9 (again, for simplicity, only the result for layer 1 is shown), m,(t) = 1 N N X i=1 , i x1 i(t). (6) where1 m,(t) 1represents the retrieval accuracy of the memory pattern, and the larger the value of m, the higher the retrieval accuracy. 3Information retrieval without f

30、eedback To elucidate the role of feedback, it is valuable to fi rst check information retrieval without feedback, and without loss of generality, we focus on layer1. Following the standard stability analysis 9, we consider that the initial state of layer1is a memory pattern,x1(0) = 0,0,0, and invest

31、igate what are the key factors determining the retrieval performance. After one step of iteration, we get the retrieval accuracy, m0,0,0(1)= 1 N N X i=1 0,0,0 i x1 i(1) = 1 N N X i=1 0,0,0 i sign ?h1 i(0) ? , = 1 N N X i=1 sign h 0,0,0 i h1 i(0) i .(7) We see that the retrieval of a memory pattern i

32、s determined by the alignment between the neural input and the memory pattern, which is further written as (see Sec.2 in SI), 0,0,0h1 i(0) = 0,0,0 1 N X j W1 ijx 1 j(0) = 1 + Ci+ f Ci.(8) Here, the input received by the neuron is decomposed into the signal and noise parts, and the latter is further

33、divided into two components,Ciand e Ci, which represent, respectively, the interferences to memory retrieval due to: 1) the correlation of the pattern to be retrieved with siblings from the same parent, called theintra-class interference; 2) the correlation of the pattern to be retrieved with cousin

34、s from the same grandparent but different parents, called theinter-class interference. It can be checked that in the limits of largeN,PandP, the intra- and inter- class interferences, Ciand e Ci, satisfy the distributions,P(Ci) = N(EC,VC)(1 + b1)/2 + N(EC,VC)(1 b1)/2, P(eCi) = N(Ee C,VeC)(1 + b1b2)/

35、2 + N(EeC,VeC)(1 b1b2)/2, where N(E,V )represents a normal distribution with meanEand varianceV, andEC= b3 1(P 1),EeC = b3 1b32P(P 1), VC= b4 1(P 1)(1 b21), VeC = b4 1b42P(P 1)(1 b21)(1 b22) (see Sec.2 in SI). The breadth of the above noise distributions, as a consequence of pattern correlations, im

36、plies that even starting from a noiseless state, the network dynamics still incur retrieving instability 9, and the error occurs when noises are large (i.e., Ci+ e Ci b2. For an associative memory, this implies that given the same amount of input information (e.g., an ambiguous image of a Siamese ca

37、t), the parent pattern (e.g., a cat) can be better retrieved than the child pattern (e.g., a Siamese cat). Thus, we consider a rough-to-fi ne retrieval procedure, in which the parent pattern in layer2 is fi rst retrieved, whose result is subsequently fed back to layer 1 to improve the retrieval of t

38、he child pattern. Below, for the convenience of analysis, we assume that the parent pattern is fi rst perfectly retrieved (m = 1 ) and explore the appropriate form of feedback which can effi ciently utilize the parent information to enhance the retrieval of the child pattern. Later, we carry out sim

39、ulations demonstrating that the model works in general cases when the parent pattern is not perfectly retrieved. 4.1The form and the role of push feedback We fi rst show that a push (positive) feedback of a proper form can suppress the inter-class interference in memory retrieval effectively. Withou

40、t loss of generality, we consider that for a given input, the corresponding child pattern to be retrieved in layer1is0,0,0and that the corresponding parent pattern in layer 2 is 0,0. Consider the push feedback of the below form, W 1,2 ij = 1 NP X , , i , j ,(9) which follows the standard Hebb rule b

41、etween the parent and their child patterns, and its contribution is intuitively understandable. Given that the parent pattern0,0in layer2is retrieved, its push feedback to neuroniin layer1is calculated to be P jW 1,2 ij 0,0 j P 0,0, i /P. Obviously, this positive current increases the activities of

42、all child patterns belonging to the parent, i.e., those 0,0,for any , and it has little infl uence on other child patterns from different parents, i.e., those 0,for 6= 0. Due to the competition between memory patterns in the network dynamics, this effectively suppresses the inter-class interference

43、in memory retrieval (Fig.2A ). -0.1 0 0.1 0.2 0.3 0.4 without feedback with push-feedback m1,1,/P M m1,2,/P M 0.4 0.3 0.2 0.1 0 -0.1 -2-1012 Ci,Ci * 0 0.2 0.4 0.6 P(Ci),P(Ci *) AB ci ci* 0.8 Figure 2: A. Illustrating the effect of push-feedback. The parent pattern is1,1, whose feedback contribution

44、to sibling patterns is measured by P m 1,1,/P, and to cousin patterns is measured by P m 1,2,/P. The results averaged over 100trials are shown. B. Illustrating the effect of pull- feedback. The distributions of the intra-class noiseCiwithout feedback and the noiseC i with the pull-feedback are prese

45、nted (see Eqs.(11,14) in SI). Retrieval errors occur whenCi 1orC i 1 (indicated by the yellow line). The parameters areN = 2000,P= 2,P= 10,P= 70,b1= 0.2, and b2= 0.15. 4.2The form and the role of pull feedback We further show that a pull (negative) feedback of an appropriate form can suppress the in

46、tra-class interference in memory retrieval. Consider the pull feedback of the below form, W 1,2 ij = b1ij.(10) 5 Given the parent pattern0,0in layer2is retrieved, its negative feedback to neuroniin layer 1is calculated to be P jW 1,2 ij 0,0 j = b10,0 i . In the largePlimit, the parent pattern is app

47、roximated to be the mean of its child patterns (Sec.1 in SI), thus, in effect the pull feedback is to subtract a portion of the mean value from sibling patterns. The retrieval accuracy of the target child pattern after applying the pull feedback is calculated to be (Sec.3 in SI), m0,0,0= 1 N N X i=1

48、 sign h 1 + C i + e Ci i ,(11) whereC i Cib10,0,0 i 0,0 i is the new noise term after applying the pull-feedback. As shown in Fig.2B, with the pull feedback, the negative tail of the noise distribution (where retrieval errors occur) is considerably reduced. 4.3The joint effect of the push-pull feedb

49、ack Summarizing the above results, we come to the conclusion that to achieve good information retrieval, the neural feedback needs to be dynamical, exerting the push and pull components at different stages, so that they can suppress the inter- and intra- class interferences, respectively. To better

50、demonstrate the joint effect of the push-pull feedback, we consider a continuous version of the Hopfi eld model, so that the network state changes smoothly and the joint effects of push- and pull- feedbacks are integrated over time (the discrete Hopfi eld model still works, but the overall effect is

51、 less signifi cant). The network dynamics are given by 11 dhn i dt =hn i + X j Wn ijx n j + X k W n,m ik (t)xm k + Iext,n i ,(12) xn i =f(hn i), m,n = 1,2,(13) wherehn i andxn i denote the synaptic input and the fi ring rate of neuroniin layern, respectively, and their relationship is given by a sig

52、moid function,f(x) = arctan(8x)/ + 1/2, therefore 0 xn i 1 .To match the strength of fi ring ratexn i, we also align all the hierarchical patternsi into0,1. The parameteris the time constant. The recurrent and feedforward connections follow the standard Hebb rule as described above. The feedback con

53、nections are slightly modifi ed from Eqs.(9-10) to accommodate positive values of neural activities in the continuous model. They are given by: the push-feedbackW 1,2 ik = a+P P ,( , i hi)(, k hi)/N, witha+being a positive number, and the pull-feedbackW 1,2 ik = ab1ik, withabeing a positive number.I

54、ext i is the external input conveying the object information. The push and pull feedbacks are applied sequentially, with each of them lasting in the time order of( 10 20ms), as suggested by the data 6. For details of the model, see Sec.4.1 in SI. Fig.3 displays a typical example of the memory retrie

55、val process in the network, demonstrating that: 1) the neural population activity at layer1exhibits the push-pull phenomenon, agreeing qualitatively with the experimental data (Fig.3A compared to Fig.1A); 2) the retrieval accuracy of layer1with the push-pull feedback is improved signifi cantly compa

56、red to the case of no feedback (Fig.3B). Interestingly, we note that when the push feedback is applied, the retrieval accuracy of the target child pattern is decreased a little bit. This is due to that the push feedback only aims at reducing the inter-class interference without differentiating sibli

57、ng patterns. We evaluate the performances of the model by varying the amplitude of pattern correlations and confi rm that the push-pull feedback always improves the network performance statistically (Sec.4.2 in SI). 5Applying to Real Images We test our model in the processing of real images. As show

58、n in the top of Fig.4A, the dataset we use consists ofP= 2types of animals, cats and dogs, corresponding to parents in our model. For each type of animal, it is further divided intoP= 9sub-types, corresponding to children. A total of1800 images, withK = 100for each sub-type of animals, are chosen from ImageNet. It has been shown that the neural representations generated by a DNN (after being trained with ImageNet) capture the 6 Time ( ) 0 1 23 4 5 Input Push Pull Time ( )

人人文库> 全部分类> 应用文书

温馨提示

1. 本站所有资源如无特殊说明，都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
2. 本站的文档不包含任何第三方提供的附件图纸等，如果需要附件，请联系上传者。文件的所有权益归上传用户所有。
3. 本站RAR压缩包中若带图纸，网页内容里面会有图纸预览，若没有图纸预览就没有图纸。
4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
5. 人人文库网仅提供信息存储空间，仅对用户上传内容的表现方式做保护处理，对用户上传分享的文档内容本身不做任何修改或编辑，并不能对任何下载内容负责。
6. 下载文件中如有侵权或不适当内容，请与我们联系，我们立即纠正。
7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。

iccv2019论文全集8807-push-pull-feedback-implements-hierarchical-ination-retri-efficiently

文档简介

温馨提示

最新文档

评论

iccv2019论文全集8807-push-pull-feedback-implements-hierarchical-ination-retri-efficiently

文档简介

温馨提示

最新文档

评论

相关文档