Bibexcel is designed as a toolbox for manipulating bibliographic data. The results of your manipulations are saved in files that can be opened with Excel or any other software that reads tab-delimited text files. Bibexcel lets you combine information from several fields of a document record, and count frequencies, co-occurrences and shared units (bibliographic coupling). Among other things, there is also a procedure for finding citation links among the documents within a given set. Above all, the tools can be combined: the result of using them depends far more on your own imagination than on the tools themselves.

To understand what Bibexcel can be used for, imagine the following situation. You have decided to analyze the development of memory research. Some basic questions might be:

Who are the most productive authors, universities or countries?
Which journals are used for publishing memory research?

Using Social Sciences Citation Index, SSCI-CDE (compact disk edition), you have downloaded a set of records that contain the word memory in the title field. A total of 246 records are saved in a file called memory.doc. The first record looks like this:

FN- Social Sciences Citation Index (Jan 81 - Dec 85)
GA- AHQ96|
TI- MEMORY ACCESSIBILITY AND TASK INVOLVEMENT AS FACTORS IN CHOICE|
LA- ENGLISH|
AU- GARDIAL SF; BIEHAL GJ|
CS- UNIV HOUSTON/HOUSTON/TX/77004|
JN- ADVANCES IN CONSUMER RESEARCH, 1985, V12, P414-419|
PY- 1985|
DT- ARTICLE|
NR- 17|
CR- BATRA R, 1983, V10, P309, ADV CONSUM RES
    BETTMAN JR, 1979, INFORMATION PROCESSI
    BETTMAN JR, 1980, V7, P148, ADV CONSUMER RES
    BETTMAN JR, 1980, V7, P234, J CONSUMER RES
    BIEHAL G, 1982, V8, P431, J CONSUMER RES
    BIEHAL G, 1983, V10, P1, J CONSUM RES
    COLLINS AM, 1975, V82, P407, PSYCHOL REV
    CRAIK FIM, 1972, V11, P671, J VERB LEARN VERB BE
    EAGLE M, 1964, V68, P58, J EXP PSYCHOL
    JOHNSON EJ, 1984, V11, P542, J CONSUMER RES
    LEAVITT C, 1981, V8, P15, ADV CONSUMER RES
    MITCHELL AA, 1981, V8, P25, ADV CONSUMER RES
    PETTY RE, 1981, COGNITIVE RESPONSES
    PETTY RE, 1983, V10, P135, J CONSUM RES
    TULVING E, 1966, V5, P381, J VERB LEARN VERB BE
    TULVING E, 1971, V87, P1, J EXP PSYCHOL
    TULVING E, 1973, V80, P352, PSYCHOL REV|

The record has a number of fields, each ending with a spike (|), and the record ends with an extra spike at the end of the last field. Each field starts with a tag in the first four columns, TI- for title etc. A field may have several parts (such as authors); we will call such parts units. Record and document mean the same thing from now on.

To find the most productive authors, Bibexcel will read the AU-field and save that information in a file named memory.out that looks like this (see Preparing the data):

1	BIEHAL GJ
1	GARDIAL SF
2	TULVING E
3	MOSCOVITCH M
4	BACKMAN L
4	HARDY J
4	NILSSON LG
4	WINBLAD B
5	WEINGARTNER H
6	ROBERTS JV
cont.

The first two lines contain the authors of document nr 1; the document number is listed first and then the author name. A tab separates the two columns, which enables you to open the file in Excel. Also note that the author names within a document are sorted in alphabetic order, to enable more effective analysis later on. But don't worry, if you want to keep the ordering of author names there are tricks to do it!

And finally, who has published the most? Well, Bibexcel reads memory.out and sorts it by author, then counts the number of occurrences and writes the frequencies to a file called memory.cit, which looks like this (see Frequency distribution):

5	KAUSLER DH
4	KASZNIAK AW
4	ACKERMAN BP
4	WILSON RS
4	GLOVER JA
3	PARKIN AJ
3	JACOBY LL
3	WOLTERS G
3	PUCKETT JM
3	BACON LD
3	TILL RE
3	TULVING E
3	YESAVAGE JA

A number of other procedures or tools are available both for editing and analyzing your records. In fact there is hardly any limit to what the tools and their combination can achieve.

I do hope that you have got the general idea! There is much more to learn! Bibexcel offers a number of tools for extracting and analyzing various fields.


Bibexcel help

Preparing the data/making the outfile

The drive, directory and file list boxes can be used to select the file that you wish to analyze. When clicked, the selected file appears in the label box under the file list. This file can be viewed by clicking on the View-button, at least the first 500 rows or so.

You should first select which field to analyze. Choose one of the options and then press the Prep-button. When done, the program makes an out-file (filename.out) that has one row for each unit in a field. The row begins with the document number.

CR- Cited reference
This option makes an out-file with the cited references looking like this:

1	BATRA R, 1983, V10, P309, ADV CONSUM RES
1	BETTMAN JR, 1979, INFORMATION PROCESSI
1	BETTMAN JR, 1980, V7, P148, ADV CONSUMER RES
1	BETTMAN JR, 1980, V7, P234, J CONSUMER RES
1	BIEHAL G, 1982, V8, P431, J CONSUMER RES
1	BIEHAL G, 1983, V10, P1, J CONSUM RES
1	COLLINS AM, 1975, V82, P407, PSYCHOL REV
1	CRAIK FIM, 1972, V11, P671, J VERB LEARN VERB BE
1	EAGLE M, 1964, V68, P58, J EXP PSYCHOL
1	JOHNSON EJ, 1984, V11, P542, J CONSUMER RES
1	LEAVITT C, 1981, V8, P15, ADV CONSUMER RES
1	MITCHELL AA, 1981, V8, P25, ADV CONSUMER RES
1	PETTY RE, 1981, COGNITIVE RESPONSES
1	PETTY RE, 1983, V10, P135, J CONSUM RES
1	TULVING E, 1966, V5, P381, J VERB LEARN VERB BE
1	TULVING E, 1971, V87, P1, J EXP PSYCHOL
1	TULVING E, 1973, V80, P352, PSYCHOL REV
2	Here comes the first reference in document nr 2.

Any ; separated field
Some fields have several units: several authors in the author field, or several addresses in the corporate source field. Now, if we want to separate them, we can make one row for each unit. First you must write the relevant field tag in the box Old tag, down to the left. Then press the Prep-button. The filename.out may look like this if you type au in the Old tag-box:

1	BIEHAL GJ
1	GARDIAL SF
2	TULVING E
3	MOSCOVITCH M
4	BACKMAN L
4	HARDY J
4	NILSSON LG
4	WINBLAD B
5	WEINGARTNER H
6	ROBERTS JV
cont.
636	RAAIJMAKERS JGW
637	ROEDIGER HL

The first document is obviously co-authored.

JN- Journal
Choosing this will make one row for each document with the journal name:

1	ADVANCES IN CONSUMER RESEARCH

Journal, year, vol, page.
This option makes one row with the complete journal source data:

1	ADVANCES IN CONSUMER RESEARCH, 1985, V12, P414-419

Blank-separated words (e.g. title).
This option makes an out-file in which the words of the title field, or of any other field, are split at blanks and then sorted:

1	ACCESSIBILITY
1	CHOICE
1	FACTORS
1	INVOLVEMENT
1	MEMORY
1	TASK
2	HOW
2	MANY
2	MEMORY-SYSTEMS
2	THERE
3	IMPLICATIONS
3	INFANCY
3	MEMORY
3	MEMORY
3	NORMAL
3	OLD-AGE
3	PATHOLOGICAL
3	THEORIES

A list of stop words prevents listing of less significant words and characters. If a word is found in this string it will be deleted:

a an and are as at be but by for from had have he her his in is it not of on or that the this to was which with you - 1 2 3 4 5 6 7 8 9

To remove more insignificant words, import the out-file into Excel and use a pivot table to find word frequencies, or calculate frequencies with Bibexcel. Save the rows containing the words you really want to include in an unt-file (filename.unt). Keep the frequencies in the left column and the unit in the second! Then use Analyze/Co-occurrence/Select units from file to make the selection.

Whole field intact.
This option makes one row for the whole field. You must set the field tag in Old Tag first! This is the result when AU is written in Old Tag:

1	GARDIAL SF; BIEHAL GJ


Frequency distribution

Below the file list you may choose to calculate frequencies from the out-file, or any other file with the same format, using the various units in the list. You may also decide if the frequency list should be sorted in descending order by frequency: check Sort descending. In some cases you may wish to remove duplicates in a field, for example when counting papers by country. If so, check Remove duplicates. Frequencies lower than the value typed in the box Min number are ignored!

When selecting the type of unit, you must have a relevant out-file. Some of the options refer to cited references and thus need a corresponding out-file based on the cr-field. Similarly, main organization and country need an out-file based on the cs-field.

The frequency distribution will be saved in a cit-file (filename.cit).

Suppose you have the memory.out looking like this, based on the au-field:

1	BIEHAL GJ
1	GARDIAL SF
2	TULVING E
3	MOSCOVITCH M
4	BACKMAN L
4	HARDY J
4	NILSSON LG
4	WINBLAD B
5	WEINGARTNER H
6	ROBERTS JV
...

then memory.cit will look like this (Sort descending is checked, and Whole string is chosen):

5	KAUSLER DH
4	KASZNIAK AW
4	ACKERMAN BP
4	WILSON RS
4	GLOVER JA
3	PARKIN AJ
3	JACOBY LL
3	WOLTERS G
3	PUCKETT JM
3	BACON LD
3	TILL RE
3	TULVING E
3	YESAVAGE JA

Make a new out-file
Bibexcel allows you to make a new out-file using the units defined in the same list that is used for frequency calculations. For example, you may need an out-file with country names instead of whole addresses. Then you just choose Country from the list, and then check Make new outfile. If you wish to remove duplicate countries (or any other unit) within a field, also check Remove duplicates.

The new out-file will be named filename.oux.

Suppose you made memory.out using the CR- cited reference option. Then the units of the first document will look like this (one row for each cited reference):

1	BATRA R, 1983, V10, P309, ADV CONSUM RES
1	BETTMAN JR, 1979, INFORMATION PROCESSI
1	BETTMAN JR, 1980, V7, P148, ADV CONSUMER RES
1	BETTMAN JR, 1980, V7, P234, J CONSUMER RES
1	BIEHAL G, 1982, V8, P431, J CONSUMER RES
1	BIEHAL G, 1983, V10, P1, J CONSUM RES
1	COLLINS AM, 1975, V82, P407, PSYCHOL REV
1	CRAIK FIM, 1972, V11, P671, J VERB LEARN VERB BE
1	EAGLE M, 1964, V68, P58, J EXP PSYCHOL
1	JOHNSON EJ, 1984, V11, P542, J CONSUMER RES
1	LEAVITT C, 1981, V8, P15, ADV CONSUMER RES
1	MITCHELL AA, 1981, V8, P25, ADV CONSUMER RES
1	PETTY RE, 1981, COGNITIVE RESPONSES
1	PETTY RE, 1983, V10, P135, J CONSUM RES
1	TULVING E, 1966, V5, P381, J VERB LEARN VERB BE
1	TULVING E, 1971, V87, P1, J EXP PSYCHOL
1	TULVING E, 1973, V80, P352, PSYCHOL REV

If you select memory.out, choose the option cited author and check Make new outfile and Remove duplicates, then the new file memory.oux will look like this (note that the duplicate authors Bettman, Biehal, Petty and Tulving have been reduced to one row each):

1	BATRA R
1	BETTMAN JR
1	BIEHAL G
1	COLLINS AM
1	CRAIK FIM
1	EAGLE M
1	JOHNSON EJ
1	LEAVITT C
1	MITCHELL AA
1	PETTY RE
1	TULVING E

Fractionalize
If a field contains several units, for example authors, you may wish to give each unit a fraction. If there are three authors, each would get 1/3 as a fraction rather than 1 when counting frequencies. Fractionalization makes the sum of fractions equal to the sum of documents. If you don't fractionalize multi-authored documents you cannot say that author x has produced 10 out of 100 papers, or 10 percent of the papers. What you could say is that author x is found among the authors in 10 percent of the papers or, if the base is the sum of authorships, that author x has 10 of, let's say, 300 authorships.

If you take the memory.oux above with Make new outfile and Fractionalize checked, the memory.oux will look like this, in which the sum of fractions will be 1 (in fact 1.001, because each fraction is rounded):

1	BATRA R	0.091
1	BETTMAN JR	0.091
1	BIEHAL G	0.091
1	COLLINS AM	0.091
1	CRAIK FIM	0.091
1	EAGLE M	0.091
1	JOHNSON EJ	0.091
1	LEAVITT C	0.091
1	MITCHELL AA	0.091
1	PETTY RE	0.091
1	TULVING E	0.091

or, if you do not check Make new outfile, the memory.cit file would have the following fraction sums, sorted in descending order via Excel:

14.442	CRAIK FIM
3.718	TULVING E
1.501	ATKINSON RC
1.559	BADDELEY AD
1.577	HASHER L
2.078	BOWER GH
1.686	JACOBY LL
1.727	KINTSCH W
1.889	PERLMUTTER M
1.926	EYSENCK MW
1.523	BRANSFORD JD
1.403	PAIVIO A
0.558	ROGERS TB
1.025	CERMAK LS
0.554	NEISSER U
0.987	SMITH AD

Compared to non-fractionalized frequencies:

243	CRAIK FIM
78	TULVING E
50	BOWER GH
48	BADDELEY AD
47	JACOBY LL
41	KINTSCH W
37	EYSENCK MW
36	UNDERWOOD BJ
35	ANDERSON JR
35	BRANSFORD JD
34	PERLMUTTER M
34	ATKINSON RC
33	HASHER L
30	PAIVIO A


Units per record

If you wish to know how many units there are in a given field in the whole doc-file you may use this procedure. If you want to know the degree of co-authorship, this procedure takes the out-file (or a corresponding file), counts the number of co-authored papers, and also tells you how many papers have one author, and so on. The results are saved in filename.mul.

Example
If you select memory.doc and then make an out-file by choosing Any ; separated field, typing AU in the Old Tag-box and pressing Prep, an out-file with authors is produced. Then select memory.out and go to Analyze/Units per record. memory.mul will look like this:

N of records	with n units
85	1
95	2
42	3
14	4
8	5
2	6
246 docs and 509 units counted

There are 85 records with one author, 95 with two authors and so on.

The memory.mut file will look like this, with one row for each document:

1	2
2	1
3	1
4	4
5	1
6	1

Document number 1 has 2 authors, nr 2 and 3 have one etc.

Lotka-like distributions
Although this routine was not initially intended for it, you may create a distribution that tells you how many authors have written one paper, two papers etc. Similarly, taking the memory.cit file containing a list of most cited authors:

243	CRAIK FIM
78	TULVING E
50	BOWER GH
48	BADDELEY AD
47	JACOBY LL
41	KINTSCH W
37	EYSENCK MW
36	UNDERWOOD BJ
35	ANDERSON JR
35	BRANSFORD JD
34	PERLMUTTER M
34	ATKINSON RC
33	HASHER L
30	PAIVIO A
...

the memory.mut will look like this, one author being cited in 243 articles and so on to the last row, which tells us that 1901 authors have been cited by only 1 article:

243	1
78	1
50	1
48	1
47	1
41	1
37	1
36	1
35	2
34	2
33	1
30	1
29	1
28	1
26	3
23	3
22	4
21	3
20	4
18	4
17	6
16	4
15	4
14	5
13	8
12	10
11	9
10	10
9	23
8	24
7	41
6	37
5	61
4	78
3	147
2	363
1	1901

According to Lotka's inverse-square law of productivity, the number of authors receiving x citations should be proportional to 1/x^2. Applying this to our data (the estimate for x is 1901/x^2, rounded to a whole number), the estimated distribution comes quite close to the observed:

x	Obs.	Est.
1	1901	1901
2	363	475
3	147	211
4	78	119
5	61	76
6	37	53
7	41	39
8	24	30
9	23	23
10	10	19
11	9	16
12	10	13
13	8	11
14	5	10
15	4	8
16	4	7
17	6	7
18	4	6
20	4	5
21	3	4
22	4	4
23	3	4
26	3	3
28	1	2
29	1	2
30	1	2
33	1	2
34	2	2
35	2	2
36	1	1
37	1	1
41	1	1
47	1	1
48	1	1
50	1	1
78	1	0
243	1	0


Co-occurrence analysis

How often do authors collaborate, which are the most co-cited documents, are some journals more co-cited than others? Such questions call for a co-occurrence analysis.

The starting point will be an out-file (filename.out) or any corresponding file with the same structure as the out-file. Since the number of pairs that can be formed grows quickly with the number of documents and the units within them (a document with n units yields n(n-1)/2 pairs), you should first decide which units to work with.

This can be done in two ways:

Select units via listbox
Put the cit-file (frequencies) in The List and select the most frequent units, and then make the pairs by clicking on: Make pairs via listbox.

This example shows the memory.coc file that will contain the pairs of most cited authors:

77	CRAIK FIM	TULVING E
50	BOWER GH	CRAIK FIM
48	BADDELEY AD	CRAIK FIM
47	CRAIK FIM	JACOBY LL
41	CRAIK FIM	KINTSCH W
37	CRAIK FIM	EYSENCK MW
35	ANDERSON JR	CRAIK FIM
35	BRANSFORD JD	CRAIK FIM
35	CRAIK FIM	UNDERWOOD BJ
34	ATKINSON RC	CRAIK FIM
34	CRAIK FIM	PERLMUTTER M
33	CRAIK FIM	HASHER L
29	CERMAK LS	CRAIK FIM
29	CRAIK FIM	PAIVIO A
28	CRAIK FIM	POSTMAN L

Select units via file
You may also prepare a file containing the units. That file must be named filename.unt.

Then you must select the out-file. From that file all units in filename.unt will be selected. Consequently, the new out-file filename.uot will be shorter.

Then you make the pairs by selecting: Make pairs via file

This procedure needs a file of the out-type (a uot-file, with its fewer units, also works well).

... with frequencies
means that the number of co-occurrences is calculated.

... don't count frequencies
means that all pairs are listed with their document numbers. It is good to have that option since we may wish to add labels to the pairs, for example the publication year in which a co-author pair was formed.

The co-occurrence result will be saved in a file: filename.coc.

Make a matrix.
You may need to put co-occurrences in matrix form, for example when making an MDS-based (multidimensional scaling) map.

1. First, view the cit-file to select the units to be included in the matrix.
2. Then select the pair-file (freq+tab+pairleft+tab+pairright). It can be a coc-file for co-occurrence, but any file with the same format and containing the relevant units can be used.
3. Then select Make a matrix from the Analyze menu. You will be asked if you want to make a lower left matrix or a squared matrix. The matrix will be saved in filename.ma2.

Example on how to make a co-occurrence matrix of cited authors
This is an example from the memory.doc file.

1. Select memory.doc and then select CR- cited reference and pre
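
The out-file and frequency steps described under Preparing the data and Frequency distribution can be sketched in Python. This is only an illustration of the file logic, not Bibexcel's own code: the function names make_out_rows and frequency_rows, and the in-memory record layout (a dict per record keyed by field tag), are assumptions made for the example.

```python
from collections import Counter


def make_out_rows(records, tag):
    """Return (document number, unit) rows for a ';'-separated field.

    `records` is a list of dicts mapping a field tag such as "AU" to its
    raw text. This layout is an assumption made for the sketch.
    """
    rows = []
    for doc_no, record in enumerate(records, start=1):
        units = [u.strip() for u in record.get(tag, "").split(";") if u.strip()]
        # Units within a document are sorted alphabetically, as in memory.out.
        rows.extend((doc_no, unit) for unit in sorted(units))
    return rows


def frequency_rows(out_rows, min_number=1):
    """Count each unit and return (frequency, unit) rows, highest first."""
    counts = Counter(unit for _, unit in out_rows)
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [(n, unit) for unit, n in ranked if n >= min_number]


# The AU fields of the first two records shown in the text.
records = [
    {"AU": "GARDIAL SF; BIEHAL GJ"},
    {"AU": "TULVING E"},
]
out_rows = make_out_rows(records, "AU")
for doc_no, unit in out_rows:
    print(f"{doc_no}\t{unit}")
for n, unit in frequency_rows(out_rows):
    print(f"{n}\t{unit}")
```

Ties in frequency are broken alphabetically here; the text does not say how Bibexcel orders ties, so that choice is arbitrary.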
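
The arithmetic behind Fractionalize can be written out as a short Python sketch. It assumes out-file rows held in memory as (document number, unit) tuples; the function names fractionalize and fraction_sums are hypothetical and are not part of Bibexcel.

```python
from collections import defaultdict


def fractionalize(out_rows):
    """Give each unit of a document with n units the fraction 1/n."""
    units_per_doc = defaultdict(int)
    for doc_no, _ in out_rows:
        units_per_doc[doc_no] += 1
    return [(doc_no, unit, 1 / units_per_doc[doc_no]) for doc_no, unit in out_rows]


def fraction_sums(fraction_rows):
    """Add up the fractions of each unit over all documents."""
    totals = defaultdict(float)
    for _, unit, fraction in fraction_rows:
        totals[unit] += fraction
    return dict(totals)


# The eleven cited authors of document 1 from the memory.oux example.
cited_authors = [
    "BATRA R", "BETTMAN JR", "BIEHAL G", "COLLINS AM", "CRAIK FIM", "EAGLE M",
    "JOHNSON EJ", "LEAVITT C", "MITCHELL AA", "PETTY RE", "TULVING E",
]
oux_rows = [(1, author) for author in cited_authors]
fraction_rows = fractionalize(oux_rows)

for doc_no, unit, fraction in fraction_rows:
    print(f"{doc_no}\t{unit}\t{fraction:.3f}")

# Each fraction is 1/11; rounded to three decimals the eleven values add
# up to 1.001, which is the figure mentioned in the text.
rounded_total = round(sum(round(f, 3) for _, _, f in fraction_rows), 3)
print(rounded_total)
```

Summing the unrounded fractions gives 1 per document, which is why the sum of fractions equals the number of documents.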
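
The estimated column in the Lotka-like distribution can be reproduced with a few lines of Python. The sketch assumes the estimate is the observed count at x = 1 divided by x squared and rounded to a whole number, which matches the figures in the table; the function name lotka_estimates is hypothetical.

```python
def lotka_estimates(observed):
    """Estimate the number of authors at each x as observed[1] / x**2.

    `observed` maps x (times cited) to the number of authors with that x.
    """
    base = observed[1]
    return {x: round(base / x**2) for x in sorted(observed)}


# A few (x, observed) values taken from the table in the text.
observed = {1: 1901, 2: 363, 3: 147, 4: 78, 5: 61, 10: 10, 78: 1, 243: 1}
estimated = lotka_estimates(observed)

print("x\tObs.\tEst.")
for x in sorted(observed):
    print(f"{x}\t{observed[x]}\t{estimated[x]}")
```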
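
The pair counting and matrix steps described under Co-occurrence analysis can be sketched as follows. This is a minimal illustration under stated assumptions, not Bibexcel's implementation: rows are (document number, unit) tuples, each pair is stored in alphabetical order, and the function names cooccurrence_counts and square_matrix are hypothetical.

```python
from collections import Counter, defaultdict
from itertools import combinations


def cooccurrence_counts(out_rows, selected=None):
    """Count how many documents each pair of units shares.

    If `selected` is given, only units in that set are paired, which
    mirrors choosing the units before making the pairs.
    """
    units_by_doc = defaultdict(set)
    for doc_no, unit in out_rows:
        if selected is None or unit in selected:
            units_by_doc[doc_no].add(unit)
    pairs = Counter()
    for units in units_by_doc.values():
        # A document with n units contributes n * (n - 1) / 2 pairs.
        pairs.update(combinations(sorted(units), 2))
    return pairs


def square_matrix(pairs, units):
    """Build a symmetric matrix of pair counts in the order of `units`."""
    index = {unit: i for i, unit in enumerate(units)}
    matrix = [[0] * len(units) for _ in units]
    for (left, right), n in pairs.items():
        if left in index and right in index:
            matrix[index[left]][index[right]] = n
            matrix[index[right]][index[left]] = n
    return matrix


# Authors of documents 1 and 4 from the memory.out example.
out_rows = [
    (1, "BIEHAL GJ"), (1, "GARDIAL SF"),
    (4, "BACKMAN L"), (4, "HARDY J"), (4, "NILSSON LG"), (4, "WINBLAD B"),
]
pairs = cooccurrence_counts(out_rows)
for (left, right), n in sorted(pairs.items()):
    print(f"{n}\t{left}\t{right}")

units = ["BACKMAN L", "HARDY J", "NILSSON LG", "WINBLAD B"]
matrix = square_matrix(pairs, units)
```

Document 1 contributes one pair and document 4, with four authors, contributes six, which shows how quickly the pair count grows with the number of units per document.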