Vexed on Vectors?
THR2918
Connor McDonald
Database Advocate
@connor_mc_d
https://linktr.ee/connor
Like talking tech? So do I
Copyright © 2025, Oracle and/or its affiliates

so …

… let's be honest

ego

before 2023

then … it changed

the basics
pertains to data

vectors
50 21 16 42 33
fundamental data structure

this is not new

"Dude… you invented arrays."

key point

map data to a vector

33 42 16 21 50
Vector

"Dude… you invented hashing."

    SQL> select word, ora_hash(word) from nouns;

    WORD   ORA_HASH(WORD)
    apple  1759851605
    book   2877951221
    cat    836612377
    dog    4112090448
    house  4281657745
    car    3527616540
    tree   2634164408
    chair  3389199612
    phone  1471878889
    table  4071112899
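For comparison, the same kind of word-to-number mapping can be sketched outside the database. This is a minimal sketch, assuming Python's zlib.crc32 as a stand-in hash. It is not the algorithm behind ORA_HASH and will not reproduce the numbers above, and the helper name word_hash is hypothetical.

```python
import zlib


def word_hash(word: str) -> int:
    """Map a word to a 32-bit number (a stand-in, not ORA_HASH itself)."""
    return zlib.crc32(word.encode("utf-8"))


# A few of the nouns from the query output above.
nouns = ["apple", "book", "cat", "dog", "house"]
hashes = {word: word_hash(word) for word in nouns}

for word, value in hashes.items():
    print(f"{word:<6} {value}")
```

The mapping is repeatable, but the numeric gap between two hash values says nothing about how related the two words are.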
so what is the "breakthrough"?

[Figure: data, 10TB; 10,000 GPUs, weeks of compute, $$$$$; vectors, 100GB. Credit: Intro to Large Language Models, Andrej Karpathy]

vector = point in n-dimensional space
vectors preserve "meaning"

tokenization

tiktokenizer.vercel.app

each token becomes a vector

{Be} {fore} {you} {up} {grade} {take} {a}
[Figure: five numbers shown for each token, for example 50 21 16 42 33]
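The two steps above, splitting text into tokens and then mapping each token to a vector, can be sketched as follows. This is a toy illustration under stated assumptions: the vocabulary is just the seven tokens from the slide, the greedy longest-match splitter is a simplification (production tokenizers such as the one behind tiktokenizer work differently), and every number in the embedding table is a placeholder, not a value from a real model.

```python
# Hypothetical vocabulary: only the tokens shown on the slide.
VOCAB = {"Be", "fore", "you", "up", "grade", "take", "a"}

# Hypothetical embedding table: placeholder numbers, five per token.
EMBEDDINGS = {
    "Be":    [1, 2, 3, 4, 5],
    "fore":  [6, 7, 8, 9, 10],
    "you":   [11, 12, 13, 14, 15],
    "up":    [16, 17, 18, 19, 20],
    "grade": [21, 22, 23, 24, 25],
    "take":  [26, 27, 28, 29, 30],
    "a":     [31, 32, 33, 34, 35],
}


def tokenize(text, vocab):
    """Split each word into the longest vocabulary entries, left to right."""
    tokens = []
    for word in text.split():
        start = 0
        while start < len(word):
            # Try the longest remaining piece first, then shorter ones.
            for end in range(len(word), start, -1):
                if word[start:end] in vocab:
                    tokens.append(word[start:end])
                    start = end
                    break
            else:
                raise ValueError(f"cannot tokenize {word[start:]!r}")
    return tokens


def embed(tokens, table):
    """Look up one vector per token."""
    return [table[token] for token in tokens]


tokens = tokenize("Before you upgrade take a", VOCAB)
vectors = embed(tokens, EMBEDDINGS)
print(tokens)
```

In a real model the table entries are learned during training rather than written by hand.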
inputs to a neural network

[Figure: points plotted on X and Y axes, with a formula m = … built from ∑x², ∑y², ∑xy, etc.]
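The slide elides the formula itself. As an illustration only, and assuming it refers to the ordinary least-squares fit of a straight line, the slope can be computed from the same kind of sums. The function name fit_line and the sample points are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = m*x + c from running sums."""
    n = len(xs)
    sum_x = sum(xs)
    sum_y = sum(ys)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    sum_x2 = sum(x * x for x in xs)

    # Closed-form slope and intercept.
    m = (n * sum_xy - sum_x * sum_y) / (n * sum_x2 - sum_x ** 2)
    c = (sum_y - m * sum_x) / n
    return m, c


# Hypothetical points that lie exactly on y = 2x + 1.
xs = [1, 2, 3, 4]
ys = [3, 5, 7, 9]
slope, intercept = fit_line(xs, ys)
print(slope, intercept)
```

Here a known formula gives the answer directly, which is the case the next slide sets aside.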
what if there was no formula?

[Figure: plots of Y against X]

{Be} {fore} {you} {up} {grade} {take} {a}

1.414 x 0.78 = 1.102
0.92 x -1.13 = -1.039
1.102 + (-1.039) = 0.063
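The arithmetic on this slide is a weighted sum: each value is multiplied by a second number and the products are added. A minimal sketch, assuming 1.414 and 0.92 are the inputs and 0.78 and -1.13 are the weights (the slide does not say which is which, and the function name is hypothetical):

```python
def weighted_sum(inputs, weights):
    """Multiply each input by its weight and add up the products."""
    return sum(x * w for x, w in zip(inputs, weights))


inputs = [1.414, 0.92]    # assumed to be the inputs
weights = [0.78, -1.13]   # assumed to be the weights

products = [x * w for x, w in zip(inputs, weights)]
total = weighted_sum(inputs, weights)
print(products, total)
```

The slide's figures appear to be truncated to three decimal places, so the printed products differ slightly in the last digit.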
{Be} {fore} {you} {up} {grade} {take} {a}
    sleep
    backup
    pill
    glance
    …

in reality … a little more sophisticated

Before you upgrade take a backup

Before you upgrade database
    take a backup

Before you upgrade TV
    take a backup
    recycle old one

Before you upgrade hard disk
    take a backup
    recycle old one
    wipe drive

Before you upgrade plumbing
    take a backup
    recycle old one
    wipe drive
    turn water off

context is important
more reading
    Let's build the GPT Tokenizer - Andrej Karpathy
    Let's build GPT: from scratch, in code, spelled out - Andrej Karpathy

but the best part …

we don't need to know!

all we care about is the final model

why?

model yields our vectors

consider color … RGB as our "vector"
[Figure: 3-D plot with axes R, G and B, with points added one at a time]
    ● {75,10,10}
    ● {50,70,70}
    ● {25,75,75}

distance = similarity

[Figure: angles between vectors on the R, G, B axes, labelled cos = 0.3 and cos = 0.8]
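The idea that distance stands for similarity can be checked on the three colour points above. This is a minimal sketch with hypothetical function and variable names. It computes both the straight-line distance and the cosine of the angle between each pair, and neither result depends on which component is R, G or B. The cosines it prints are for these three points only and are not meant to match the 0.3 and 0.8 labels in the figure.

```python
import math


def euclidean_distance(a, b):
    """Straight-line distance between two points."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors (1.0 means same direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


first = (75, 10, 10)
second = (50, 70, 70)
third = (25, 75, 75)

for name, a, b in [("first/second", first, second),
                   ("second/third", second, third),
                   ("first/third", first, third)]:
    print(name, round(euclidean_distance(a, b), 2),
          round(cosine_similarity(a, b), 3))
```

Both measures agree that the second and third points are the closest pair.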
the model does this with data

[Figure: words plotted against two dimensions, d1 and d2: elephant, dog, cat, lion, wolf, tiger, puppy, kitten; pear, plum, apple, strawberry, raspberry, blackberry, kiwi; New York, California, Texas]

we can do this with any data

bring it back to database
1) need to generate models

10,000 GPUs?

    DBMS_DATA_MINING.import_onnx_model(
      model_name => 'All-MiniLM-L6-v2',
      model_data => 'All-MiniLM-L6-v2.onnx'
      ...);

2) need to store vectors

    CREATE TABLE recipes (
      id          NUMBER,
      description CLOB,
      photo       BLOB,
      my_vector   VECTOR(768, FLOAT32));

3) need to create vectors

    SELECT VECTOR_EMBEDDING(All-MiniLM-L6-v2 USING 'cake recipes containing citrus');

4) find vector distances

    SELECT …
    FROM recipes
    ORDER BY vector_distance(description, :myvector);
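Conceptually, ordering rows by their distance to a query vector is a sort over every stored vector. The sketch below shows that idea in plain Python. It is not how the database executes the query, cosine distance is an assumed choice of metric, and the rows, three-element vectors and function names are all hypothetical.

```python
import math


def cosine_distance(a, b):
    """1 minus the cosine of the angle between a and b (0.0 means same direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def rank_by_distance(rows, query_vector):
    """Return the rows ordered from nearest to farthest from the query."""
    return sorted(rows, key=lambda row: cosine_distance(row["vec"], query_vector))


# Hypothetical rows: made-up 3-element vectors, not real embeddings.
recipes = [
    {"id": 1, "description": "lemon drizzle cake", "vec": [0.9, 0.8, 0.1]},
    {"id": 2, "description": "beef stew", "vec": [0.1, 0.2, 0.9]},
    {"id": 3, "description": "orange sponge", "vec": [0.8, 0.9, 0.2]},
]
query = [1.0, 0.9, 0.0]   # stands in for the :myvector bind variable

ranked = rank_by_distance(recipes, query)
print([row["description"] for row in ranked])
```

Every stored vector is compared with the query, so the work grows with the number of rows.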
demo today 3:30pm :-)

another challenge

how to search this fast?

vector indexes

neighbourhood partitioned

neighbourhood graph
[Figure: graph with numbered nodes]

key point

accuracy | cost tradeoff

    SQL> create vector index DOCUMENT_VEC_IX
         on DOCUMENTS (fragment)
         organization INMEMORY NEIGHBOR GRAPH
         distance COSINE
         with target accuracy 95;

    Index created.
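The accuracy versus cost tradeoff can be illustrated with a toy version of the partitioned approach. The vectors are grouped around a few centroids, and a search looks only inside the partitions whose centroids are nearest to the query. This is a sketch under stated assumptions, not the database's implementation. The centroids are fixed by hand, the 2-D points are made up, and the n_probe parameter and all function names are hypothetical.

```python
import math


def build_partitions(vectors, centroids):
    """Assign each vector index to the partition of its nearest centroid."""
    partitions = {c: [] for c in range(len(centroids))}
    for index, vector in enumerate(vectors):
        nearest = min(range(len(centroids)),
                      key=lambda c: math.dist(vector, centroids[c]))
        partitions[nearest].append(index)
    return partitions


def search(query, vectors, centroids, partitions, n_probe):
    """Search only the n_probe partitions closest to the query.

    Returns (index of best match found, number of vectors compared).
    """
    order = sorted(range(len(centroids)),
                   key=lambda c: math.dist(query, centroids[c]))
    candidates = [i for c in order[:n_probe] for i in partitions[c]]
    best = min(candidates, key=lambda i: math.dist(query, vectors[i]))
    return best, len(candidates)


# Hypothetical data: five 2-D vectors and two hand-picked centroids.
vectors = [(1, 1), (4.9, 0), (6, 3), (9, 1), (0, 4)]
centroids = [(0, 0), (10, 0)]
partitions = build_partitions(vectors, centroids)

query = (5.2, 0)
cheap = search(query, vectors, centroids, partitions, n_probe=1)
thorough = search(query, vectors, centroids, partitions, n_probe=2)
print(cheap, thorough)
```

With one partition probed, the search compares fewer vectors but misses the true nearest neighbour, which sits just across the partition boundary. Probing both partitions finds it at the cost of comparing every vector.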