版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领
文档简介
TM
HOWAIIS
RE-ARCHITECTING(→)THEDATA
CENTEROF
THEFUTURE
PRAvEENMIDHA“WHITEPApERŊ
GopALSHARmA
SEpTEmBER.2025
HOWAIISRE-ARCHITECTING(→)
THEDATACENTEROFTHEFUTURE
ExecutiveSummary
Artificialintelligence(AI)hasproventobeaparadigmshiftforworkloadsinthemoderndatacenter.Ithasreshapedindustryperspectivesoneverythingfromdataprocessingtopowerconsumption.ThiswhitepaperexaminesAI’stransformative
impact,includingtherisingtemperatureofdatautilization,thegrowingenergydemandsofacceleratedcomputing,andthe
limitationsofcurrentstoragearchitectures.Weexplorehownewandinnovativetechnologiesarebridgingthesegapstomeettheneedsoftomorrow'sAI-drivenworkloads.
ThreeKeyTrendsintheDataCenter
1.AI:AMissionCriticalEnterpriseWorkload
AIhaslongexistedinthetechecosysteminvariousforms,butwhenagamechangingAIchatcametomarketinNovember2022,itsparkedarevolutionincomputecenteredaroundLargeLanguageModels(LLMs)andotherGenerativeAI(GenAI).Thisnewworkloadprovidesexcitingnewcapabilitiesthatinturnprovidedasurgeofinterestandinvestment.
AIITspendingisprojectedtogrowatacompoundannualgrowthrate(CAGR)of31.92%,from$315.9billionin2024to$1.262trillionby2029.Bythen,AIspendingwillconstitute16.4%oftotalITexpenditures.1
CAGR22%
CAGR39%
ServiceprovidersandenterprisesareheavilyinvestinginrelativelynascentanddistinctAIinfrastructure.Accordingtoarecentreport,datedAugust4th2025,the2025GlobalCloudCapExisnowtrackingto$445Band56%Y/Ygrowth,withupsideto
+30%growthin2026.2
WeareexitingtheearlystagesofwhatsomehavecalledtheAIgoldrush,anditisalreadyclearthatAI’srisesignifiesa
paradigmshiftintheentiredatacenterarchitecture,withimplicationsforstorage,compute,andnetworking.Attheendof2024,SandiskintroducedanewAIDataCyclestorageframeworktohelpcustomersunlockAI’spotential.
Learnmoreabouttheframework
2.RisingGlobalDataTemperatures
DatafuelsAI.Atitscore,AIisfundamentallyaboutdatautilizationanddatageneration.LLMsthriveonever-expanding
datasetstoenhanceaccuracy,versatility,andbiasmitigation.Intheprocess,AIsystemscreatenewdata,makingexistingrepositoriesmorevaluableformodelcontextandlearning.
ThecurrenttrendforpublisheddatapointsforLLMsshowsthattrainingdatasetsizesaregrowingdramatically.AImodels
operateinacontinuousloopofdataconsumptionandgenerationandthevastmajorityofthisdataisunstructured.However,thisinsatiabledemandfordatacreateschallengesforstoragearchitects.
HerearesomekeyinsightsfromIDC’s2025GlobalDatasphere3:
•Datagenerationgrowth:24%CAGRfrom2023to2028
•Enterprisedataoutpacingconsumerdata:30%CAGR
•Unstructureddatadominance:92.9%ofalldatain2023
TM
PG.02μWHITEpApERŊHowAIIsRE-ARcHITEcTINGTHEDATACENTERoFTHEFuTuRE
LLMshavebecomemultimodal,processingdiversedatatypessuchastext,video,andimages.Theevolutionofdata-richgenerativeAIchatmodelsimpactsthecapacityrequirementsofunderlyingstorage,andthespeed,latency,retrievability,andscalability.
TooptimizetrainingtimeandGPUusage,storagearchitecturesmustquicklydeliverrelevantdatasetstoGPUs.IDCcategorizesdataintothreelatencytypes4:
•UltraReal-Time(<40mstransferlatency)
•Real-Time(40ms–200mslatency)
•Nominal-Time(>200mslatency)
AsdatalatencytrendstowardsReal-TimeandUltraReal-TimeinGPUcentricworkloadsthereisagrowingneedforfaster
storageandretrievalindatacenters.Serviceprovidersarerespondingtothiswithinnovationslikeamajorcloudprovider’sS3optimization(Nov2023),offeringupsignificantlybetterperformanceforAIandMachineLearning(AI/ML)trainingwithsingle-digitmillisecondlatency.5
3.Power:TheNewMetricforDataCenterCapacity
Datacentercapacityhastraditionallybeenmeasuredbyfloorspacesize.However,AI-drivenworkloadsareinherentlymorepowerintensivethanmainstreamworkloads.WhileGPUsaremorepowerefficientthanCPUsforAImodeltraining,the
amountofGPUsrequiredtotrainthelatestmodelsismeasuredinthetenstohundredsofthousandsofunitsoperatingconstantlyatfullcapacityformonths.Likewise,AIinferencingworkloadsleveragelargeclustersofGPUstogenerate
responses,andconsumemorepowerthantraditionalanalyticsordatabaseworkloads.
AIdatacenterpowerconsumptionisgrowing:
•AIdatacenterenergyconsumptionisprojectedtogrowata19.5%%CAGR,between2023and2028.6
•TheshareofU.S.powerdemandconsumedbydatacentersisexpectedtoincreasefrom4.4%todaytobetween6.7and12%by20287
Powerhasbecomeacriticalmetricfordatacenters,incurringthelargestoutgoingexpenseofdatacenteroperationand
accountingfor60%oftotalspendingforserviceproviders.6Risingelectricitypricesandgrowingdemandunderscorethe
importanceofenergy-efficientarchitectures,andacomplexinterplayoffactorswillmakesupplymoreunpredictableand
costly,suchasenvironmentalregulation,geopoliticalfactorsandextremeweatherevents.Powerhasemergedasthenewlyimportantunitofmeasureasitbetterrepresentsdatacentercapacitythanfloorspace.
TheEconomicsof“StateoftheArt”StorageHDDsDominateToday,butChangeIsComing
In2024,TapeandSSDsrepresentedacombined20%whileHDDsrepresentedtheremaining80%ofthehyperscalestoragemediashareofinstalledbaseofcapacity.8
EBInstalledBaseShare6
Layeredonthisstoragemedia,CloudServiceProviders(CSPs)offeraccesstoallthreestorageprotocols:Block,File&Object.
WhileBlockandCloudFilestoragehavesmallerfootprints,Objectstorageisthelargestandfastest-growingsegmentindatastorage.Blockstorageworkloadsareperformance-demanding,latency-sensitive,andhaverandomaccesspatterns(IOPS).
CloudFilestorageisdesignedforsharedfileaccess,withbothBlockandFileusingamixofSSDsandHDDs.
TM
PG.03μWHITEpApERMHowAIIsRE-ARcHITEcTINGTHEDATACENTERoFTHEFuTuRE
Objectstorageemphasizesscaleandeconomics,withcommonusecaseslikebackup&restoreandarchiving.Asa
hypotheticalexample,amajorcloudproviderboasts350trillionobjects,amountingtoexabytes(possiblyzettabytes)ofdata.Objectstorageworkloadsaremostlysequential(bandwidth),favoringHDDs.
Givenitsfocusonscaleandcost-efficiency,it'snosurprisethatmostclouddatahastraditionallybeenstoredonHDDs.
TheChangingWorkloadMix
AIisdrivingtheadoptionofdatalakesasanemergingusecase,besidesthetraditionalbackup&restoreandarchiveforobjectstorage.Datalakesarecentralizedrepositoriesthatstoreandprocesslargeamountsofunstructureddata,andtheysupport
bothtrainingandinferencetasks.Unliketraditionalworkloads,datalakesrequirehighperformanceandlowlatency.HDDsarestrugglingtomeetthesedemands,pavingthewayforSSDsthatcanchecktheboxesforbothperformanceandcapacityinperformance-criticalscenarios.
ServiceprovidersarerespondingtothisgrowingAIworkloadbyintroducinganewstorageclassforobjectstoragethatis
optimizedtodeliverasignificantperformanceupliftneededbytheseworkloads.Thisrequiresasolutionthatcancheckboththeboxesofhigherperformanceandlargercapacity.
9
Asthefollowinggraphshows,HDDperformancehasstagnatedovertheyears,whileperformanceperterabytehassteadilydecreasedascapacityhasincreased.
HDDThroughputandCapacityGrowth
CSPshavetypicallyofferedseveralstorageclasses,whichcanbebroadlydividedinto3buckets:Standard,InfrequentAccessandArchival.ThesehavetraditionallybeenservedfromHDD,andwithsomearchitecturalchangesHDDscouldperformas
neededwithover-provisioningforcapacityorperformance.WiththerecentdemandsofAI,CSPshavestartedtooffera
performance-optimizedtierthatisdifficulttoachievesimplybyaddingmorecapacity-thisiswherehighcapacitySSDsarefitforpurpose.
TotalCostofOwnership(TCO)IsKey
Historically,storagebuildoutshavebeendrivenbydeviceacquisitioncosts,giventheavailabilityofabundantspace,low-
costpower,andreasonableperformancemetrics.Onaverage,theprice-per-gigabyte(ASP$/GB)forflashstoragehasbeenapproximatelysixtimeshigherthanHDDs,makingHDDsthecleareconomicchoice.
However,thisequationisshifting.Risingelectricitycosts,increasingpowerdemandsfromGPUs,andtheneedforfasterdatalakesarereshapingthestoragelandscapeforAIworkloads.Asaresult,TotalCostofOwnership(TCO)isbecomingamore
relevantmetricthanASP($/GB)alone.
TCOincludesboth:
•Acquisitioncosts:Servers,storage,networking,andsoftware.
•Operationalcosts:Infrastructure(floorspace,power—bothidleandactive),labor,support(hardwarereplacements,dataprotection,reliability).
BeyondTCO,anothercriticalshiftismovingfromrawstoragetoeffectivestorage,whichprovidesamoreaccuratemeasureofstorageefficiency.Effectivestorageconsidersfactorssuchas:
•Capacityutilization
•Dutycycle(activevs.idle)
•Replicationfactor
•Performancemultiplier
•Datareductionratios
TM
PG.04μWHITEpApERMHowAIIsRE-ARcHITEcTINGTHEDATACENTERoFTHEFuTuRE
Asthedemandforbothstoragedensityandperformancedensitygrows,evaluatingtheavailableHDDandSSDcapacitypointsbecomesessential.
ProjectedSSDandHDDCapacityGrowth9
LetsassumetwovariationsofSSD/HDDASP($/GB)withmultiplesof6xand5x.
UsingtheseassumptionsandwhatwehavelearnedaboutthenewdeploymentssupportingAIworkloadsinthedatacenter,webuiltaTotalCostofowernship(TCO)modelforahypotheticalgreenfielddatacenterwith1EB(1000PB)ofstorage.Themediaoptionsareall-HDDwiththehighestavailable30TBHDDvsall-SSDwithvariouscapacitypointsfrom30Tto240T.
SSD/HDDTCORatio10
ItisclearfromthechartthatasSSDcapacitiesgoup,theirTCOsstarttolookmorecompetitivevsHDD.Inthesecondchart,wekepttheASPmultipleat6xbutdoubledthecostofpower.Athigherpowercost,theall-SSDTCOstartstolookmore
attractiveatlowercapacitypoints.
NOTE:ForSAS/SATA,theconversionfromHDDtoSSDwasrealizedat3xmultiple
PerformanceConsiderations
ThelatestSSDcapacitypointsleverageQuad-LevelCell(QLC)NANDtechnology,whichincreasesbitdensityby33%
comparedtoTriple-LevelCell(TLC).WhileQLChaslowerrandomread/writeperformance,lowersequentialwritespeeds,
andlowerendurancethanTLC—alongwithslightlyhigherwritepowerconsumption—itsincreasinglayercountandcompactpackagingcontinuetodrivecapacityadvancements.
Moreover,QLCSSDsarewell-suitedforread-heavy,large-blocksequentialworkloads,aligningwithobjectstorageusecases.DespiteQLC’strade-offs,SSDssignificantlyoutperformHDDs,offering:
•MassiveIOPSadvantagesinrandomworkloads.
•2–3xhighersequentialthroughput(bandwidth)comparedtoHDDs.11
•SuperiorscalabilitythroughtheNVMe™interface,unlikeHDDs,whichremainconstrainedbylegacySAS/SATAinterfaces.
TM
PG.05μWHITEpApERηHowAIIsRE-ARcHITEcTINGTHEDATACENTERoFTHEFuTuRE
ClosingThoughtsontheGrowingRoleofAIinDataCenters
AIisrapidlyemergingasastrategicworkloadinthedatacenter.Theracetodevelopnext-generationLLMsiswellunderway,withthesemodelsexpectedtobemorecompute-intensiveonaverage.AstheytrainonlargerdatasetswithfasterGPUs,thedemandsonstoragearchitectureareevolving.Futurestoragesolutionsmustnotonlyscaleincapacityandspeedbutalso
optimizepowermanagementmoreintelligentlythanlegacyalternatives.
IDC'sviewisthatGenAIwillhaveagreaterpositiveimpactonSSDandmemoryspendingthanHDDs.12
AtSandisk,we’reexcitedaboutthefutureofAI-drivenstorageinnovation.Ournext-generationSSDsarebeingdesignedwith
highercapacities,continuedperformancescalingthroughPCIe™Gentransitions,andconfigurablepowerprofilesthatalignwiththeperformanceandcoolingrequirementsofmoderndatacenters.
AsSSDcapacitiesgrow,newtechnicalchallengesandopportunitiesarise—suchasblastradiusmanagement,dataretentionvs.power-performancetradeoffs,indirectionunit(IU)size,anddataplacementstrategies.AtSandisk,werecognizethese
challengesandareactivelyinnovatingtoaddressthem,ensuringthatourstoragesolutionsmeettheevolvingneedsofAI-powereddatacenters.
1.IDC,WorldwideArtificialIntelligenceITSpendingForecast2024–2028,#US52635424,October2024.
2.MorganStanleyResearch,August42025,CloudCapex.
3.IDC,WorldwideGlobalDataSphereStructuredandUnstructuredDataForecast,2024–2028#US52554824September2024.
4.IDC,StreamingandReal-TimeDatainIDC’sGlobalDataSphere,#US53122225,2025.
温馨提示
- 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
- 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
- 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
- 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
- 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
- 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
- 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
最新文档
- 复读机构引流运营方案
- 医院运营管理建设方案
- 测试公众号运营方案
- 物流运营方案创新
- 小美抖音运营方案
- 新航线运营方案
- 付费课程自媒体运营方案
- 外卖小店运营方案范文
- 公司宣传小组运营方案
- 小县城夜市运营方案
- 2026浙江台州路桥区第二人民医院医共体招聘10人备考题库含答案详解
- 2025年宜宾市公安局翠屏区分局招聘警务辅助人员考试试卷真题
- 2026年气象局公务员遴选面试指南与气象服务解析
- 五 长方形和正方形 第1课时 认识相交与平行 课件 内嵌视频 2025-2026学年苏教版三年级数学下册
- 2026山东济南新旧动能转换起步区招聘40人备考题库附答案详解
- 2026草原资源保护课件
- 2026春统编版语文 24 大禹治水 教学课件
- 2026年高考英语作文高分全景备考体系:模板 + 万能句型 + 实战指南
- 高中生物2019新人教版必修二旁栏问题及课后练习答案(含所有章节)
- 《聚氨酯发泡机设计(论文)》
- GB/T 5147-2003渔具分类、命名及代号
评论
0/150
提交评论