EasyChair Preprint

№ 11192

Beyond Code Assistance with GPT-4: Leveraging GitHub Copilot and ChatGPT for Peer Review in VSE Engineering

Roar Elias Georgsen

EasyChair preprints are intended for rapid dissemination of research results and are integrated with the rest of EasyChair.

October 28, 2023

Beyond Code Assistance with GPT-4: Leveraging GitHub Copilot and ChatGPT for Peer Review in VSE Engineering

Roar Elias Georgsen [0000-0003-2868-497X]

University of South-Eastern Norway, Raveien 215, 3184 Borre, Norway

roar.e.georgsen@usn.no

Abstract. Most companies are Very Small Entities (VSEs), meaning they have fewer than 25 employees. Primarily domain specialists, these companies lack in-house expertise in important areas such as security and reliability engineering, process improvement, Quality Management (QM) and Systems Engineering (SE). VSEs struggle to adhere to Standard Operating Procedures (SOP), and research has shown that contractual obligations to follow industry standards and best practices have little effect on actual engineering. This paper describes a case study that explored the potential of Large Language Models (LLMs) to support engineering best practices at a VSE by taking on the role of an expert peer in areas where the company had a skills gap. Aiwell, a Norwegian producer of building automation equipment, used ChatGPT, GitHub Copilot and GPT-4 to assess the quality of their system and stakeholder requirements. A GPT-4 foundation model with no additional training was given links to reference materials on requirements engineering produced by The International Council on Systems Engineering (INCOSE) and allowed to participate in discussions on the same digital collaboration platform as the human engineers. The study found that AI-assisted requirement reviews immediately and positively impacted the entire engineering process, supporting the feasibility of integrating advanced AI technologies in VSEs, even with limited training and resources. Participants highlighted the complementary nature of human intelligence and AI, where LLMs augmented human judgment through dialogue, leading to enriched engineering practices. Ethical and data privacy considerations also emerged as central themes, emphasising the need for proactive measures.

Keywords: generative AI · very small entities · systems engineering · requirements engineering

Introduction

The global supply chain consists mainly of Very Small Entities (VSE), with a workforce ranging from five (5) to twenty-five (25) people [1], and micro-enterprises, having fewer than nine employees, make up 92% of European enterprises [2]. In Norway, VSEs allocate approximately 30% of their spending to research and development (R&D), and new product development makes up 20-30% of VSE revenue, a rate significantly higher than the less than 9% observed in large companies [3, 4].

Aiwell, a small Norwegian company developing outdoor automation systems, wanted to modernise its engineering practices in response to the growing complexity of its customer projects. With a high workload and lack of senior engineers in the labour market, Aiwell wanted to explore how technology could mitigate potential skills gaps and improve quality. The seemingly intuitive ease with which individual engineers used tools like ChatGPT and GitHub Copilot to generate functioning code and documentation motivated Aiwell to initiate a pilot study.

Objectives of the Study

The study was designed to explore the potential of Large Language Models (LLMs) to enhance engineering practices in the context of Very Small Entities (VSEs). The specific objectives that guided this exploration were:

Impact on Productivity and Quality: Investigating how engineers without prior experience could utilise LLMs to increase productivity and improve quality in general.

Feasibility of Generative AI for VSEs: Assessing whether LLM-based tools could be integrated into a VSE workflow at an accessible cost and without additional human resources.

Impact on Engineering Competency: Investigating how LLMs could fill knowledge gaps and contribute to increased competency in VSEs, in particular with regard to requirements engineering.

By focusing on these key areas, the study intended to provide insights into how LLM-based tools can address specific challenges faced by VSEs.

Background

Engineering in Very Small Entities (VSE)

VSEs lack experience working with standardised processes [5] and often lean towards informal and organically evolved methodologies. They find standardised methods too broad for their specific needs and value the agility and the ability to tailor workflows that informality offers to suit their unique contexts [6]. Small companies are conscious of the escalating customer and legal expectations for systematic engineering and their internal need for improvement [5]. However, they do not view these as beneficial to their work or relevant to their situation [6]. This belief in the absence of added value leads to opposition to change, and the actual engineering practices may not align with documented compliance [7]. VSEs are rarely required to document complete compliance with specific standards and thus prefer to produce only the minimum documentation required by contractual obligations [6]. Measures such as risk-sharing partnerships [8] intended to improve system quality can, in this context, reduce the actual quality of the system as responsibility and accountability move down the supply chain.

Standardizing engineering practices can be vital to VSEs for a multitude of reasons. These practices equip VSEs with the resources and proven methodologies to enhance the quality and efficiency of engineering. They foster improved project management, reinforce implementation processes, and contribute to competitiveness [5]. The ISO/IEC 29110 series and similar initiatives, tailored for VSEs, can facilitate the transfer of codified knowledge related to systems engineering, offering benefits such as promoting innovation, market access, quality control, and ethical adherence.

Standards can provide legal protection, shielding engineers from accusations of negligence. A scenario described in [9], involving an 8-person company that developed computer-controlled valves for industries such as pharmaceuticals and chemicals, illustrates the critical importance of standardized engineering practices, even for VSEs. The customer contracted the company to inspect the requirements using IEEE software engineering standards. However, the developer was unaware of the IEEE 1028 standard, which describes the types of software reviews and procedures required for execution. After installing the new software, the computer-controlled valves malfunctioned, causing damages in the chemical plant and leading to legal action. The court hearing revealed that the supplier could not provide evidence of an inspection according to the standard, resulting in legal and financial consequences. This incident underscores the necessity of adhering to standardized engineering practices to ensure quality, safety, and legal compliance, regardless of the company's size. It also highlights the potential risks and liabilities that can arise from neglecting these standards. In essence, standardized engineering practices can serve as a cornerstone for VSEs, ensuring quality, legal compliance, and economic viability, and are thereby crucial to their success and sustainability.

Requirements and Systems Engineering

In Aiwell's endeavour to improve its engineering practices, requirements quality emerged as a pivotal focus area. The importance of requirements quality is multi-fold. First, well-defined requirements serve as the cornerstone for successful project outcomes, reducing the likelihood of costly revisions and delays. Second, they facilitate clear communication among stakeholders, ensuring alignment on objectives and expectations. There is a strong positive link between capability in requirements definition and management [10]. A high-quality requirement is clear, concise, verifiable and traceable. It should be devoid of ambiguity, allowing for a single interpretation, and verifiable through inspection, analysis, or testing. Critically, a requirement should also be traceable, linking back to its source, rationale, and dependencies, which aids in managing changes and understanding impacts.

Large Language Models (LLMs)

Large Language Models (LLMs) are a subset of artificial intelligence models designed to generate human-like text. These models have gained significant attention in software engineering, particularly in automating coding tasks and improving code quality. LLMs have been evaluated for their effectiveness in code generation and have shown promising results in various aspects of software development [11]. A study by GitHub found that 85% of developers felt more confident in their code quality when authoring with GitHub Copilot. Moreover, code reviews were completed 15% faster, and 88% of developers reported maintaining a flow state, indicating increased focus and reduced frustration. LLMs are not just accelerating the coding process but are also enhancing the quality of the code. GitHub's internal metrics for code quality (readability, reusability, conciseness, maintainability, and resilience) showed significant improvement when developers used GitHub Copilot [12].

Using artificial intelligence, and more recently Deep Learning, to support engineering is not new [13], but in learning to use language, LLMs are also picking up other human skills, such as learning from a single example [14]. Whereas not long ago training an AI was an expensive process that took a long time, LLM foundation models can learn by just looking up the documentation [15]. It is this last newfound ability that Aiwell aimed to exploit in their pilot study.

Case Study: Leveraging LLMs to Enhance Requirements Reviews in a VSE

Case Description

Aiwell is a company with seven employees producing software and hardware used in building automation systems. Due to high workloads, Aiwell found it challenging to allocate sufficient time on a consistent basis to define high-quality requirement specifications for its diverse range of projects. The company wanted to leverage LLM-based tools to automate and enhance the quality of stakeholder and systems requirements, guided by the INCOSE Systems Engineering Handbook [16] and the Systems Engineering Book of Knowledge (SEBoK) [17]. Engineers at Aiwell already used GitHub for version control and wanted to explore GitHub Actions for task automation. They employed ChatGPT Plus and GitHub Copilot for AI-assisted script design and the OpenAI GPT-4 API in script execution. Key metrics included the number of requirement revisions, time spent on each requirement, and the frequency of requirement-tagged issues approved without further revisions. Initial implementation led to more precise and complete requirements, with fewer revisions and reduced time spent on each requirement. This section provides details and practical insights by demonstrating how Aiwell used LLMs effectively and efficiently regarding resources and time to improve their engineering practices, in particular with regard to improving requirements quality.
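The three key metrics named above could be computed from per-issue records along the following lines. This is only a sketch: the `RequirementRecord` structure, its field names, and the sample values are invented for illustration and are not data from the study.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class RequirementRecord:
    """One requirement-labelled issue (hypothetical structure)."""
    issue_number: int
    revisions: int        # edits made after the first submission
    minutes_spent: float  # total time logged against the requirement


def summarise(records):
    """Compute the three metrics named in the case description."""
    if not records:
        raise ValueError("no records to summarise")
    return {
        "mean_revisions": mean(r.revisions for r in records),
        "mean_minutes": mean(r.minutes_spent for r in records),
        # Share of requirements approved without any further revision.
        "approved_without_revision": sum(r.revisions == 0 for r in records) / len(records),
    }


# Invented sample values, for illustration only.
sample = [
    RequirementRecord(1, 0, 12.0),
    RequirementRecord(2, 2, 30.0),
    RequirementRecord(3, 1, 18.0),
    RequirementRecord(4, 0, 10.0),
]
summary = summarise(sample)
print(summary)
```

Tracking the same summary before and after introducing the LLM review step would give a simple basis for the comparison described in this section.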

Methodology

The study employed two primary methodologies: Action Research and Glaser's Grounded Theory Method (GTM).

Action Research (AR) is a participatory, iterative methodology focused on solving real-world problems through cycles of planning, action, observation, and reflection [18]. Aiwell conducted multiple cycles of Action Research to refine the approach, resolving immediate challenges and planning for long-term adaptability.

Glaser's Grounded Theory Method (GTM) is a qualitative research approach that emphasises the generation of theory directly from data through iterative coding and constant comparative analysis [19]. GTM was chosen for its flexibility in handling qualitative data. The study began by applying open coding to raw electronic communication, including instant messaging and comment threads. This preliminary analysis identified initial concepts, which were continuously refined into categories and themes. The study adopted a constant comparative analysis, integrating new data iteratively to evolve the emergent theory. This approach made sure the resulting theory was both relevant and contextually grounded.

Ethical and Commercial Considerations

Given Aiwell's status as a Very Small Entity (VSE), the study had to account for ethical concerns associated with data confidentiality and anonymity. Strategies were based on SAGE guidelines [18], and included focusing on collective insights to preserve internal anonymity and working closely with Aiwell to redact sensitive or proprietary data.

A Working Definition of Requirements Quality

To quantitatively assess the quality of a requirement, Aiwell employed a scoring system based on key attributes such as clarity, conciseness, testability, and traceability. Each attribute was assigned a weight, and all requirements were evaluated on a scale of 1 to 5 for each attribute. Aggregating the attribute scores produced a composite quality score. For example, a requirement with a clarity score of 4, conciseness score of 5, testability score of 3, and traceability score of 4 would yield a composite score of (4 * 0.30) + (5 * 0.15) + (3 * 0.35) + (4 * 0.20) = 3.8.
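The worked example can be reproduced with a few lines of Python. The weights below are read off the example in the order the attributes are listed (clarity 0.30, conciseness 0.15, testability 0.35, traceability 0.20); the function name and the validation rules are assumptions made for this sketch.

```python
# Weights taken from the worked example in the text; they sum to 1.0.
WEIGHTS = {
    "clarity": 0.30,
    "conciseness": 0.15,
    "testability": 0.35,
    "traceability": 0.20,
}


def composite_score(scores):
    """Return the weighted sum of the 1-5 attribute scores."""
    if set(scores) != set(WEIGHTS):
        raise ValueError("scores must cover exactly: " + ", ".join(WEIGHTS))
    for name, value in scores.items():
        if not 1 <= value <= 5:
            raise ValueError(f"{name} must be between 1 and 5, got {value}")
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS)


example = {"clarity": 4, "conciseness": 5, "testability": 3, "traceability": 4}
print(round(composite_score(example), 2))  # 3.8
```

Because the weights sum to 1.0, the composite stays on the same 1 to 5 scale as the individual attributes.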

Engineers evaluated requirements using an approach based on Planning Poker [20], an agile estimating technique where team members use playing cards to vote on the complexity of a task, facilitating discussion and consensus.

Requirements engineering best practices were drawn from two seminal resources: "The INCOSE Systems Engineering Handbook" and "The Systems Engineering Book of Knowledge" (SEBoK). These resources were also accessible to the LLMs through hyperlinks, as was a written description of the system context and the scoring system used by human evaluators. This provided a basic framework for assessing requirements.

Integrating LLMs in the Workflow

Introducing LLMs into the engineering workflow had to be low-cost and require minimal training. ChatGPT and GitHub Copilot served as the primary LLM-based tools for this purpose. ChatGPT is an AI conversational agent capable of generating human-like text based on the prompts it receives. GitHub Copilot is an AI-powered code assistant that helps engineers write new code and understand and work with existing code more efficiently. The tools' affordability made them attractive choices for a budget-conscious small entity. GitHub Copilot's seamless integration with Microsoft Visual Studio Code, a free code editor already used by Aiwell, facilitated a smooth transition. The OpenAI GPT-4 API, though demanding a steeper learning curve, was rendered more accessible with the assistance of ChatGPT and GitHub Copilot. While the code generated by these tools might not always be perfect, it was sufficient to expedite the development processes. LLMs could also participate in multi-lingual, realistic and nuanced human-like discussions with engineers about subtle topics like requirement quality.

Initially, Aiwell engineers started with the default interfaces of ChatGPT and GitHub Copilot. These tools' ease of use and intuitive interface meant that engineers could begin leveraging their capabilities without extensive training or preparation. This factor was crucial, given the resource constraints typical of VSEs. The user-friendly and adaptable tools fit effortlessly into Aiwell's existing GitHub-based version control and task automation workflows.

When OpenAI introduced custom instructions as a new feature in ChatGPT, Aiwell incorporated it into their workflow by adding links to relevant references as standard prefixes to their prompts. This allowed engineers to guide the AI model more effectively, aligning the generated responses with the project's specific context and needs. Standardising custom instructions enhanced the quality of LLM responses by making them more precise and context-appropriate.
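A standard prompt prefix of this kind might be assembled as in the sketch below. The SEBoK link is the one that appears in the workflow script shown in Figure 1; the function name, the wording of the prefix, and the sample question are hypothetical and are not taken from Aiwell's actual instructions.

```python
# Reference link taken from the workflow script in Figure 1.
REFERENCE_LINKS = [
    "https://sebokwiki.org/wiki/Stakeholder_Requirements_Definition"
    "#Checking_Correctness_of_System_Requirements",
]


def build_prompt(question, references=REFERENCE_LINKS, context=""):
    """Prepend a fixed block of reference links (and optional context) to a prompt."""
    lines = ["Use the following references when answering:"]
    lines += [f"- {url}" for url in references]
    if context:
        lines.append(f"System context: {context}")
    lines.append("")  # blank line between the prefix and the question
    lines.append(question.strip())
    return "\n".join(lines)


prompt = build_prompt(
    "Review this requirement: The temperature of the glycol must not exceed 50 degrees.",
    context="Outdoor automation system",  # hypothetical context string
)
print(prompt)
```

Keeping the prefix in one place means every engineer sends the same references, which is the standardisation effect described above.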

As the team gained experience, they began exploring more advanced features of the GPT-4 API. They started crafting prompts that communicated their intent more clearly to the LLMs, improving the quality of the generated response, whether code or requirements evaluation. This incremental and organic developer-led approach to adopting LLM features ensured the team could adapt without feeling overwhelmed, thereby maintaining productivity.

LLMs became an integral part of Aiwell's engineering practices, not as a disruptive technology requiring a steep learning curve but as enablers that were incrementally adopted. The metrics, including the reduced requirement revisions and time spent on each requirement, validated the effectiveness of integrating LLMs into the workflow. Crucially, these gains never required advanced applications such as pre-trained AI models or curated vector databases with custom knowledge but relied only on free and low-cost, publicly available tools.

Collaborating With LLMs on Writing Automation Scripts

Aiwell's engineers leveraged ChatGPT to automate GitHub Actions, focusing on requirements validation. The process began with a simple prompt to ChatGPT, asking it to draft a GitHub Action script to call the GPT-4 API if a user labelled a GitHub repository issue as a requirement. Despite some initial regular expression and JSON parsing challenges, the engineers iteratively refined the prompts, leading to effective scripts. Figure 1 shows an abbreviated example of the generated code.

GPT-4's output, posted as a GitHub comment on the issue, comprehensively evaluated the requirement using SEBoK and the INCOSE Handbook as a reference and provided the engineer with a task list of suggested improvements. Figure 2 shows the full analysis of a requirement generated by GPT-4. Later iterations included a composite score based on the provided guidelines. The AI-generated comment would break down the requirement's clarity by assessing its specificity, conciseness by indicating unnecessary details, testability by evaluating the clearness of acceptance criteria, and traceability by checking its linkage to system needs or stakeholder requirements. Each attribute would receive a score, and GPT-4 would calculate a composite score using the same scoring system and guidelines employed by Aiwell's human engineers. This approach enriched the requirements validation process, offering a quantitative and qualitative assessment to supplement the human scoring and encourage critical discussion. It also showed how VSEs, even with limited resources, can incrementally integrate LLMs into their existing workflows. The use of ChatGPT in script automation not only streamlined the task but also added a layer of intelligence and review, making the process more robust and efficient. This part of the study demonstrated the capability of LLM base models to perform complex systems engineering tasks that align with established systems engineering principles.

    name: Check Requirement Correctness
    on:
      issues:
        types: [labeled]
    jobs:
      check-requirement:
        runs-on: ubuntu-latest
        if: contains(github.event.label.name, 'requirement')
        steps:
          - name: Checkout code
            uses: actions/checkout@v2
          ...
          - name: Check requirement
            env:
              GITHUB_TOKEN: ${{ secrets.MY_GITHUB_TOKEN }}
              OPENAI_API_KEY: ${{ secrets.YOUR_OPENAI_API_KEY }}
            run: |
              ...
              # Load the Statement of Work
              SYSTEM_CONTEXT=$(cat README.md)
              SCORING_GUIDELINES=$(cat scoring.md)

              # Get the content related to checking correctness of system requirements
              CORRECTNESS_GUIDELINES="https://sebokwiki.org/wiki/Stakeholder_Requirements_Definition#Checking_Correctness_of_System_Requirements"
              ...
              # Call GPT-4 API with issue details
              RESPONSE=$(curl https://api.openai.com/v1/chat/completions \
                -H "Content-Type: application/json" \
                -H "Authorization: Bearer $OPENAI_API_KEY" \
                -d '{
                  "model": "gpt-4",
                  "messages": [
                    {"role": "system", "content": "$SYSTEM_CONTEXT $CORRECTNESS_GUIDELINES $SCORING_GUIDELINES"},
                    {"role": "user", "content": "Title: $ISSUE_NAME. Id: $ISSUE_NUMBER. Description: $ISSUE_BODY"}
                  ]
                }')
              ...
              # Add the response as a comment to the issue using GitHub API
              curl -s -X POST -H "Authorization: token $GITHUB_TOKEN" -H "Content-Type: application/json" -d "$PAYLOAD" "$COMMENT_URL"

Fig. 1. Partial GitHub Action yml script generated by ChatGPT
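The listing places shell variables directly inside a JSON string, which is where the JSON parsing difficulties mentioned in the text tend to arise: a README or issue body containing quotes or line breaks would make the request body invalid. One possible alternative, sketched here in Python rather than shell, is to let a JSON serialiser build the body. This is not the script used in the study; the function name and parameters are hypothetical, while the model name and message layout mirror Figure 1.

```python
import json


def build_chat_request(system_context, correctness_guidelines, scoring_guidelines,
                       issue_name, issue_number, issue_body, model="gpt-4"):
    """Serialise the request body so quotes and newlines are escaped correctly."""
    system_prompt = "\n\n".join(
        [system_context, correctness_guidelines, scoring_guidelines]
    )
    user_prompt = (
        f"Title: {issue_name}. Id: {issue_number}. Description: {issue_body}"
    )
    return json.dumps({
        "model": model,
        "messages": [
            {"role": "system", "content": system_prompt},
            {"role": "user", "content": user_prompt},
        ],
    })


# Example with content that would break naive string interpolation.
body = build_chat_request(
    system_context='Snow-melting system "A"\nwith two glycol loops',
    correctness_guidelines="https://sebokwiki.org/wiki/Stakeholder_Requirements_Definition",
    scoring_guidelines="Score each attribute from 1 to 5.",
    issue_name="Glycol temperature limit",
    issue_number=42,
    issue_body='The temperature of the glycol must not exceed "50" degrees.',
)
print(body)
```

The resulting string could be written to a file and passed to `curl` with `-d @file`, leaving the rest of the workflow unchanged.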

    The temperature of the glycol must not exceed 50 degrees.

    ## AI Generated Evaluation

    The requirement is clear, concise, and unambiguous. However, it could be improved by providing more context and specificity. While it states a specific temperature limit, it does not specify the conditions under which this limit applies (e.g., during operation, at rest, etc.). Also, it does not provide any acceptance criteria or a method for testing this requirement.

    Here's a task list for improving this requirement:

    - [ ] Specify the conditions under which the temperature limit applies. For example, "The temperature of the glycol must not exceed 50 degrees Celsius during operation."
    - [ ] Define the acceptance criteria. For example, "The system is considered to meet this requirement if the glycol temperature remains below 50 degrees Celsius during a continuous 24-hour operation under standard snowfall conditions."
    - [ ] Provide a method for testing this requirement. For example, "This requirement can be tested by operating the system continuously for 24 hours under standard snowfall conditions and monitoring the glycol temperature."

    After these improvements, the requirement might look like this:

    The temperature of the glycol must not exceed 50 degrees Celsius during operation. The system is considered to meet this requirement if the glycol temperature remains below 50 degrees Celsius during a continuous 24-hour operation under standard snowfall conditions. This requirement can be tested by operating the system continuously for 24 hours under standard snowfall conditions and monitoring the glycol temperature.

Fig. 2. GPT-4 generated evaluation of requirement

Challenges and solutions

Data Privacy and Security. Integrating GitHub Copilot, ChatGPT, and the GPT-4 API into Aiwell's workflow presented significant security and privacy challenges, primarily due to a lack of clear policies and concerns expressed by government agencies [21]. To mitigate this uncertainty, the tools were initially restricted to non-sensitive material. However, as the study progressed, the introduction of GitHub Copilot for Business and Microsoft's Azure-hosted versions of OpenAI's models provided more secure alternatives [22]. OpenAI also clarified in its policies that GPT-4 API data is not stored by their servers, alleviating some concerns [23].

Model Limitations. While ChatGPT and GPT-4 demonstrated proficiency in domain-specific language, the models faltered in tasks like regular expression parsing, mathematics and JSON formatting. OpenAI's subsequent update, which introduced function-calling capabilities in the GPT-4 API, addressed these issues, enabling deterministic functions for complex tasks.
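The idea behind function calling can be sketched as follows: the model is asked only to supply structured arguments, and the arithmetic is done by ordinary code. The function name `record_requirement_scores`, the schema contents, and the handler are hypothetical. The schema follows the name/description/parameters layout (with a JSON Schema object for the parameters) used for function descriptions in the OpenAI API at the time, and the weights are those of the worked scoring example. No API call is made here.

```python
import json

# Weights from the worked scoring example earlier in the paper.
WEIGHTS = {"clarity": 0.30, "conciseness": 0.15, "testability": 0.35, "traceability": 0.20}

# Hypothetical function description that would be sent alongside the chat messages.
SCORE_FUNCTION = {
    "name": "record_requirement_scores",
    "description": "Record 1-5 scores for each quality attribute of a requirement.",
    "parameters": {
        "type": "object",
        "properties": {
            name: {"type": "integer", "minimum": 1, "maximum": 5} for name in WEIGHTS
        },
        "required": list(WEIGHTS),
    },
}


def handle_function_call(arguments_json):
    """Validate the model-supplied arguments and compute the composite locally."""
    args = json.loads(arguments_json)
    for name in WEIGHTS:
        value = args.get(name)
        if isinstance(value, bool) or not isinstance(value, int) or not 1 <= value <= 5:
            raise ValueError(f"invalid or missing score for {name}: {value!r}")
    return round(sum(WEIGHTS[name] * args[name] for name in WEIGHTS), 2)


# The model would return its arguments as a JSON string like this one.
arguments = '{"clarity": 4, "conciseness": 5, "testability": 3, "traceability": 4}'
print(handle_function_call(arguments))  # 3.8
```

With this split, a malformed or out-of-range score raises an error instead of silently producing a wrong composite, and the weighted sum no longer depends on the model's arithmetic.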

Human-AI Collaboration. Balancing human and AI contributions proved challenging. Although ChatGPT generated effective YML scripts, its numerical requirement scores often diverged from human evaluations. Action Research methodology helped here, as discrepancies triggered discussions, leading to an evolution of the evaluation criteria and a more stable consensus among engineers. Also, when given access to the comments section on GitHub, GPT-4 would join the discussion and be as opinionated as a human engineer if instructed specifically to be so.
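A simple way to surface such discrepancies for discussion is to compare the two sets of attribute scores and list the attributes where they differ by more than an agreed margin. The sketch below assumes a margin of two points and invented AI scores; neither the threshold nor the function name comes from the study.

```python
def flag_divergence(human, ai, threshold=2):
    """Return {attribute: (human_score, ai_score)} where scores differ by >= threshold."""
    if set(human) != set(ai):
        raise ValueError("human and AI scores must cover the same attributes")
    return {
        name: (human[name], ai[name])
        for name in human
        if abs(human[name] - ai[name]) >= threshold
    }


# Human scores from the worked example; AI scores invented for illustration.
human_scores = {"clarity": 4, "conciseness": 5, "testability": 3, "traceability": 4}
ai_scores = {"clarity": 4, "conciseness": 3, "testability": 2, "traceability": 4}

to_discuss = flag_divergence(human_scores, ai_scores)
print(to_discuss)  # {'conciseness': (5, 3)}
```

Only the flagged attributes need to go on the agenda for the next review cycle, which keeps the discussion focused on genuine disagreements.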

Quality Control. Ensuring the quality of AI-generated evaluations required vigilant oversight. A systems engineer consistently reviewed the model's outputs, maintaining a human-in-the-loop approach at all times. This review process was integral to the iterative cycles of planning, action, observation, and reflection.

Error Handling. Scripts generated by ChatGPT and GitHub Copilot underwent rigorous scrutiny and testing, and engineers provided the tools with prompts based on a template that evolved incrementally based on previous mistakes made by the LLMs. However, since GPT-4's output was limited to comments and not executable code, the risk of operational disruptions was minimised.

Scalability. Scaling the approach for larger tasks or teams posed challenges. The solution involved using small, template-based scripts and leveraging LLMs for extensive commenting and documentation.

In summary, the challenges encountered by Aiwell were systematically addressed, often benefiting from the iterative and reflective nature of the Action Research methodology. This approach not only resolved immediate issues but also contributed to the long-term adaptability and robustness of the engineering workflow.

Emergent Themes

The Grounded Theory Method's iterative nature was pivotal in the continuous refinement and evolution of the emergent themes. By ensuring that these themes were deeply rooted in the experiences and feedback of Aiwell's engineers, the study captured the authentic challenges and opportunities of integrating AI into the workflows of a VSE. Several distinct themes emerged from the study, each shedding light on different facets of integrating AI into the workflows of a small company like Aiwell. These themes provide insights into the immediate benefits and challenges of AI adoption and hint at the broader implications for the future of engineering practices.

Accessibility. The ease of integrating AI tools into Aiwell's workflows underscored the theme of feasibility and accessibility. Some engineers had initially perceived the adoption of AI as a daunting task. However, the user-friendly nature of tools like ChatGPT and GitHub Copilot facilitated a smooth transition. A comment from an Aiwell team member captured this sentiment: "We thought integrating AI would be a massive undertaking, but these tools made the transition surprisingly smooth." This theme emphasises the democratisation of advanced technologies, making them accessible even to smaller entities. In the early stages of the study, the data pointed towards the feasibility of integrating AI tools into Aiwell's workflows. However, as engineers gained more hands-on experience with these tools, their feedback began to reflect a broader perspective. Comments like "The integration was smoother than we anticipated" highlighted the feasibility and accessibility of advanced AI technologies. This development under
