training GPT-3.5 on Microsoft’s Azure AI supercomputing infrastructure required roughly 3,640 petaflop-days (the equivalent of one quadrillion calculations per second over 3,640 days). As AI ...