Chasing Compute: The Role of Computing Resources in Publishing Foundation Model Research

Yuexing Hao1,2†, Yue Huang3†, Haoran Zhang1, Chenyang Zhao4, Zhenwen Liang3, Paul Pu Liang1, Yue Zhao6, Lichao Sun5, Saleh Kalantari2, Xiangliang Zhang3, Marzyeh Ghassemi1*

1 EECS, MIT; 2 Cornell University; 3 CSE, University of Notre Dame; 4 Computer Science Department, UCLA; 5 CS, Lehigh University; 6 School of Advanced Computing, USC

* Corresponding author: mghassem@mit.edu; † Equal contribution

Exploring the relationship between computing resources and scientific advancement in foundation models (FM).

Research Overview

Cutting-edge research in Artificial Intelligence (AI) requires considerable resources, including Graphics Processing Units (GPUs), data, and human labor. In this paper, we evaluate the relationship between these resources and the scientific advancement of foundation models (FMs). We reviewed 6,517 FM papers published between 2022 and 2024 and surveyed 229 first authors to understand the impact of computing resources on scientific output. We find that increased computing is correlated with individual paper acceptance rates and national funding allocations, but not with research environment (academic or industrial), domain, or study methodology.

We analyze 34,828 papers accepted between 2022 and 2024, identify 5,889 FM papers among them, and examine GPU access and TFLOPs alongside their correlation with research outcomes. We further incorporate survey insights from 229 authors, covering 312 papers, on resource usage and impact.

Study Design

Figure 1. Study Design. (A) We initially collected 34,828 papers from eight major computer science conferences, from which 6,517 FM-related papers were retrieved via the OpenReview API and the ACL ARR platform. We then extracted GPU-related information from their PDFs. In the survey study, we recruited 229 first-author FM researchers, representing 312 papers in total. Participants provided self-reported responses regarding computing resources when such information was not documented in their publications. Dotted boxes indicate potentially unavailable information. (B) Percentage of papers reporting a valid GPU type, by year and conference, and whether each conference's author and reviewer checklists contain guidelines for reporting computing resource usage. Note that ARR conferences (including ACL, EACL, EMNLP, and NAACL) use the same author and reviewer checklists, with slight modifications based on each conference's requirements. (C) GPU usage and FP16 TFLOPS compared between GPT-4o-scraped and self-reported survey data.

Results

Figure 2. Temporal Evolution of FMs. (A) FM papers as a proportion of published papers over time. (B) Evolution of FM papers across phases, methods, and domains. (C) Temporal evolution of GPU model distribution in LLM-extracted and self-reported data. "Pub." denotes publications from the scraped dataset, while "Await Publish" refers to papers that are under review, rejected, or in preparation for submission.
Figure 3. Distribution of FM Papers. We analyze FM papers across various dimensions: (A) Senior Author’s Affiliations in (A1) Academia and (A2) Industry; (B) Countries by Senior Author’s Affiliation; (C) Paper Count by LLM Usage; and (D) GPU Types used. The boxplots below panel (D) display GPU Number and TFLOPs across four categories: Affiliation, Phase, Method, and Domain. TFLOPS: Tera Floating‑Point Operations Per Second; FP16: 16‑bit floating‑point format (Note: * p < 0.001).
Figure 4. Funding Distribution of FM Research. Only 15.3% of FM papers report funding country and agency information in their manuscripts. (A) Distribution of funding by country across three categories: Government, Corporate, and Foundation. (B) Relationship between each country's GDP per capita and the number of funded papers. For academic and industry settings: (C) Relationship between available GPU resources and average number of papers produced. (D) Relationship between GPU resources and average citation count per paper.

Discussion

More Compute, Better Research?

Our study provides empirical evidence that increasing the number of GPUs does not inherently lead to higher research impact. This is important as unchecked expansion of computational requirements further exacerbates environmental concerns (Schwartz et al., 2020). Furthermore, the current landscape of FM research remains highly centralized, with China and the United States disproportionately dominating the field, as access to computing resources often serves as a fundamental prerequisite for participation (Lehdonvirta, 2024).

We note that many papers utilized multiple GPUs for different tasks, making it challenging to clearly categorize GPU numbers, types, and memory configurations. Consequently, our calculated FLOPs values may differ slightly from the actual computational resources reported in these studies.
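To make the FLOPs estimate above concrete, the sketch below shows one plausible way to aggregate peak FP16 throughput from a paper's reported GPU configuration. The function name and the approach are illustrative, not the paper's actual pipeline; the throughput figures are NVIDIA datasheet values for dense Tensor Core FP16 and are listed here only as assumptions.

```python
# Hypothetical sketch: summing peak FP16 TFLOPS over a paper's reported
# {GPU model: count} configuration. Datasheet values, dense (no sparsity).
PEAK_FP16_TFLOPS = {
    "V100": 125.0,  # NVIDIA V100, Tensor Core FP16
    "A100": 312.0,  # NVIDIA A100, Tensor Core FP16 (dense)
    "H100": 989.0,  # NVIDIA H100 SXM, Tensor Core FP16 (dense)
}

def paper_peak_tflops(gpus: dict) -> float:
    """Aggregate peak FP16 TFLOPS for a {gpu_model: count} mapping."""
    return sum(PEAK_FP16_TFLOPS[model] * count for model, count in gpus.items())

# e.g. a paper reporting 8x A100 plus 4x V100:
print(paper_peak_tflops({"A100": 8, "V100": 4}))  # 2996.0
```

A sum of peak throughputs is only an upper bound: it ignores utilization, training duration, and mixed-precision details, which is one reason such estimates can drift from the compute actually consumed.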

Open Reporting

While initiatives such as the computing statements required by some conferences acknowledge the role of computational resources, those resources remain insufficiently reported. Greater transparency in GPU usage (Bommasani et al., 2025) and recognition of computing resources, including GPU availability, storage, and human labor, are integral to evaluating AI research and to ensuring its long-term sustainability (Maslej et al., 2025).

While we quantified GPU usage and authorship, other resource costs are often overlooked. The cost of failed experiments is rarely acknowledged; research highlights successful outcomes, yet unsuccessful attempts are crucial to the progress of FM research. Furthermore, infrastructure costs, which vary between countries due to gross domestic product (GDP) and AI policies, are generally not considered.

Automated Evaluation

We relied on GPT-4o to extract and summarize detailed information from PDF files, a method which is susceptible to inaccuracies. We performed ten rounds of GPT-4o extraction and summarized the results through majority voting to minimize errors. Nevertheless, the extracted information may still contain inaccuracies, necessitating careful interpretation and validation.
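The majority-voting step described above can be sketched minimally as follows. This is an illustrative reimplementation, not the study's actual code; in practice the ten extracted values would first be normalized (e.g. "8 x A100" vs. "8xA100") before voting.

```python
from collections import Counter

def majority_vote(extractions):
    """Return the most frequent value across repeated LLM extraction rounds.

    Ties are broken by first occurrence (Counter preserves insertion order).
    """
    return Counter(extractions).most_common(1)[0][0]

# Ten hypothetical extraction rounds for one paper's GPU field:
rounds = ["8x A100", "8x A100", "4x A100", "8x A100", "8x A100",
          "8x A100", "4x A100", "8x A100", "8x A100", "8x A100"]
print(majority_vote(rounds))  # 8x A100
```

Voting suppresses occasional extraction errors but cannot correct a systematic misreading that appears in most rounds, which is why the extracted values still warrant validation.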

Research Team

Yuexing Hao

EECS, MIT & Cornell University

Yue Huang

CSE, University of Notre Dame

Haoran Zhang

EECS, MIT

Chenyang Zhao

CS, University of California, Los Angeles

Zhenwen Liang

CSE, University of Notre Dame

Paul Pu Liang

EECS, MIT

Yue Zhao

School of Advanced Computing, USC

Lichao Sun

CS, Lehigh University

Saleh Kalantari

Cornell University

Xiangliang Zhang

CSE, University of Notre Dame

Marzyeh Ghassemi

EECS, MIT