Evaluating Job Exposure to Large Language Models

Video Link: https://www.youtube.com/watch?v=Py5QDISpB7s



Duration: 25:43


GPTs are GPTs

Speaker: Daniel Rock

Summary
========
Daniel discusses the findings of their paper, Generative Pre-trained Transformers are General Purpose Technologies, and the broader implications of AI systems for the workforce. They address the question of whether AI systems will negatively impact employment and explore the idea that large language models, such as Generative Pre-trained Transformers (GPTs), can significantly transform the economy. The speaker takes an optimistic stance: they do not expect massive job displacement from these technologies. They observe a shift toward non-routine cognitive tasks and introduce a new approach to evaluating the exposure of tasks to large language models.

Using a new rubric-based approach on tasks from the O*NET database, we quantify the labor market impact potential of LLMs. With our rubric, both human annotators and GPT-4 assess tasks based on their alignment with the capabilities of LLMs and of complementary software that may be built on top of these models. Our findings reveal that between 61 and 86 percent of workers (for LLMs alone versus LLMs fully integrated with additional software) have at least 10 percent of their tasks exposed to LLMs. Additional software systems have the potential to increase the percentage of the U.S. workforce that has at least 10 percent of its work tasks exposed to the capabilities of LLMs by nearly 25 percent. We find that LLM impact potential is pervasive, that LLMs improve over time, and that complementary investments will be necessary to unlock their full potential. This suggests LLMs are general-purpose technologies. As such, LLMs could have considerable economic, societal, and policy implications, and their overall impacts are likely to be significantly amplified by complementary software.

Topics
=====

⃝ Impact of AI systems on employment
* The speaker believes that there will not be massive job displacement due to AI systems.
* There is a shift towards non-routine cognitive tasks.
* A new approach to evaluating the exposure of tasks to large language models is introduced.

⃝ Evaluation of exposure to large language models
* The speaker and their team used the O*NET database to evaluate the exposure of tasks.
* Tasks were categorized into ‘No Exposure (E0)’, ‘Exposure with LLM (E1)’, and ‘Exposure with LLM + other software (E2)’.
* Human annotators provided opinions about the exposure level of various tasks within jobs. These results were compared with GPT-4's predictions and used as the baseline for the evaluation. There is agreement between humans and GPT-4 when evaluating tasks at the occupation level.
* Approximately 80% of workers may have at least 10% of their tasks exposed to large language models.
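The rubric-based evaluation above can be sketched as a simple computation: given per-task exposure labels (E0, E1, E2) for each occupation, compute the share of tasks exposed under a narrow definition (E1 only: LLM alone) and a broad one (E1 + E2: LLM plus complementary software), then count how many occupations cross the 10% threshold. The data below is entirely made up for illustration; the label names follow the rubric described in the talk, but the structure and numbers are assumptions, not the authors' actual data or code.

```python
# Illustrative sketch (not the paper's code): task-level exposure shares
# per occupation from rubric labels, on hypothetical data.
# E0 = no exposure, E1 = exposed via LLM alone,
# E2 = exposed via LLM + other software built on top of it.
from collections import Counter

# Hypothetical O*NET-style task labels per occupation (made up).
occupation_tasks = {
    "Occupation A": ["E1", "E1", "E2", "E0", "E1"],
    "Occupation B": ["E0", "E0", "E0", "E0", "E2"],
    "Occupation C": ["E1", "E2", "E2", "E1", "E0"],
}

def exposure_share(labels, include_e2=False):
    """Fraction of an occupation's tasks labeled as exposed.

    include_e2=False counts only E1 (LLM alone);
    include_e2=True also counts E2 (LLM + complementary software).
    """
    exposed = {"E1", "E2"} if include_e2 else {"E1"}
    counts = Counter(labels)
    return sum(counts[e] for e in exposed) / len(labels)

# Share of occupations with at least 10% of tasks exposed,
# under the narrow (E1) and broad (E1 + E2) definitions.
for include_e2 in (False, True):
    shares = {occ: exposure_share(tasks, include_e2)
              for occ, tasks in occupation_tasks.items()}
    frac = sum(s >= 0.10 for s in shares.values()) / len(shares)
    name = "E1 + E2" if include_e2 else "E1 only"
    print(f"{name}: {frac:.0%} of occupations have >=10% of tasks exposed")
```

On this toy data, moving from the narrow to the broad definition raises the count of occupations over the threshold, mirroring the talk's point that complementary software amplifies exposure.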

⃝ Impact of automation on jobs
* Certain types of workers are more exposed to automation, assuming no significant change in the set of tasks associated with their job and insufficient resources for reskilling / upskilling.
* Automation can potentially improve job satisfaction by removing mundane tasks and shifting cognitive energy to more creative and demanding ones.
* Job descriptions (the lists of subtasks that make up a job) are dynamic, and creating the right infrastructure is important for the safe and responsible adoption of technology.
* The current study focuses on task-level assessment of exposure; future work could explore system-level exposure by including higher-level dependencies.
* It is difficult to advise on the necessary policy solutions, but the importance of evaluation and quality control is clear.







Tags:
deep learning
machine learning