How to Use AI and LLMs Like a Top Scientist in 2025

The landscape of artificial intelligence has undergone a remarkable transformation since the initial release of ChatGPT. By 2025, we find ourselves immersed in a rich ecosystem of Large Language Models (LLMs) and AI tools that extend far beyond the simple text-based interactions that once defined our relationship with these technologies.

If you're still limited to typing queries into ChatGPT and passively waiting for text responses, you're barely scratching the surface of what's possible. Visionaries such as Andrej Karpathy recently demonstrated on YouTube (https://www.youtube.com/watch?v=EWvNQjAaOHw) innovative ways to leverage these tools, opening new possibilities for research, programming, and creative work. In this post, we break down best practices and insights for harnessing AI and LLMs like a top scientist.

Who Is Andrej Karpathy?

andrej

Source:Wikipedia

Andrej Karpathy is a renowned figure in the field of artificial intelligence. A former founding member of OpenAI and Director of AI at Tesla, he has played a significant role in shaping the development and application of deep learning technologies. His academic background includes a Ph.D. from Stanford University. Throughout his career, he has consistently shared his knowledge through various platforms, including his widely-read blog, educational videos, and social media presence.

Part 1 - Harnessing LLMs and AI Tools

The modern AI toolkit is vast and varied. Below are several approaches to using LLMs and related AI tools, each tailored to different tasks and creative challenges.

1. Leveraging Internet Search

Modern LLMs come with integrated internet search functionalities that go beyond static knowledge. These tools can:

Aggregate and Process Web Content: By converting webpages into tokenized data, LLMs dynamically fetch the latest information, compensating for any pre-training knowledge gaps.
Automate Tedious Research: Instead of manually browsing multiple search engines, tools like ChatGPT (with search enabled), Grok, or Deep Seek can handle the heavy lifting, allowing you to quickly access and synthesize up-to-date insights.

AI summarizes daily news and draw news image.Source:ChatGPT

2. Using LLMs as Your Study Buddy

Enhance your learning and research experience by treating your LLM as a personalized study partner:

Document Summarization: Upload documents or refer to books and have the LLM summarize key chapters or sections.
Interactive Learning: Engage in a back-and-forth dialogue with the model to clarify concepts, test your understanding, or dive deeper into complex topics without manually scanning lengthy texts.

3. Deep Research Capabilities

DeepResearch

Source:Helicone

One of the most exciting applications of LLMs today is their ability to perform deep research. Unlike a simple prompt-response mode, deep research allows for:

Comprehensive Analysis: Some LLMs combine internet search with in-depth reasoning to process academic papers and web resources, producing outputs that resemble custom research papers.
Specialized Tools: Platforms such as ChatGPT, Perplexity, Grok, and Gemini excel at synthesizing information from diverse sources, making them indispensable for tackling advanced research projects.

4. Programming with LLMs

AI can now assist throughout the software development lifecycle. Consider these three programming approaches:

4.1 Basic Coding

Assisted Calculations: While language models primarily handle text, they may hallucinate on numerical calculations. Tools like ChatGPT invoke external interpreters (e.g., Python) for precise number crunching.
Data Analysis and Visualization: Prompt LLMs to generate data analysis programs, useful for quick stock evaluations or exploratory data analysis. However, always verify computational results.

4.2 Coding with Live Prototype

LLMs like Claude allow you to write, compile, and run code directly in your browser. This rapid prototyping environment is perfect for generating front-end applications and dynamic diagrams (e.g., Mermaid diagrams).

4.3 Autonomous Coding Agents

LLMs like Cursor act as autonomous coding agents, taking high-level instructions and generating executable code. This tool streamlines development, especially for low-level programming. cursoragent

Source:Cursor

Beyond Text: Multimodal Interactions

Today's AI systems extend well beyond text-based interactions. Embracing multimodality enhances accessibility and functionality:

1. Audio Interactions

Voice-Activated Commands: Many LLMs now support high-quality audio recognition and text-to-speech(in ChatGPT it's called Advanced Voice Mode), making interactions more intuitive and accessible—especially for seniors or those on the go.

Source:OpenAI

2. Visual Inputs with Images

Image Tokenization: Advanced models convert images into tokens, allowing users to upload visual data, analyze charts, or solve mathematical problems based on uploaded formulas.
Creative Generation: Tools like DALL·E 3 generate custom images for artistic projects or illustrative content.

3. Video Integration

Dynamic Understanding: Modern LLMs analyze video inputs, allowing real-time interaction with visual content—answering questions, translating, or extracting key insights.

Personalizing Your AI Experience

A powerful feature of today's LLMs is their ability to adapt to user preferences through customizable memory features:

Memory Banks: Platforms like ChatGPT offer memory banks to store personalized data and preferences.
Customized Settings: Tailor the model's behavior to your habits for a more efficient AI experience.

Overview of Leading LLM and AI Tools

Here's a brief rundown of some top tools shaping AI today:

Perplexity: Ideal for nuanced internet search and research tasks.
Grok: Offers flexible feedback and less restrictive interactions.
ChatGPT: The most comprehensive and capable LLM, per Andrej Karpathy.
NotebookLM: Integrates vast data sources for deep research and note generation.
DALL·E 3: OpenAI's advanced image generation model.
Google Veo-2, OpenAI Sora, Pika: Leading tools for video content generation.

Tips for Using LLMs Effectively

To maximize AI potential, follow these best practices:

Verify Sources: Cross-check responses to avoid misinformation.
Start Fresh: Begin a new chat session when shifting topics.
Be Model-Savvy: Choose the right model based on task complexity.
Embrace Deep Research Tools: Opt for LLMs designed for long-form analysis.

Part 2 - Understanding the Inner Workings of LLMs

The LLM Ecosystem

Since ChatGPT's debut, the ecosystem has expanded with models like Gemini, Co-pilot, Claude, and Grok. Tools like Chatbot Arena and the SEAL leaderboard benchmark their performance.

Tokenization and Context Windows

LLMs break text into tokens, forming a context window—the model's working memory. Managing this window effectively is crucial for optimal performance. contextwindow

Source:TechTarget

Limitations and "Zip File" Analogy

Think of an LLM as a massive, compressed zip file of internet knowledge. While efficient for retrieval, it may lack recent information unless supplemented with active tool use.

Practical Tips

Refresh Context Frequently: Begin new chats for focus.
Choose Models Wisely: Consider computational cost and task suitability.
Monitor Tool Use: Be aware of integrated web search or reasoning modules.
Opt for Deep Research: Use tools designed for long-form analysis.

The Future: Handling Audio Natively

The next frontier involves native audio handling, breaking sound into frequency-based tokens for seamless voice interactions. Features like ChatGPT's advanced voice mode enable real-time conversations, blurring the line between human and machine.

To Be Continued

By 2025, AI and LLMs have evolved into a comprehensive suite of tools catering to all aspects of life. Understanding their mechanics and exploring innovative applications—from deep research to multimodal interactions—unlocks their full potential.

Happy exploring!