InfoQ Live Logo

The Software Architects' Newsletter
October 2024
View in browser

Welcome to the InfoQ Software Architects’ Newsletter! We bring you essential news and experience on emerging patterns and technologies from industry peers each month.

This month, we focus on "The AI Shift: Evolving Roles for Software Architects". Roles, patterns, and practices from this topic span the entire "diffusion of innovation" graphs in our 2024 reports: InfoQ Software Architecture and Design Trends Report and InfoQ AI, ML, and Data Engineering Trends Report. We also cover this topic in our Generally AI podcast series with Anthony Alford and Roland Meertens.

For example, data continues to be a major force in architectural decisions. Complex analytical platforms and ML models are no longer considered secondary components as they shift toward core parts of transactional systems. AI Agents, like coding assistants, will also see more adoption, especially in corporate application development settings.

Key challenges remain. Large language models (LLMs) have become a common feature in nearly every corner of the industry, but significant innovation opportunities remain to take LLMs beyond glorified chatbots. AI safety and security will also continue to be important in the overall management lifecycle of language models.

News

OpenAI Developer Day 2024 (SF) Announces Real-Time API, Vision Fine-Tuning, and More

On October 1, OpenAI SF DevDay 2024 introduced several new features and hosted workshops, breakout sessions, and demos. New features unveiled include a Real-Time API with function calling, vision-fine tuning, distillation, and prompt caching.

The Real-Time API allows for persistent WebSocket connections, enabling real-time voice interactions. This capability is crucial for applications that require instantaneous responses, like virtual assistants and real-time translation services. The API allows developers to send and receive JSON-formatted events, representing various interaction elements such as text, audio, function calls, and interruptions.

Setting up a Data Mesh Organization

According to Matthias Patzak, a data mesh organization comprises producers, consumers, and the platform. Patzak talked about data mesh platforms at FlowCon France, where he stated that the platform team’s mission is to make the producers’ and consumers’ lives simple, efficient, and stress-free. Data must be discoverable, understandable, trustworthy, and securely and easily shared across the organization.

NVIDIA Unveils NVLM 1.0: Open-Source Multimodal LLM with Improved Text and Vision Capabilities

NVIDIA unveiled NVLM 1.0, an open-source multimodal large language model (LLM) that performs on both vision-language and text-only tasks. NVLM 1.0 shows improvements in text-based tasks after multimodal training, standing out among current models. The model weights are now available on Hugging Face, with the training code set to be released shortly.

Hugging Face Upgrades Open LLM Leaderboard v2 for Enhanced AI Model Comparison

Hugging Face has recently released Open LLM Leaderboard v2, an upgraded version of their popular benchmarking platform for large language models.

The leaderboard serves multiple purposes for the AI community. It helps researchers and practitioners identify state-of-the-art open-source releases by providing reproducible scores that separate marketing claims from actual progress. It allows teams to evaluate their work, whether pre-training or fine-tuning, by openly comparing methods against the best existing models. Additionally, it provides a platform for earning public recognition for advancements in LLM development.

PyTorch Conference 2024: PyTorch 2.4/Upcoming 2.5, and Llama 3.1

The Linux Foundation recently hosted the PyTorch Conference 2024 in Fort Mason, San Francisco. The conference showcased the latest advancements in PyTorch 2.4 and Llama 3.1 and some upcoming changes in PyTorch 2.5. Matt White, executive director of the PyTorch Foundation and GM of AI at the Linux Foundation, opened the conference on Day 1 by highlighting the importance of open-source initiatives in advancing responsible generative AI.

Case Study

InfoQ AI, ML, and Data Engineering Trends Report—September 2024

The InfoQ Trends Reports offer InfoQ readers a comprehensive overview of emerging trends and technologies in the areas of AI, ML, and Data Engineering. This report summarizes the InfoQ editorial team’s podcast with external guests to discuss the trends in AI and ML and what to look out for in the next 12 months. In conjunction with the report and trends graph, an accompanying podcast features insightful discussions of these trends.

The key takeaways from this year’s report included:

  • The future of AI is open and accessible. We’re in the age of LLM and foundation models. Most of the models available are closed-source, but companies like Meta are trying to shift the trend toward open-source models.
  • Retrieval Augmented Generation (RAG) will become more important especially for applicable use cases of LLMs at scale.
  • AI-powered hardware will get much more attention with AI-enabled GPU infrastructure and AI-powered PCs.
  • Due to the constraints in infrastructure setup and management costs of LLMs, small language models (SLMs) will see more exploration and adoption.
  • Small language models are also excellent for edge computing-related use cases that run on small devices.
  • AI Agents, like coding assistants, will also see more adoption, especially in corporate application development settings.
  • AI safety and security will continue to be important in the overall management lifecycle of language models. Self-hosted models and open-source LLM solutions can help improve the AI security posture.
  • Another important aspect of the LLM lifecycle is LangOps or LLMOps, which help support the models after deploying them to production.

This content is an excerpt from a recent InfoQ article by Srini Penchikala et al., "InfoQ AI, ML, and Data Engineering Trends Report - September 2024".

To get notifications when InfoQ publishes content on these topics, follow "AI, ML & Data Engineering", "Machine Learning", and "Generative AI" on InfoQ.


Missed a newsletter? You can find all of the previous issues on InfoQ.

Sponsored

Evolving the Agile Organization with Evidence-Based Management - Sponsored by Scrum.org

A fundamental element of Scrum is empirical process; the idea that complex problems require real experience to effectively plan and deliver value. Evidence-Based Management (EBM) is a set of ideas and practices that describe broad measurement areas used to provide an effective, empirical, and value-based approach to any product. This Focus Area - presented by Scrum.org - describes what EBM is and how to apply it to any product. Discover the practices that comprise EBM and how to use them to enable a business-driven, value-based empirical process.

Read the guide “Evolving the Agile Organization with Evidence-Based Management”, sponsored by Scrum.org

Upcoming Events

QCon: For practitioners, by practitioners

QCon San Francisco 2024, November 18-22

Discover real-world insights and practical solutions at QCon San Francisco 2024. Understand the practices, skills, and trends that matter most in software to drive your projects and team’s growth. Gain proven approaches from those leading change and innovation. Last chance to save!


QCon London 2025, April 7-11

QCon London will bring together senior developers and architects for deep dives into the latest emerging trends, best practices, and use cases. The 15 tracks include AI, ML, FinTech, modern architectures, security, leadership, and more! Save with our early bird pricing.


InfoQ Dev Summit Boston 2025, June 9-10

Join senior developers in Boston for insights on AI, scalable architectures, and more. Early bird pricing available.

About InfoQ

Senior software developers rely on the InfoQ community to keep ahead of the adoption curve. One of the main reasons software architects and engineers tell us they keep coming back to InfoQ is because they trust the information provided and selected by their peers.

We’ve been helping software development teams adopt new technologies and practices for over 19 years through InfoQ articles, news items, podcasts, tech talks, trends reports, and QCon software development conferences.

We hope you find this newsletter useful. If not, you can unsubscribe using the link below.

Unsubscribe

Forwarded email? Subscribe and get your own copy.

Subscribe

Follow InfoQ on:

You have received this email because you subscribed to "The Architects' Newsletter". To stop receiving the Architects' Newsletter, please click the following link: Unsubscribe

- - -

C4Media Inc. (InfoQ.com), 705-2267 Lake Shore Blvd. West,
Toronto, Ontario, Canada, M8V 3X2