Author: Richard

New Study Questions Results of Apple’s LLM ‘Reasoning Breakdown’ Investigation

**The Deception of Thought: A Critical Analysis of Apple’s AI Research and Its Counterargument**

Apple’s latest AI research publication, “The Deception of Thought,” has generated considerable debate within the AI field due to its claim that even the most sophisticated Large Reasoning Models (LRMs) struggle with complex tasks. Nonetheless, this perspective has been contested by Alex Lawsen, a researcher at Open Philanthropy, who released a counterargument titled “The Deception of the Deception of Thought.” Lawsen contends that the problems emphasized in Apple’s findings arise from experimental design weaknesses rather than fundamental limitations in the reasoning abilities of LRMs.

### The Counterargument: Less “Deception of Thought,” More “Deception of Assessment”

Lawsen’s analysis recognizes that while LRMs do encounter difficulties with intricate planning tasks, Apple’s conclusions misinterpret the motivations behind these difficulties. He pinpoints three main flaws in Apple’s experimental design:

1. **Token Budget Constraints Overlooked**: Lawsen indicates that Apple’s assertion of model “collapse” during the Tower of Hanoi challenges with 8 or more disks neglects the fact that models such as Claude were hitting their token output limits. He refers to instances where models explicitly mention truncating outputs to save tokens.

2. **Classifying Unsolvable Puzzles as Failures**: In Apple’s River Crossing assessment, some puzzles were unsolvable due to mathematical limitations (e.g., exceeding the boat’s capacity). Lawsen argues that models were penalized for acknowledging these impossibilities and opting not to pursue a solution.

3. **Evaluation Algorithms Mislabeling Outputs**: Apple’s evaluation was dependent on automated systems that assessed models primarily based on complete move sequences. This methodology did not consider circumstances where tasks surpassed token limits, resulting in unjust classifications of partial or strategic outputs as complete failures.

### Alternative Testing: Allow the Model to Generate Code Instead

To support his argument, Lawsen executed a subset of the Tower of Hanoi tests using an alternative method: requesting models to create a recursive Lua function to produce the solution rather than detailing all moves. The outcomes demonstrated that models such as Claude, Gemini, and OpenAI’s o3 successfully generated correct algorithmic solutions for 15-disk challenges, countering Apple’s assertions of zero success at that complexity level.

Lawsen concludes that when artificial output restrictions are lifted, LRMs exhibit a strong capability for reasoning about complex tasks, especially in terms of algorithm generation.

### The Significance of This Dispute

This conversation goes beyond simple academic critique; it has greater consequences for the comprehension of LLMs’ reasoning skills. Apple’s paper has been extensively referenced as proof that current LLMs lack scalable reasoning capabilities. Lawsen’s counterargument implies a more intricate reality: while LLMs encounter difficulties with long-form token counting under current constraints, their reasoning mechanisms may be more resilient than previously suggested.

Lawsen acknowledges that genuine algorithmic generalization remains a hurdle and stresses the necessity for forthcoming research to concentrate on:

1. Creating evaluations that distinguish between reasoning capability and output limitations.
2. Confirming the solvability of puzzles prior to evaluating model performance.
3. Employing complexity metrics that mirror computational difficulty rather than simply solution length.
4. Considering various solution representations to differentiate algorithmic comprehension from execution.

The essence of Lawsen’s argument is that before branding the reasoning capabilities of LRMs as inherently flawed, it is crucial to reevaluate the evaluation criteria that are being utilized.

In conclusion, the ongoing discourse regarding Apple’s research and Lawsen’s counterargument underscores the significance of meticulous experimental design and evaluation in AI research. It advocates for a more nuanced understanding of LLM capabilities and the conditions under which they are assessed.

Read More
Apple Tackles Significant Passkey Problems in iOS 26 Update

Apple’s forthcoming OS updates are poised to unveil a much-anticipated functionality designed to improve the usability of passkeys across various platforms and applications. This innovative feature will enable users to effortlessly and securely export and import passkeys, tackling a major limitation that has impeded the uptake of this security solution.

Traditionally, passkeys generated on Apple devices such as Macs, iPhones, and iPads were restricted to the Apple ecosystem. Although they could sync between iCloud-connected devices, moving them to other platforms like Windows or Android, or to third-party credential managers, was not possible. This limitation sparked worries about vendor lock-in, as users faced challenges in accessing their passkeys if they changed devices or lost connection to their Apple hardware.

The issue of portability is not exclusive to Apple; it has been a common challenge within the tech industry. The FIDO Alliance, which comprises key stakeholders like Google, Microsoft, and various password management services, has been striving to create secure methods for diverse platforms and applications to interact while maintaining the security that passkeys offer.

With the backing of the FIDO Alliance, Apple is now implementing native support for passkey import and export. This functionality will be part of the upcoming iOS 26, macOS Tahoe 26, iPadOS 26, and visionOS 26 updates. Notably, this new system will not only ease the transfer of passkeys but will also permit the secure movement of passwords and verification codes.

A major enhancement in this feature is the transfer method. In contrast to conventional password exports that frequently involve unencrypted files, the new approach aims to be end-to-end encrypted. Transfers will take place directly between credential manager apps or from the system keychain to an application, needing local authentication, such as Face ID or Touch ID, to kick off the process. This method mitigates the risks associated with keeping sensitive data in export files.

Apple’s demonstration highlighted that this feature enables users by giving them increased control over their data and the option to select their preferred credential manager. This signifies a significant transformation in Apple’s strategy, moving away from the closely integrated Keychain ecosystem that has defined its services.

The launch of this feature later this year is anticipated to ease concerns about ecosystem lock-in, motivating more users to embrace passkeys as a secure alternative to conventional passwords. For those curious about the technical details, comprehensive information is available in Apple’s “What’s new in passkeys” session on the Apple Developer website.

Read More
iOS 26 Design Improvements Boost Appeal for iPhone 17 Air

After months of speculation, Apple has officially presented its major iOS 26 ‘Liquid Glass’ redesign, already generating excitement for the upcoming iPhone 17 Air. Here’s the reason why.

### Liquid Glass Integrates iPhone Hardware and Software More Cohesively Than Ever

The new Liquid Glass design of iOS 26 and Apple’s other platforms signifies a long-term transformation. As Craig Federighi mentioned in the keynote, this represents a once-in-a-decade transition for Apple, laying the groundwork for years of future hardware advancements.

The more one reflects on the new design of iOS 26, the more enthusiasm grows for the iPhone 17 Air. By the time iOS 26 launches this fall, Apple will have just introduced its new ultra-slim iPhone model, which was certainly crafted with Liquid Glass in mind.

Designer Sebastiaan de With effectively expressed the idea well before WWDC, noting that Apple’s interface finally reflects the stunning material qualities of its devices. With all device surfaces featuring glass screens, this fresh interface provides a corresponding material, allowing users to feel the glass itself coming to life.

With Liquid Glass, iOS 26 looks to harmoniously combine the iPhone’s hardware and software, establishing a closer relationship than ever before. This combination is anticipated to excel on the ultra-sleek new iPhone 17 Air.

### Why iPhone 17 Air Might Be the Ideal Match for iOS 26

During the WWDC keynote, as Federighi demonstrated iOS 26 on his iPhone, it became clear that the current iPhone design appeared antiquated for the new software. The thick, cumbersome design of existing models does not appear to align well with Liquid Glass.

Liquid Glass seems to be designed for devices where the display serves as the primary design feature, embodying the envisioned “single slab of glass” design that Jony Ive has pursued for years. This dream is anticipated to be fully manifested in 2027 with a unique 20th anniversary ‘all-screen’ iPhone, but for this year, the iPhone 17 Air appears to be the best realization of this concept.

Liquid Glass will undoubtedly represent a refreshing change for all iPhone models, yet the iPhone 17 Air, with its dramatically sleek design, could reflect a glimpse of the future today. The design of iOS 26 and Liquid Glass is tailored for slimmer, lighter iPhones, which is precisely what the iPhone 17 Air aims to offer.

### Conclusion

The excitement around iOS 26’s Liquid Glass design and its synergy with the iPhone 17 Air underscores a significant shift in Apple’s design approach. The seamless integration of hardware and software promises to elevate user experience, making the forthcoming iPhone model more enticing than ever.

Read More
New Leak Uncovers Galaxy Tab S11 Ultra Featuring Upgraded MediaTek Premium Processor

Samsung’s Galaxy Tab S11 Ultra has been recently detected in a Geekbench performance evaluation, showcasing some fascinating specifications. The tablet is reported to be equipped with MediaTek’s Dimensity 9400 Plus SoC, 12GB of RAM, and operates on Android 16 with One UI 8. In spite of the robust chip, it received lower scores than smartphones utilizing the same SoC, scoring 5,312 in multi-core and 1,420 in single-core assessments.

Rumors indicate that Samsung will bring back the base model in the Tab S11 lineup, eliminating the Plus variant, and the Ultra edition might feature a marginally bigger battery. The Dimensity 9400 Plus chip comprises a Big Core architecture with an Arm Cortex-X925 core clocked at 3.73GHz and a 12-core Arm Immortalis-G925 GPU, ensuring improved performance and graphics.

The Galaxy Tab S11 Ultra is anticipated to be unveiled following Samsung’s summer Unpacked event, likely in the autumn.

Read More
Apple Unveils Worldwide In-Person Meetings to Delve into WWDC25 Enhancements Thoroughly

Apple is taking WWDC25 on the go. After concluding its week-long developer conference, the company has arranged a global series of in-person events and online meetings to provide developers, designers, and product managers with an in-depth look at what’s new. Here’s how you can sign up.

Labeled as “Discover the major updates from WWDC25,” these sessions are intended to showcase key technologies revealed during the conference, including enhancements to Apple Intelligence, visionOS, developer tools, and cross-platform app design.

The in-person events are accessible to members of the Apple Developer Program and will be held in English, while the online sessions will take place in local languages, such as Mandarin, Spanish, and Brazilian Portuguese.

These sessions go beyond mere technical summaries. They are crafted to encourage dialogue about how the new features can influence the future of apps and games. Participants can look forward to a combination of presentations, live demonstrations, and Q&A sessions with Apple experts.

### A worldwide journey

The in-person tour begins on June 19 with simultaneous events in Singapore, Ho Chi Minh City, and Jakarta, before moving on to London on June 23.

After that, Apple will keep hosting sessions in cities like Paris, Stockholm, Toronto, Tokyo, Berlin, New York City, Madrid, Warsaw, São Paulo, and more.

Extra events will take place throughout July in locations such as Bengaluru, Gurugram, Mexico City, Istanbul, Vancouver, and Miami.

Numerous countries will also feature online sessions and one-on-one consultations, covering in-depth looks into Apple Vision Pro, App Review guidance, and localized tech onboarding, with special availability for teams based in Shanghai, Seoul, Cupertino, and Tokyo.

You can check

Read More
Transforming Apple Intelligence into an Independent AI Chatbot Without Utilizing ChatGPT

With WWDC 2025 now concluded, numerous Apple enthusiasts found themselves disheartened by the absence of newer Apple Intelligence features. Although Cupertino is introducing some fresh AI functionalities to the platform, the most eagerly awaited one remains uncertain.

This is the reason Apple executives have discussed in several interviews why the new Siri has yet to be launched. While it was first anticipated to debut alongside iOS 18.4, Bloomberg’s Mark Gurman now indicates that Apple is targeting an iOS 26.4 release.

While this may be even more disheartening for users (especially since OpenAI and other rivals continue to advance their AI models at an unprecedented pace), there is an opportunity to utilize Apple Intelligence as a genuine AI chatbot (and I’m not referring to using ChatGPT for this purpose).

This feature is still only accessible as part of the new capabilities arriving with iOS 26 and macOS Tahoe, which are presently in the initial stages of beta testing.

An Apple Intelligence chatbot is concealed within iOS 26 and macOS Tahoe

As highlighted by my good friend and MacWorld journalist Filipe Espósito on Threads, it is possible to operate your own Apple Intelligence chatbot if you have the iOS 26 or macOS Tahoe beta installed on your devices.

The Apple Intelligence chatbot can be fueled by Apple’s AI features within the new Shortcuts app. As explained by Espósito, you can select between the on-device model, which is less potent, or the Private Cloud Compute option, which functions online and performs quite well as a chatbot.

Read More
“Pebble Smartwatch is Set to Relaunch Next Month Featuring an Enhanced App”

The Pebble rejuvenation project, led by original creator Eric Migicovsky, aims to bring smartwatches back to users through the RePebble initiative. The endeavor is approaching a crucial landmark, with the Core 2 Duo watches getting ready for mass production, and pre-orders are slated to ship in July and August. Core Devices, the fresh company behind this initiative, is also unveiling a companion app for iOS and Android, which will be compatible with both new and previous Pebble watches.

Pebble, previously celebrated for its budget-friendly and durable smartwatches, is being revitalized by Migicovsky under Core Devices. The Core 2 Duo watch is on the verge of mass production, with 200 units already completed. Pre-orders are set to be fulfilled by the end of summer, with some beta testers possibly receiving their watches sooner. Furthermore, the Core Time 2 watch is advancing toward production, with engineering samples anticipated shortly.

The RePebble initiative gains from Google’s choice to open-source PebbleOS, enabling both new purchasers and current Pebble users to explore the software. This action could revitalize older Pebble devices, as enthusiasts can find PebbleOS on GitHub.

Migicovsky invites involvement in the beta testing of the new app, which works with older Pebble models. Interested participants can register for the beta test, although Migicovsky notes that the project is most suitable for those willing to give comprehensive feedback without public criticism.

For individuals who have placed pre-orders for the Core 2 Duo, an email confirmation will be dispatched soon to finalize shipping information, including payment of any applicable duties or taxes. The Pebble revival is nearing completion, and new Core Devices watches are available for pre-order now.

Read More
T-Mobile Provides Complimentary Motorola Razr Plus Without the Need for Trade-In

Add a line, receive the phone.

(Image credit: Derrek Lee / Android Central)

T-Mobile offers a variety of fantastic deals, but there’s one exclusive offer that allows you to obtain a new Motorola Razr Plus (2025) at no cost. Just visit the carrier and add a line under the Experience More, Experience Beyond, Go5G Plus, or Go5G Next plan, and you’ll qualify for up to $1,000 in promotional credits over 24 months when you purchase any Motorola Razr (2025) model.

This amount can fully cover the cost of the standard Razr and Razr Plus, or if you prefer the premium Motorola Razr Ultra (2025), the price will be reduced to just $299.99. Regardless of the model you select, you’ll be getting one of the <a data-analytics-id="inline-link" href="https://www.androidcentral.com/phones/motorola/the-best-motorola-razr-razr-plus-razr-ultra-2025-deals" data-before-rewrite-localise="https://www.androidcentral.com/phones/motorola/the-best-motorola-razr-razr-plus-razr-ultra-2025

Read More
Apple Unveils iOS 26 Beta 1 Build for Developers

Apple has recently introduced iOS 26, rolling out the initial developer beta along with an updated build for certain iPhone models just days following the first release. This quick succession of beta versions is rather atypical for a significant OS upgrade, reflecting Apple’s proactive stance on tackling potential challenges.

The inaugural version of iOS 26 beta 1 was marked by the version number 23A5260n. The newly launched build, 23A5260u, exhibits only a minor change in its version number, implying that there might not be any considerable changes for users. It is rumored that Apple might have discovered a serious bug, security issue, or unintended code related to upcoming products, which has led to the issuance of this revised build.

At present, the updated version of iOS 26 beta is available solely for the iPhone 16 and iPhone 15 models, meaning that users with older iPhone variants will not benefit from this update. As testing progresses, additional information and modifications within the iOS 26 beta will be reported as they develop.

For users contemplating the installation of the iOS 26 beta, expert advice is accessible regarding the potential hazards and advantages of utilizing beta software on primary devices. Users who have already implemented the beta are encouraged to share their feedback, aiding the community in comprehending the new operating system.

Alongside the beta update, conversations about the finest iPhone accessories persist as a point of interest among users, enriching the overall iPhone experience.

Read More