Everything you’d like to know about visionOS development

If you’re venturing into the realm of developing applications for Apple Vision Pro, it’s crucial to equip yourself with the right knowledge. In this article, we unravel the key aspects you need to know about the visionOS operating system, the secrets of programming for Apple Vision Pro, and the essential tools required for app development.

visionOS: The Heart of Apple Vision Pro

The foundation of the Vision Pro headset lies in the sophisticated visionOS operating system. Tailored for spatial computing, visionOS seamlessly merges the digital and physical worlds to create captivating experiences. Drawing from Apple’s established operating systems, visionOS introduces a real-time subsystem dedicated to interactive visuals on Vision Pro. This three-dimensional interface liberates apps from conventional display constraints, responding dynamically to natural light.

At launch, visionOS will support a variety of apps, including native Unity apps, Adobe’s Lightroom, Microsoft Office, medical software, and engineering apps. These applications will take advantage of the unique features offered by visionOS to deliver immersive and engaging user experiences.

Programming Secrets for Apple Vision Pro

Programming for Apple Vision Pro involves understanding the concept of spatial computing and the shared space where apps coexist. In this floating virtual reality, users can open windows, each appearing as planes in the virtual environment. These windows support both traditional 2D views and the integration of 3D content. Here are some programming “secrets” for Apple Vision Pro:

  • All apps exist in 3D space, even if they are basic 2D apps ported from iOS.
  • Consider the Field of View and opt for a landscape screen for user-friendly experiences.
  • Prioritize user comfort and posture by placing content at an optimal distance.
  • Older UIKit apps can be recompiled for VisionOS, gaining some 3D presence features.
  • Be mindful of users’ physical surroundings to ensure a seamless and comfortable experience.

Tools for Apple Vision Pro Development

To initiate the development of applications for Vision Pro, you’ll need a Mac computer running macOS Monterey or a newer version. Additionally, you’ll require the latest release of Xcode and the Vision Pro developer kit.

The development process entails downloading the visionOS SDK and employing familiar tools such as SwiftUI, RealityKit, ARKit, Unity, Reality Composer Pro, and Xcode, which are also utilized for constructing applications on other Apple operating systems.

While it’s feasible to adapt your existing apps for Vision Pro using the visionOS SDK, be prepared for some adjustments in code to accommodate platform differences. Most macOS and iOS apps seamlessly integrate with Vision Pro, preserving their appearance while presenting content within the user’s surroundings as a distinct window.

Now, let’s delve into the essentials for assembling your own Apple Vision Pro development kit:

  • SwiftUI: Ideal for creating immersive experiences by overlaying 3D models onto the real world.
  • Xcode: Apple’s integrated development environment, vital for app development and testing.
  • RealityKit: Exclusively designed for Vision Pro, enabling the creation of lifelike, interactive 3D content.
  • ARKit: Apple’s augmented reality framework for overlaying digital content onto the real world.
  • Unity: A powerful tool for visually stunning games and Vision Pro app development.

Unity is currently actively developing its SDK to interface with Apple Vision Pro. What’s the catch? Few people know that to develop on Unity, you need not just any Mac, but a Mac with an “M” processor on board!

Here are a few more words about supported versions:

  • Unity 2022 LTS (2022.3.191 or newer): Apple Silicon version only.
  • Xcode 15.2: Note that beta versions of Xcode are a no-go.
  • VisionOS 1.0.3 (21N333) SDK: Beta versions are not supported.
  • Unity editor: Apple Silicon Mac and the Apple Silicon macOS build are in; the Intel version is out.

Pay attention to these restrictions during your development journey!

Apple Vision Pro SDK: Empowering Developers

The visionOS Software Development Kit (SDK) is now available, empowering developers to create groundbreaking app experiences for Vision Pro. With tools like Reality Composer Pro, developers can preview and prepare 3D models, animations, and sounds for stunning visuals on Vision Pro. The SDK ensures built-in support for accessibility features, making spatial computing and visionOS apps inclusive and accessible to all users.

As Apple continues to lead the way in spatial computing, developers hold the key to unlocking the full potential of the Vision Pro headset. By understanding the intricacies of visionOS, programming secrets, essential development tools, and the application process for the developer kit, you can position yourself at the forefront of this revolutionary technological landscape.

Latest Articles

September 10, 2025
Immersive Technology & AI for Surgical Intelligence – Going Beyond Visualization

Immersive XR Tech and Artificial Intelligence are advancing MedTech beyond cautious incremental change to an era where data-driven intelligence transforms healthcare. This is especially relevant in the operating room — the most complex and high-stakes environment, where precision, advanced skills, and accurate, real-time data are essential. Incremental Change in Healthcare is No Longer an Option Even in a reality transformed by digital medicine, many operating rooms still feel stuck in an analog past, and while everything outside the OR has moved ahead, transformation has been slow and piecemeal inside it. This lag is more pronounced in complex, demanding surgeries, but immersive technologies convert flat, two-dimensional MRI and CT scans into interactive 3D visualizations. Surgeons now have clearer spatial insight as they work, which reduces the risk of unexpected complications and supports better overall results. Yet, healthcare overall has changed only gradually, although progress has been made over the course of decades. Measures such as reducing fraud, rolling out EMR, and updating clinical guidelines have had limited success in controlling costs and closing quality gaps. For example, the U.S. continues to spend more than other similarly developed countries. Everything calls for a fundamental rethinking of how healthcare is structured and delivered. Can our healthcare systems handle 313M+ surgeries a year? Over 313 million surgeries will likely be performed every year by 2030, putting significant pressure on healthcare systems. Longer waiting times, higher rates of complications, and operating rooms stretched to capacity are all on the rise as a result. Against this backdrop, immersive XR and artificial intelligence are rapidly becoming vital partners in the OR. They turn instinct-driven judgement into visual data-informed planning, reducing uncertainty and supporting confident decision-making. The immediate advantages are clear enough: shorter time spent in the operating room include reduced operating-room time and lower radiation exposure for patients, surgeons, and OR staff. Just as critical, though less visible, are the long-term outcomes. Decreased complication rates and a lower likelihood of revision surgeries are likely to have an even greater impact on the future of the field. These issues have catalyzed the rise of startups in surgical intelligence, whose platforms automate parts of the planning process, support documentation, and employ synthetic imaging to reduce time spent in imaging suites. Synthetic imaging, for clarity, refers to digitally generated images, often created from existing medical scans, that enrich diagnostic and interpretive insights. The latest breakthroughs in XR and AI Processing volumetric data with multimodal generative AI, which divides volumes into sequences of patches or slices, now enables real-time interpretation and assistance directly within VR environments. Similarly, VR-augmented differentiable simulations are proving effective for team-based surgical planning, especially for complex cardiac and neurosurgical cases. They integrate optimized trajectory planners with segmented anatomy and immersive navigation interfaces. Organ and whole-body segmentation, now automated and fast, enables multidisciplinary teams to review patient cases together in XR, using familiar platforms such as 3D Slicer. Meanwhile, DICOM-to-XR visualization workflows built on surgical training platforms like Unity and UE5 have become core building blocks to a wave of MedTech startups that proliferated in 2023–2024, with further integrations across the industry. The future of surgery is here The integration of volumetric rendering and AI-enhanced imaging has equipped surgeons with enhanced visualization, helping them navigate the intersection of surgery and human anatomy in 2023. Such progress led to a marked shift in surgical navigation and planning, becoming vital for meeting the pressing demands currently facing healthcare systems. 1) Surgical VR: Volumetric Digital Twins Recent clinical applications of VR platforms convert MRI/CT DICOM stacks into interactive 3D reconstructions of the patient’s body. Surgeons can explore these models in detail, navigate them as if inside the anatomy itself, and then project them as AR overlays into the operative field to preserve spatial context during incision. Volumetric digital twins function as dynamic, clinically vetted, and true-to-size models, unlike static images. They guide trajectory planning, map procedural risks, and enable remote team rehearsals. According to institutions using these tools, the results include clearer surgical approaches, reduced uncertainty around critical vasculature, and greater confidence among both surgeons and patients. These tools serve multidisciplinary physician teams, not only individual users. Everyone involved can review the same digital twin before and during surgery, working in tight synchronization without the risk of mistakes, especially in complex surgeries such as spinal, cranial, or cardiovascular cases. These pipelines also generate high-fidelity, standardized datasets that support subsequent AI integration, as they mature. Automated segmentation, predictive risk scoring, and differentiable trajectory optimizers can now be layered on top, transforming visual intuition into quantifiable guidance and enabling teams to leave less to chance, delivering safer and less invasive care. The VR platform we built for Vizitech USA serves as a strong example within the parallel and broader domain of healthcare education. VMed-Pro is a virtual-reality training platform built to the standards of the National Registry of Emergency Medical Technicians; the scenarios mirror real-world protocols, ensuring that training translates directly to clinical practice. Beyond procedural skills, VMed-Pro also reinforces core medical concepts; learners can review anatomy and physiology within the context of a virtual patient, connecting textbook knowledge to hands-on clinical judgment. 2) Surgical AR: Intra-operative decision making Augmented reality for surgical navigation combines real-time image registration, AI segmentation, ergonomically designed head-worn glasses, and headsets to convert preoperative DICOM stacks into interactive holographic anatomy, giving surgeons X-ray visualization without diverting gaze from the field – a true Surgical Copilot right in the OR. AI-driven segmentation and computer-vision pipelines generate metric-accurate volumetric models and annotated overlays that support trajectory planning, instrument guidance, and intraoperative decision support. Robust spatial registration and tracking (marker-based or depth-sensor aided) align holograms with patient anatomy to submillimetre accuracy, enabling precise tool guidance and reduced reliance on fluoroscopy. Lightweight AR hardware, featuring hand-tracking and voice control, preserves surgeon ergonomics and minimizes distractions. Cloud and on-premises inference options balance latency and computational power to enable real-time assistance. Significant industry investment and agile startups have driven integration with PACS, navigation systems, and multi-user XR sessions, enhancing preoperative rehearsal and team…

June 27, 2025
Methodology of VR/MR/AR and AI Project Estimation

Estimation of IT projects based on VR, XR, MR, or AI requires both a deep technical understanding of advanced technologies and the ability to predict future market tendencies, potential risks, and opportunities. In this document, we aim to thoroughly examine estimation methodologies that allow for the most accurate prediction of project results in such innovative fields as VR/MR/AR and AI by describing unique approaches and strategies developed by Qualium Systems. We strive to cover existing estimation techniques used at our company and delve into the strategies and approaches that ensure high efficiency and accuracy of the estimation process. While focusing on different estimation types, we analyze the choice of methods and alternative approaches available. Due attention is paid to risk assessment being the key element of a successful IT project implementation, especially in such innovative fields as VR/MR/AR and AI. Moreover, the last chapter covers the demo of a project of ours, the Chemistry education app. We will show how the given approaches practically affect the final project estimation. Read

June 27, 2025
What Are Spatial Anchors and Why They Matter

Breaking Down Spatial Anchors in AR/MR Augmented Reality (AR) and Mixed Reality (MR) depend on accurate understanding of the physical environment to create realistic experiences, and they hit this target with the concept of spatial anchors. These anchors act like markers, either geometric or based on features, that help virtual objects stay in the same spot in the real world — even when users move around. Sounds simple, but the way spatial anchors are implemented varies a lot depending on the platform; for example, Apple’s ARKit, Google’s ARCore, and Microsoft’s Azure Spatial Anchors (ASA) all approach them differently. If you want to know how these anchors are used in practical scenarios or what challenges developers often face when working with them, this article dives into these insights too. What Are Spatial Anchors and Why They Matter A spatial anchor is like a marker in the real world, tied to a specific point or group of features. Once you create one, it allows for some important capabilities: Persistence. Virtual objects stay exactly where you placed them in the real-world, even if you close and restart the app. Multi-user synchronization. Multiple devices can share the same anchor, so everyone sees virtual objects aligned to the same physical space. Cross-session continuity. You can leave a space and come back later, and all the virtual elements will still be in the right place. In AR/MR, your device builds a point cloud or feature map by using the camera and built-in sensors like the IMU (inertial measurement unit). Spatial anchors are then tied to those features, and without them, virtual objects can drift or float around as you move, shattering the sense of immersion. Technical Mechanics of Spatial Anchors At a high level, creating and using spatial anchors involves a series of steps: Feature Detection & Mapping To start, the device needs to understand its surroundings: it scans the environment to identify stable visual features (e.g., corners, edges). Over time, these features are triangulated, forming a sparse map or mesh of the space. This feature map is what the system relies on to anchor virtual objects. Anchor Creation Next, anchors are placed at specific 3D locations in the environment in two possible ways: Hit-testing. The system casts a virtual ray from a camera to a user-tapped point, then drops an anchor on the detected surface. Manual placement. Sometimes, developers need precise control, so they manually specify the exact location of an anchor using known coordinates, like ensuring it perfectly fits on the floor or another predefined plane. Persistence & Serialization Anchors aren’t temporary — they can persist, and here’s how systems make that possible: Locally stored anchors. Frameworks save the anchor’s data, like feature descriptors and transforms, in a package called a “world map” or “anchor payload”. Cloud-based anchors. Cloud services like Azure Spatial Anchors (ASA) upload this anchor data to a remote server to let the same anchor be accessed across multiple devices. Synchronization & Restoration When you’re reopening the app or accessing the anchor on a different device, the system uses the saved data to restore the anchor’s location. It compares stored feature descriptors to what the camera sees in real time, and if there’s a good enough match, the system confidently snaps the anchor into position, and your virtual content shows up right where it’s supposed to. However, using spatial anchors isn’t perfect, like using any other technology, and there are some tricky issues to figure out: Low latency. Matching saved data to real-time visuals has to be quick; otherwise, the user experience feels clunky. Robustness in feature-scarce environments. Blank walls or textureless areas don’t give the system much to work with and make tracking tougher. Scale drift. Little errors in the system’s tracking add up over time to big discrepancies. When everything falls into place and the challenges are handled well, spatial anchors make augmented and virtual reality experiences feel seamless and truly real. ARKit’s Spatial Anchors (Apple) Apple’s ARKit, rolled out with iOS 11, brought powerful features to developers working on AR apps, and one of them is spatial anchoring, which allows virtual objects to stay fixed in the real world as if they belong there. To do this, ARKit provides two main APIs that developers rely on to achieve anchor-based persistence. ARAnchor & ARPlaneAnchor The simplest kind of anchor in ARKit is the ARAnchor, which represents a single 3D point in the real-world environment and acts as a kind of “pin” in space that ARKit can track. Building on this, ARPlaneAnchor identifies flat surfaces like tables, floors, and walls, allowing developers to tie virtual objects to these surfaces. ARWorldMap ARWorldMap makes ARKit robust for persistence and acts as a snapshot of the environment being tracked by ARKit. It captures the current session, including all detected anchors and their surrounding feature points, into a compact file. There are a few constraints developers need to keep in mind: World maps are iOS-only, which means they cannot be shared directly with Android. There must be enough overlapping features between the saved environment and the current physical space, and textured structures are especially valuable for this, as they help ARKit identify key points for alignment. Large world maps, especially those with many anchors or detailed environments, can be slow to serialize and deserialize, causing higher application latency when loading or saving. ARKit anchors are ideal for single-user persistence, but sharing AR experiences across multiple devices poses additional issues, and developers often employ custom server logic (uploading ARWorldMap data to a backend), enabling users to download and use the same map. However, this approach comes with caveats: it requires extra development work and doesn’t offer native support for sharing across platforms like iOS and Android. ARCore’s Spatial Anchors (Google) Google’s ARCore is a solid toolkit for building AR apps, and one of its best features is how it handles spatial anchors: Anchors & Hit-Testing ARCore offers two ways to create anchors. You can use Session.createAnchor(Pose) if you already know the anchor’s position, or…



Let's discuss your ideas

Contact us