Overview
In this step, you’ll apply a Data Fusion model to the input features you selected earlier in your experiment. Data Fusion transforms your raw rasters into a set of compact, information-rich layers that help the predictive model better understand geological structure, texture, and spatial relationships.
This dimensionality-reduction process adapts a globally trained model to your specific AOI and geological context. The result is a new set of Embedded Visual Layers, which you can review and assess before moving to the next stage.
Why do we Apply Data Fusion?
Applying Data Fusion to your input features means transforming your selected rasters into a set of 24 lower-dimensional representations (embeddings) that preserve meaningful spatial patterns and relationships. This is done using a Data Fusion Model, which is a pre-trained AI model trained on hundreds of thousands of rasters across specific mineral system types.
The data fusion process captures:
Spatial relationships across rasters (how different layers relate to one another)
Spatial structure within each raster (how features are arranged in space)
The Data Fusion Model, trained on large-scale data, is then fine-tuned with your selected rasters and learning points. This produces 24 new embedded rasters that are optimized for your AOI and geological context.
Why This Step Matters
Geoscience Perspective
Data Fusion helps isolate geological patterns that might not be immediately visible in raw data. By using a Data Fusion model trained on similar mineral systems, you can leverage global insights and tailor them to your specific project area. This increases confidence in your predictions and helps ensure the model is grounded in domain-relevant geoscience.
View the full list of available Data Fusion models and the mineral systems they are trained on here.
AI Perspective
This step applies dimensionality reduction to your dataset, making it more manageable for machine learning while preserving essential spatial and structural features.
Selecting the right combination of rasters improves the model’s ability to recognize geological patterns during training. The number of epochs you set controls how much the model learns from your local data, allowing it to fine-tune predictions specific to your project. DORA includes an early stopping mechanism that automatically halts training once performance stops improving, helping to avoid overfitting and unnecessary computation.
This is a form of few-shot learning. The model has already learned from hundreds of thousands of rasters across global projects, but it adapts to your selected input features by learning just enough to optimize predictions for your geological setting.
💡Tip: When adjusting the number of training epochs, consider the trade-offs:
|
Step by Step Instructions
Open Apply Data Fusion
From the experiment setup panel, click Step 4: Apply Data Fusion.
Select a Data Fusion Model
Set the Number of Training Epochs
Click Apply Data Fusion
This triggers the dimensionality reduction and creates rasterized outputs for each input feature.
(Optional) Review the Visual Results
Use the 3D Layers List to preview the generated raster layers. Toggle visibility (eye icon) and adjust the color scheme or base displacement scale to better visualize textures and surface patterns.
While reviewing, look for signs of artifacts — visual inconsistencies caused by mixing high-resolution datasets with coarser grids that cover the full AOI. These can appear as abrupt edges or linear patterns within the embedded layers.
These artifacts may lead to skewed VRIFY Prospectivity Score (VPS) results, such as overly prospective areas around the edges of smaller surveys.
If this happens, try removing the problematic layer and rerun the embeddings to see if the issue resolves.
Complete Step
Click Proceed to move to the next stage.
Tips & Considerations
Data Fusion Model Naming
Model names like c2p4 refer to the number of training images used (e.g., c2p4 represents 24,000 training images).
Prioritize models trained on the correct deposit type, then select the one with the highest training count for the most reliable results.
Did you notice the naming convention c2p4, c1p8, etc? It’s a reference to C-3PO from Star Wars!
Epoch Strategy
If results aren’t improving with additional epochs, stick with the early-stopped model — it’s likely already optimized.
If you select the None Data Fusion Model, consider using a higher number of epochs (e.g., 300) to allow for deeper refinement, since there are fewer pre-trained images available to guide the embedding process.
Learn More
Previous Steps:
Next Steps:
Still Have Questions?
Reach out to your dedicated DORA contact or email support@VRIFY.com for more information.




