As Azure Machine Learning is more mature than Synapse Data Science in Fabric, which I have recently been using, I wanted to know:
- Is there a way to access OneLake data (Files or Tables) from an Azure ML instance within the same subscription?
- If so, how? (I did not find any documentation or tutorial for this case. The closest thing I found is an explanation of how to make a model endpoint from Azure ML available in Fabric.)
- What are the possible bottlenecks (for example, data transfer if I need to use an intermediate storage)?
Thanks in advance
2 Answers
FYI: I managed to do it the following way, since the OneLake connector seems to struggle to exactly match the variables the API expects. In my case, to reach Tables in a OneLake on Fabric:
datastore.yml
In the cloud shell, after uploading the yml file:
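The original YAML file and shell command are not reproduced here. As a rough equivalent, the same datastore registration can be sketched with the Azure ML Python SDK v2 (`azure-ai-ml`). Everything below is an assumption rather than the answerer's exact setup: the helper names, the datastore name, and the workspace and lakehouse values are hypothetical placeholders, and OneLake datastore support in the SDK is a preview feature.

```python
def onelake_datastore_settings(name, fabric_workspace, lakehouse):
    """Collect the fields an Azure ML OneLake datastore needs.

    All three arguments are placeholders to be replaced with your own
    values (hypothetical names, not taken from the original answer).
    """
    return {
        "name": name,
        "one_lake_workspace_name": fabric_workspace,
        # Global OneLake DFS endpoint.
        "endpoint": "onelake.dfs.fabric.microsoft.com",
        "artifact_name": lakehouse,
    }


def register_onelake_datastore(settings):
    """Register the datastore in the Azure ML workspace found via config.json."""
    # Imported lazily so the helper above works without the Azure SDK installed.
    from azure.ai.ml import MLClient
    from azure.ai.ml.entities import OneLakeArtifact, OneLakeDatastore
    from azure.identity import DefaultAzureCredential

    ml_client = MLClient.from_config(credential=DefaultAzureCredential())
    store = OneLakeDatastore(
        name=settings["name"],
        one_lake_workspace_name=settings["one_lake_workspace_name"],
        endpoint=settings["endpoint"],
        artifact=OneLakeArtifact(name=settings["artifact_name"], type="lake_house"),
    )
    return ml_client.datastores.create_or_update(store)
```

Calling `register_onelake_datastore(onelake_datastore_settings("onelake_ds", "<fabric-workspace>", "<lakehouse>"))` from a compute instance would then create or update the datastore, assuming the identity in use has access to the Fabric workspace.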
Inside a notebook, using mltable:
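The notebook snippet itself is not reproduced here, so the following is only a minimal sketch of what reading a Fabric table through a registered datastore could look like. Fabric lakehouse Tables are stored as Delta tables, so `mltable.from_delta_lake` is used. The helper names, the datastore name, and the `Tables/<table>` path are assumptions; adjust the path to however your datastore artifact is scoped.

```python
def datastore_uri(subscription_id, resource_group, workspace, datastore, path):
    """Build a long-form Azure ML datastore URI for a path inside a datastore."""
    return (
        f"azureml://subscriptions/{subscription_id}"
        f"/resourcegroups/{resource_group}"
        f"/workspaces/{workspace}"
        f"/datastores/{datastore}/paths/{path.lstrip('/')}"
    )


def load_delta_table(uri):
    """Read a Delta table directory into a pandas DataFrame via mltable."""
    # Imported lazily so datastore_uri() can be used without mltable installed.
    import mltable

    tbl = mltable.from_delta_lake(delta_table_uri=uri)
    return tbl.to_pandas_dataframe()
```

For example, `load_delta_table(datastore_uri("<sub-id>", "<rg>", "<aml-workspace>", "onelake_ds", "Tables/<table>"))` would materialize the table in memory. For large tables, the transfer from OneLake to the compute instance is the likely bottleneck.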
Works like a charm!
To access OneLake data from an Azure ML instance within the same subscription, you can use Azure Data Lake Storage Gen2 (ADLS Gen2) as the intermediate storage.
You can use the Datastore and Dataset classes in the Azure ML SDK to access data.
I hope this helps 🙂