Does Auto Loader in Databricks load the same file again if the file arrives under a different path?
Or does it load the same file again if the file is put in the same directory again after some time?
If the file is loaded from another path?
Yes. Auto Loader identifies a file by its path, so a file that shows up under a different path is treated as a new file and is loaded, even if its content, filename, timestamp and other metadata are identical.
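If the file is added to the same directory again after some time?
It depends. If the filename has changed, yes: the path is different, so it counts as a new file. If the file is put back under exactly the same path, it is not loaded again by default, even if its contents have changed. To have an overwritten file picked up again, set the option cloudFiles.allowOverwrites to true on the stream.
Basically, Auto Loader records the files it has already ingested in the stream's checkpoint location to avoid redundant loading, so a file is loaded only when it is not already in that record (or when it has been overwritten and overwrites are allowed).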
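
For reference, here is a minimal sketch of where this behaviour is configured. The two helper functions, the JSON format and every path or table name you pass in are my own placeholders, not something from the question; only the `cloudFiles.*` options, `checkpointLocation` and the stream calls are real Auto Loader / Structured Streaming settings. It assumes you run it in a Databricks notebook where `spark` already exists.

```python
def auto_loader_options(schema_location, allow_overwrites=False):
    """Build reader options for Auto Loader (helper name and format are placeholders)."""
    return {
        # Assumed input format; replace with the format of your files.
        "cloudFiles.format": "json",
        # Where Auto Loader stores the inferred schema.
        "cloudFiles.schemaLocation": schema_location,
        # Default is false: a file overwritten under the same path is not reloaded.
        "cloudFiles.allowOverwrites": str(allow_overwrites).lower(),
    }


def start_ingest(spark, source_dir, checkpoint_dir, target_table, allow_overwrites=False):
    """Start an Auto Loader stream; `spark` is the notebook's SparkSession."""
    reader = (
        spark.readStream.format("cloudFiles")
        .options(**auto_loader_options(checkpoint_dir, allow_overwrites))
        .load(source_dir)
    )
    return (
        reader.writeStream
        # The record of already ingested files lives under this checkpoint.
        .option("checkpointLocation", checkpoint_dir)
        # Process whatever is available now, then stop.
        .trigger(availableNow=True)
        .toTable(target_table)
    )
```

Calling `start_ingest` a second time with the same `checkpoint_dir` should skip files that were already ingested. Pointing it at a fresh checkpoint directory starts with an empty record, so everything in `source_dir` is loaded again.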