Find file paths folder
- operators.sys.find_file_paths_folder(client, input_folders='/input_folders', output_folders='/output_folders', search_folder='/search_folder', replace_in='', replace_out='', substrings='', worker_instance_type='x2large', manager_instance_type='small', extension_input_files='.txt', extension_output_files='.txt', skip_existing_files=False)
- find_file_paths_folder(client,input_folders=’/input_folders’,output_folders=’/output_folders’,search_folder=’/search_folder’,replace_in=’’,replace_out=’’,substrings=’’,worker_instance_type=’x2large’,manager_instance_type=”small”,extension_input_folders=”.txt”,extension_output_folders=”.txt”,skip_existing_files = False )
- Parameters:
search_folder – Folder to traverse for finding files
replace_in – The part to replace in the filenames
replace_out – The new part to replace with
substrings – a list of substrings that need to occure in the file paths to be vallid
input_folders – File containing the list of foldernames
output_folders – Path to save the modified folderlist
worker_instance_type – cloud instance type of worker nodes
manager_instance_type – cloud instance type of manager node
extension_input_folders – File extension of files in folder for input_folders
extension_output_folders – File extension of files in folder for output_folders
skip_existing_files – skip files that already exist in the output folder