The Python Oracle

Can't store downloaded files in their concerning folders

Become part of the top 3% of the developers by applying to Toptal https://topt.al/25cXVn

--

Music by Eric Matyas
https://www.soundimage.org
Track title: Techno Intrigue Looping

--

Chapters
00:00 Question
02:05 Accepted answer (Score 6)
03:17 Answer 2 (Score 0)
03:47 Answer 3 (Score -1)
04:13 Thank you

--

Full question
https://stackoverflow.com/questions/5462...

Question links:
[image]: https://i.stack.imgur.com/XrUvW.jpg

Answer 2 links:
[pathlib]: https://docs.python.org/3.7/library/path...
[pathlib2]: https://pypi.org/project/pathlib2/

--

Content licensed under CC BY-SA
https://meta.stackexchange.com/help/lice...

--

Tags
#python #python3x #selenium #seleniumwebdriver #webscraping

#avk47



ACCEPTED ANSWER

Score 6


I just added the the rename of the file to move it. So it'll work just as you have it, but then once it downloads the file, will move it to the correct path:

os.rename(desk_location + '\\' + filename, file_location)

Full Code:

import os
import time
from selenium import webdriver

link ='https://www.online-convert.com/file-format/doc' 

dirf = os.path.expanduser('~')
desk_location = dirf + r'\Desktop\file_folder'
if not os.path.exists(desk_location):
    os.mkdir(desk_location)

def download_files():
    driver.get(link)
    for item in driver.find_elements_by_css_selector("a[href$='.doc']")[:2]:
        filename = item.get_attribute("href").split("/")[-1]
        #creating new folder in accordance with filename to store the downloaded file in thier concerning folder
        folder_name = item.get_attribute("href").split("/")[-1].split(".")[0]
        #set the new location of the folders to be created
        new_location = os.path.join(desk_location,folder_name)
        if not os.path.exists(new_location):
            os.mkdir(new_location)
        #set the location of the folders the downloaded files will be within
        file_location = os.path.join(new_location,filename)
        item.click()

        time_to_wait = 10
        time_counter = 0

        try:
            while not os.path.exists(file_location):
                time.sleep(1)
                time_counter += 1
                if time_counter > time_to_wait:break
            os.rename(desk_location + '\\' + filename, file_location)
        except Exception:pass

if __name__ == '__main__':
    chromeOptions = webdriver.ChromeOptions()
    prefs = {'download.default_directory' : desk_location,
            'profile.default_content_setting_values.automatic_downloads': 1
        }
    chromeOptions.add_experimental_option('prefs', prefs)
    driver = webdriver.Chrome(chrome_options=chromeOptions)
    download_files()



ANSWER 2

Score 0


Use this code while declaring the Driver object (This is for Java, Python will also have a similar way to accomplish it) This will download the file to the specified location every time.

    //Create preference object
    HashMap<String, Object> chromePrefs = new HashMap<String , Object>();   
    //Set Download path  
    chromePrefs.put("download.default_directory","C:\\Reports\\AutomaionDownloads");
        chromePrefs.put("download.directory_upgrade", true);
        ChromeOptions options = new ChromeOptions();
        options.setExperimentalOption("prefs", chromePrefs);    
        //Call the Chrome Driver
        WebDriver driver = new ChromeDriver(options);