Python遞歸遍歷文件夾搜索文件腳本

千鋒Python學堂 2019-10-19

展開全文

開發(fā)背景：

電腦的E盤里有很多電子書,，以前對個技術比較感興趣就去下載很多電子書,，有些看了，有些沒看,，這些電子書沒有在一個地方,，于是我準備寫一個腳本，將這個電子書書搜索出來,，進行整理一下,。

程序設計的思路：

定義一個搜索的根目錄baseDir，一個不搜索的文件夾列表notSearhFolderArr,，一個搜索的文件類型列表searchTypeArr,，

判斷根目錄baseDir是有效的，并且不存在于notSearhFolderArr數(shù)組中,，

獲取文件夾下的所有文件及文件夾,，

遍歷，判斷子元素是文件就,，判斷文件類型是否存在于searchTypeArr,，如果存在返回路徑

判斷子元素，是文件夾并且不屬于notSearhFolderArr數(shù)組中,，執(zhí)行第一步,，進行遞歸搜索

代碼：

 # 根據(jù)配置好的文件，搜索文件夾import osimport ioimport sys
sys.stdout = io.TextIOWrapper(sys.stdout.buffer,encoding='utf8')# 主函數(shù)baseDir = "E:\\Pang\\for_search" # 搜索的根目錄notSearchFolderArr = ['node_modules'] # 不搜索的目錄searchFileTypeArr = ['.pdf','.PDF'] # 搜索的文件類型def searhMain():
 allResArr = searchFolder(baseDir)
 print('\n'.join(allResArr))# 搜索一個文件目錄 傳入一個文件目錄路徑def searchFolder(folderPath):
 folderName = os.path.split(folderPath)[-1]
 searFilePathArr = [] if os.path.exists(folderPath) and (folderName not in notSearchFolderArr):
 fileArr = os.listdir(folderPath) for item in fileArr:
 currentPath = folderPath+'\\'+item
 (fileName,fileType) = os.path.splitext(item) if os.path.isfile(currentPath) and (fileType in searchFileTypeArr):
 searFilePathArr.append(currentPath) if os.path.isdir(currentPath) and (item not in notSearchFolderArr):
 innerFileArr = searchFolder(currentPath)
 searFilePathArr.extend(innerFileArr) return searFilePathArr
searhMain()

主要用到的模塊和api：

模塊 os：操作文件的模塊

主要api：

os.path.split ： 分割路徑
os.path.exists： 路徑是否存在
os.listdir： 路徑是否是文件夾
os.path.splitext：拆分路徑中的文件擴展名于其他
os.path.isfile： 路徑是否是文件
append： 向數(shù)組中追加一個元素
extend: 向數(shù)組追加一個數(shù)組

運行結果：

程序返回的事根目錄下所有的pdf文件路徑列表

Python遞歸遍歷文件夾搜索文件腳本