Python操作hdfs,,Python直接操作
Python操作hdfs,,Python直接操作
Python直接操作hdfs,包括追加数据文件到hdfs文件
#!coding:utf-8import sys from hdfs.client import Client#设置utf-8模式reload(sys)sys.setdefaultencoding( "utf-8" )#关于python操作hdfs的API可以查看官网:#https://hdfscli.readthedocs.io/en/latest/api.html #读取hdfs文件内容,将每行存入数组返回def read_hdfs_file(client,filename): #with client.read(‘samples.csv‘, encoding=‘utf-8‘, delimiter=‘\n‘) as reader: # for line in reader: #pass lines = [] with client.read(filename, encoding=‘utf-8‘, delimiter=‘\n‘) as reader: for line in reader: #pass #print line.strip() lines.append(line.strip()) return lines #创建目录def mkdirs(client,hdfs_path) : client.makedirs(hdfs_path) #删除hdfs文件def delete_hdfs_file(client,hdfs_path): client.delete(hdfs_path) #上传文件到hdfsdef put_to_hdfs(client,local_path,hdfs_path): client.upload(hdfs_path, local_path,cleanup=True) #从hdfs获取文件到本地def get_from_hdfs(client,hdfs_path,local_path): download(hdfs_path, local_path, overwrite=False)#追加数据到hdfs文件 def append_to_hdfs(client,hdfs_path,data): client.write(hdfs_path, data,overwrite=False,append=True) #覆盖数据写到hdfs文件def write_to_hdfs(client,hdfs_path,data): client.write(hdfs_path, data,overwrite=True,append=False) #移动或者修改文件 def move_or_rename(client,hdfs_src_path, hdfs_dst_path): client.rename(hdfs_src_path, hdfs_dst_path) #返回目录下的文件def list(client,hdfs_path): return client.list(hdfs_path, status=False) #client = Client(url, root=None, proxy=None, timeout=None, session=None) #client = Client("http://hadoop:50070") #move_or_rename(client,‘/input/2.csv‘, ‘/input/emp.csv‘)#read_hdfs_file(client,‘/input/emp.csv‘)#put_to_hdfs(client,‘/home/shutong/hdfs/1.csv‘,‘/input/‘)#append_to_hdfs(client,‘/input/emp.csv‘,‘我爱你‘+‘\n‘)#write_to_hdfs(client,‘/input/emp.csv‘,‘我爱你‘+‘\n‘)#read_hdfs_file(client,‘/input/emp.csv‘)#move_or_rename(client,‘/input/emp.csv‘, ‘/input/2.csv‘)#mkdirs(client,‘/input/python‘)#print list(client,‘/input/‘)#chown(client,‘/input/1.csv‘, ‘root‘)
Python操作hdfs
相关内容
- Ubuntu16.04 python3.4.3升级到python3.7.1,,python有两个版
- python启程——第一篇,,关于自己python
- python实现两个经纬度点之间的距离和方位角,,from:h
- python-按照一定规律去分割一个列表,,a=[1,2,3,4
- Python中pymysql基本使用,,Python中pym
- python函数题,,6-1 jmu-py
- 20200114 python3.8下载及安装,,操作系统:Windo
- python网络编程-udp,,目录1. 创建soc
- Python字典,,1、字典类型定义 (
- kazoo python zookeeper 选主,,本文讲述基于zook
评论关闭