RError.com

RError.com Logo RError.com Logo

RError.com Navigation

  • 主页

Mobile menu

Close
  • 主页
  • 系统&网络
    • 热门问题
    • 最新问题
    • 标签
  • Ubuntu
    • 热门问题
    • 最新问题
    • 标签
  • 帮助
主页 / 问题 / 1098848
Accepted
MaxU - stop genocide of UA
MaxU - stop genocide of UA
Asked:2020-03-24 05:33:13 +0000 UTC2020-03-24 05:33:13 +0000 UTC 2020-03-24 05:33:13 +0000 UTC

使用自动列宽、列自动过滤器等将 CSV/DataFrame 保存到 Excel。

  • 772

需要自动将 CSV 转换为 Excel,以便自动设置适当的列宽,自动设置列“自动过滤器”,并自动冻结带有列名称的顶行等。

使用将 CSV 文件转换为 Excel 格式的 Pandas 模块的解决方案:

import pandas as pd

pd.read_csv("filename.csv").to_excel("filename.xlsx", index=False)

问题:但是自动“自动过滤器”、适应数据宽度或列名宽度的列宽以及冻结列名的行呢?

python
  • 2 2 个回答
  • 10 Views

2 个回答

  • Voted
  1. MaxU - stop genocide of UA
    2020-03-24T05:33:13Z2020-03-24T05:33:13Z

    下面的功能允许您自动设置列“自动过滤器”,使列宽适应数据宽度或列名宽度,以及“冻结”顶部行和/或左侧列:

    from pathlib import Path
    import pandas as pd
    
    
    def df_to_excel_auto_fmt(
            df,
            fn,
            max_col_width=30,
            autofilter=True,
            freeze_panes=(1, 0),
            fmt_int="#,##0",
            fmt_float="#,##0.00", 
            fmt_date="yyyy-mm-dd",
            fmt_datetime="yyyy-mm-dd hh:mm:ss",
            **kwargs):
        """
        Export / save Pandas.DataFrame to an Excel file with automatically adjusted column's widths.
        It can also add Excel column "autofilters" and freeze panes (top rows and left columns).
        Cell values will be formatted according to "fmt_*" parameters.
    
        :param df: DataFrame to be exported to Excel
        :param fn: output Excelfile name. NOTE: the extension will be changed to ".xlsx"
        :param max_col_width: maximum column width in Excel. Default: 30
        :param autofilter: boolean - whether add Excel autofilter or not. Default: True
        :param freeze_panes: tuple of int (length 2).
                             Specifies the one-based bottommost row and rightmost column
                             that is to be frozen.
        :param fmt_int: Excel format for integer numbers
        :param fmt_float: Excel format for float numbers
        :param fmt_date: Excel format for dates
        :param fmt_datetime: Excel format for datetime's
        :param kwargs: additional arguments to pass to df.to_excel(filename, **kwargs)
        :return:  None
        """
        file = Path(fn).with_suffix(".xlsx")
        # get default parameters
        first_col = int(kwargs.get("index", True))
        sheet_name = kwargs.get("sheet_name", "Sheet1")
        if "freeze_panes" not in kwargs:
            kwargs["freeze_panes"] = freeze_panes
        writer = pd.ExcelWriter(
            file.with_suffix(".xlsx"), 
            engine="xlsxwriter",
            date_format=fmt_date,
            datetime_format=fmt_datetime)
        df.to_excel(writer, sheet_name=sheet_name, **kwargs)
        workbook = writer.book
        worksheet = writer.sheets[sheet_name]
        int_fmt = workbook.add_format({'num_format': fmt_int})
        float_fmt = workbook.add_format({'num_format': fmt_float})
        for xl_col_no, dtyp in enumerate(df.dtypes, first_col):
            col_no = xl_col_no - first_col
            width = max(df.iloc[:, col_no].astype(str).str.len().max(), 
                        len(df.columns[col_no]) + 6)
            width = min(max_col_width, width)
            # print(f"column: [{df.columns[col_no]}]\twidth:\t[{width}]")
            if np.issubdtype(dtyp, np.integer):
                worksheet.set_column(xl_col_no, xl_col_no, width, int_fmt)
            elif np.issubdtype(dtyp, np.floating):
                worksheet.set_column(xl_col_no, xl_col_no, width, float_fmt)
            else:
                worksheet.set_column(xl_col_no, xl_col_no, width)
        if autofilter:
            worksheet.autofilter(0, 0, 0, df.shape[1] + first_col)
        writer.save()
        writer.close()
    

    使用示例:

    fn = r"c:\download\file.csv"
    df = pd.read_csv(fn, sep=";")
    df_to_excel_auto_fmt(
        df,
        fn,
        max_col_width=30,
        fmt_datetime="dd.mm.yy hh:mm",
        index=False)
    
    • 4
  2. Best Answer
    MaxU - stop genocide of UA
    2020-09-15T06:49:03Z2020-09-15T06:49:03Z

    下次更新模块后,基于的版本XlsxWriter停止工作,所以我决定使用以下方法重写相同的函数openpyxl:

    def df_to_excel_auto_fmt(
            df,
            fn,
            max_col_width=30,
            autofilter=True,
            freeze_panes=(1, 0),
            fmt_int="#,##0",
            fmt_float="#,##0.00",
            fmt_date="yyyy-mm-dd",
            fmt_datetime="yyyy-mm-dd hh:mm:ss",
            **kwargs):
        """
        Export / save Pandas.DataFrame to an Excel file with automatically adjusted column's widths.
        It can also add Excel column "autofilters" and freeze panes (top rows and left columns).
        Cell values will be formatted according to "fmt_*" parameters.
    
        :param df: DataFrame to be exported to Excel
        :param fn: output Excelfile name. NOTE: the extension will be changed to ".xlsx"
        :param max_col_width: maximum column width in Excel. Default: 30
        :param autofilter: boolean - whether add Excel autofilter or not. Default: True
        :param freeze_panes: tuple of int (length 2).
                             Specifies the one-based bottommost row and rightmost column
                             that is to be frozen.
        :param fmt_int: Excel format for integer numbers
        :param fmt_float: Excel format for float numbers
        :param fmt_date: Excel format for dates
        :param fmt_datetime: Excel format for datetime's
        :param kwargs: additional arguments to pass to df.to_excel(filename, **kwargs)
        :return:  None
    
        (c) https://ru.stackoverflow.com/users/211923/maxu?tab=profile
        """
        from openpyxl.utils import get_column_letter
    
        def set_column_format(ws, column_letter, fmt):
            for cell in ws[column_letter]:
                cell.number_format = fmt
        file = Path(fn).with_suffix(".xlsx")
        # get default parameters
        first_col = int(kwargs.get("index", True)) + 1
        sheet_name = kwargs.get("sheet_name", "Sheet1")
        if "freeze_panes" not in kwargs:
            kwargs["freeze_panes"] = freeze_panes
        writer = pd.ExcelWriter(
            file.with_suffix(".xlsx"),
            engine="openpyxl",
            date_format=fmt_date,
            datetime_format=fmt_datetime)
        df.to_excel(writer, sheet_name=sheet_name, **kwargs)
        # workbook = writer.book
        worksheet = writer.sheets[sheet_name]
        for xl_col_no, dtyp in enumerate(df.dtypes, first_col):
            col_no = xl_col_no - first_col
            width = max(df.iloc[:, col_no].astype(str).str.len().max(),
                        len(df.columns[col_no]) + 6)
            width = min(max_col_width, width)
            # print(f"column: [{df.columns[col_no]} ({dtyp.name})]\twidth:\t[{width}]")
            column_letter = get_column_letter(xl_col_no)
            worksheet.column_dimensions[column_letter].width = width
            if np.issubdtype(dtyp, np.integer):
                set_column_format(worksheet, column_letter, fmt_int)
            if np.issubdtype(dtyp, np.floating):
                set_column_format(worksheet, column_letter, fmt_float)
        if autofilter:
            worksheet.auto_filter.ref = worksheet.dimensions
        writer.save()
        writer.close()
    

    使用示例:

    fn = r"c:\download\file.csv"
    df = pd.read_csv(fn, sep=";")
    df_to_excel_auto_fmt(
        df,
        fn,
        max_col_width=30,
        fmt_datetime="dd.mm.yy hh:mm",
        index=False)
    
    • 4

相关问题

  • 是否可以以某种方式自定义 QTabWidget?

  • telebot.anihelper.ApiException 错误

  • Python。检查一个数字是否是 3 的幂。输出 无

  • 解析多个响应

  • 交换两个数组的元素,以便它们的新内容也反转

Sidebar

Stats

  • 问题 10021
  • Answers 30001
  • 最佳答案 8000
  • 用户 6900
  • 常问
  • 回答
  • Marko Smith

    如何从列表中打印最大元素(str 类型)的长度?

    • 2 个回答
  • Marko Smith

    如何在 PyQT5 中清除 QFrame 的内容

    • 1 个回答
  • Marko Smith

    如何将具有特定字符的字符串拆分为两个不同的列表?

    • 2 个回答
  • Marko Smith

    导航栏活动元素

    • 1 个回答
  • Marko Smith

    是否可以将文本放入数组中?[关闭]

    • 1 个回答
  • Marko Smith

    如何一次用多个分隔符拆分字符串?

    • 1 个回答
  • Marko Smith

    如何通过 ClassPath 创建 InputStream?

    • 2 个回答
  • Marko Smith

    在一个查询中连接多个表

    • 1 个回答
  • Marko Smith

    对列表列表中的所有值求和

    • 3 个回答
  • Marko Smith

    如何对齐 string.Format 中的列?

    • 1 个回答
  • Martin Hope
    Alexandr_TT 2020年新年大赛! 2020-12-20 18:20:21 +0000 UTC
  • Martin Hope
    Alexandr_TT 圣诞树动画 2020-12-23 00:38:08 +0000 UTC
  • Martin Hope
    Air 究竟是什么标识了网站访问者? 2020-11-03 15:49:20 +0000 UTC
  • Martin Hope
    Qwertiy 号码显示 9223372036854775807 2020-07-11 18:16:49 +0000 UTC
  • Martin Hope
    user216109 如何为黑客设下陷阱,或充分击退攻击? 2020-05-10 02:22:52 +0000 UTC
  • Martin Hope
    Qwertiy 并变成3个无穷大 2020-11-06 07:15:57 +0000 UTC
  • Martin Hope
    koks_rs 什么是样板代码? 2020-10-27 15:43:19 +0000 UTC
  • Martin Hope
    Sirop4ik 向 git 提交发布的正确方法是什么? 2020-10-05 00:02:00 +0000 UTC
  • Martin Hope
    faoxis 为什么在这么多示例中函数都称为 foo? 2020-08-15 04:42:49 +0000 UTC
  • Martin Hope
    Pavel Mayorov 如何从事件或回调函数中返回值?或者至少等他们完成。 2020-08-11 16:49:28 +0000 UTC

热门标签

javascript python java php c# c++ html android jquery mysql

Explore

  • 主页
  • 问题
    • 热门问题
    • 最新问题
  • 标签
  • 帮助

Footer

RError.com

关于我们

  • 关于我们
  • 联系我们

Legal Stuff

  • Privacy Policy

帮助

© 2023 RError.com All Rights Reserve   沪ICP备12040472号-5