Question

Vadim Yermolenko

Asked: 2020-03-12 18:39:23 UTC

How do I delete lines in a file from one keyword through another?

I have a file with the following contents:

CREATE TABLE some_name (
fv int,
sv int,
tv int)
CLUSTERED BY (fv,
              sv,
              tv) 
SORTED BY (fv,
           sv,
           tv) INTO 2 BUCKETS;
-- more text afterwards

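For example, I need the script to delete everything from CLUSTERED through BUCKETS, both inclusive, while leaving the semicolon in place. How can this be done in code? I was given this: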

start_word = "CLUSTERED"
end_word = "BUCKETS"

result_lines = []

# target_file is the path to the file shown above
with open(target_file, 'r') as f:
    erasing = False
    for line in f:
        if not erasing and start_word in line:
            # begin erasing lines
            erasing = True
            continue

        if erasing and end_word in line:
            # finished erasing lines
            erasing = False
            continue

        if erasing:
            # we are between the start and end of the section we want to erase
            continue
        else:
            # either we haven't started erasing or we have already finished
            result_lines.append(line)

# each line read from the file already ends with '\n', so join without a separator
print(''.join(result_lines))

But it also deletes the semicolon; more generally, everything on the same lines as CLUSTERED and BUCKETS gets removed. The result should look like this:

CREATE TABLE some_name (
fv int,
sv int,
tv int)
-- more text afterwards;

2 Answers

MaxU - stop genocide of UA · Answer 1 · 2020-03-12T18:45:47Z

Best Answer

Use a regular expression:

import re

#text = """
#CREATE TABLE some_name (
#fv int,
#sv int,
#tv int)
#CLUSTERED BY (fv,
#              sv,
#              tv) 
#SORTED BY (fv,
#           sv,
#           tv) INTO 2 BUCKETS;
#-- more text afterwards
#"""

with open(target_file, 'r') as f:
    text = f.read()
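# raw string so that \s reaches the regex engine unchanged;
# the match runs from CLUSTERED up to (but not including) the next semicolon
res = re.sub(r'CLUSTERED\s+[^;]*', '', text)
print(res)

Result:
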
CREATE TABLE some_name (
fv int,
sv int,
tv int)
;
-- more text afterwards
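
The same idea can be generalized to arbitrary keywords. The following is only a minimal sketch, not part of the answer above: `remove_between` is a hypothetical helper name, and it assumes the keywords are plain words and that the whitespace before the start keyword should be removed as well, so that the semicolon ends up directly after `tv int)`.

```python
import re


def remove_between(text, start_word, end_word):
    """Remove everything from start_word through end_word, inclusive.

    Whitespace immediately before start_word is removed too, so whatever
    follows end_word (here the semicolon) attaches to the preceding text.
    """
    pattern = re.compile(
        r'\s*\b' + re.escape(start_word) + r'\b.*?\b' + re.escape(end_word) + r'\b',
        flags=re.IGNORECASE | re.DOTALL,  # ignore case; let '.' span line breaks
    )
    return pattern.sub('', text)


sample = (
    "CREATE TABLE some_name (\n"
    "fv int,\n"
    "sv int,\n"
    "tv int)\n"
    "CLUSTERED BY (fv,\n"
    "              sv,\n"
    "              tv) \n"
    "SORTED BY (fv,\n"
    "           sv,\n"
    "           tv) INTO 2 BUCKETS;\n"
    "-- more text afterwards\n"
)

print(remove_between(sample, "CLUSTERED", "BUCKETS"))
```

With the sample text this leaves `tv int);` on one line, followed by the comment line. The non-greedy `.*?` stops at the first occurrence of the end keyword, which matters if the file contains several such statements.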

gil9red · Answer 2 · 2020-03-12T18:52:42Z

An example with a regular expression:

import re

text = """\
CREATE TABLE some_name (
fv int,
sv int,
tv int)
CLUSTERED BY (fv,
              sv,
              tv) 
SORTED BY (fv,
           sv,
           tv) INTO 2 BUCKETS;
-- more text afterwards
"""

new_text = re.sub('Clustered.+?Buckets;', ';', text, flags=re.I | re.DOTALL)
print(new_text)

Result:

CREATE TABLE some_name (
fv int,
sv int,
tv int)
;
-- more text afterwards
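
If the file is too large to read in one go, the line-by-line loop from the question can probably be adapted so that it keeps the text outside the keywords instead of dropping whole lines. This is only a sketch under a few assumptions: `strip_section` is a hypothetical name, matching is case-sensitive and substring-based (unlike the `re.I` pattern above), and lines that become empty after trimming are dropped.

```python
def strip_section(lines, start_word, end_word):
    """Yield lines with the text from start_word through end_word removed.

    Text before start_word and after end_word on the same line is kept.
    """
    erasing = False
    for line in lines:
        kept = []
        rest = line
        while rest:
            if erasing:
                # Skip up to and including end_word; if it is not on this
                # line, partition() leaves rest empty and the loop ends.
                _, found, rest = rest.partition(end_word)
                if found:
                    erasing = False
            else:
                # Keep the text in front of start_word (or the whole remainder).
                head, found, rest = rest.partition(start_word)
                kept.append(head)
                if found:
                    erasing = True
        out = ''.join(kept)
        if out:  # lines that were erased completely are skipped
            yield out


sample = (
    "CREATE TABLE some_name (\n"
    "fv int,\n"
    "sv int,\n"
    "tv int)\n"
    "CLUSTERED BY (fv,\n"
    "              sv,\n"
    "              tv) \n"
    "SORTED BY (fv,\n"
    "           sv,\n"
    "           tv) INTO 2 BUCKETS;\n"
    "-- more text afterwards\n"
)

print(''.join(strip_section(sample.splitlines(keepends=True), "CLUSTERED", "BUCKETS")))
```

Because the function accepts any iterable of lines, an open file object can be passed in place of `sample.splitlines(keepends=True)`.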
