How can I use a sliding window to solve "Count Substrings That Can Be Rearranged to Contain a String I"?

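Grow a window over word1 one character at a time while tracking character frequencies. As soon as the window holds at least as many of each letter as word2 requires, shrink it from the left until it no longer does; every start position to the left of the final left boundary then yields a valid substring ending at the current index. The window is not fixed to the length of word2, because a valid substring may be longer than word2.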

What is the time complexity of solving this problem?

The time complexity is O(n + m), where n and m are the lengths of word1 and word2: word2 is counted once, and each character of word1 enters and leaves the window at most once.

What should I do if word2 is longer than word1?

If word2 is longer than word1, no valid substrings can exist, so you can immediately return 0.

How do I optimize the space complexity for this problem?

Store the character frequencies in a fixed-size array of 26 counters (one per lowercase English letter) instead of a general hash table. The extra space is then constant, O(1), regardless of the input lengths.
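A minimal sketch of the array-based variant described in this answer. The function name `valid_substring_count` is hypothetical, and the code assumes the stated constraints hold (lowercase English letters only, non-empty word2).

```python
def valid_substring_count(word1: str, word2: str) -> int:
    """Count substrings of word1 that can be rearranged to have word2 as a prefix."""
    if len(word1) < len(word2):
        return 0

    base = ord("a")
    cnt = [0] * 26  # required count of each letter, taken from word2
    for ch in word2:
        cnt[ord(ch) - base] += 1
    need = sum(1 for k in cnt if k > 0)  # distinct letters still below target

    win = [0] * 26  # letter counts inside the current window
    ans = left = 0
    for ch in word1:
        i = ord(ch) - base
        win[i] += 1
        if win[i] == cnt[i]:
            need -= 1
        # Shrink until the window no longer covers word2.
        while need == 0:
            j = ord(word1[left]) - base
            if win[j] == cnt[j]:
                need += 1
            win[j] -= 1
            left += 1
        ans += left  # start indices 0..left-1 are valid for this right end
    return ans
```

Both arrays have a fixed length of 26, so the memory used does not grow with either input.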

Can the sliding window approach work for other types of substring problems?

Yes, the sliding window technique is versatile and can be used for many substring-related problems, especially when looking for substrings that meet specific conditions.

#3297

Medium

Sliding window (rolling state update)

LeetCode Solution Workbench

Count Substrings That Can Be Rearranged to Contain a String I

Hash Table, String, Sliding Window

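Problem description

You are given two strings word1 and word2.

A string x is called valid if x can be rearranged to have word2 as a prefix.

Return the total number of valid substrings of word1.

Example 1:

Input: word1 = "bcca", word2 = "abc"

Output: 1

Explanation:

The only valid substring is "bcca", which can be rearranged to "abcc", having "abc" as a prefix.

Example 2:

Input: word1 = "abcabc", word2 = "abc"

Output: 10

Explanation:

All substrings except those of length 1 and 2 are valid.

Example 3:

Input: word1 = "abcabc", word2 = "aaabc"

Output: 0

Constraints:

1 <= word1.length <= 10⁵
1 <= word2.length <= 10⁴
word1 and word2 consist only of lowercase English letters.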
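The validity condition reduces to a letter-count comparison: x is valid exactly when it contains at least as many of each letter as word2. A brute-force sketch of that check, useful only for confirming the examples on small inputs, might look as follows. The name `brute_force_count` is hypothetical, and the approach is far too slow for the stated constraints.

```python
from collections import Counter


def brute_force_count(word1: str, word2: str) -> int:
    """Reference count: test every substring directly (slow, for checking only)."""
    required = Counter(word2)
    total = 0
    n = len(word1)
    for start in range(n):
        window = Counter()
        for end in range(start, n):
            window[word1[end]] += 1
            # Valid when the substring has at least the required count of each letter.
            if all(window[ch] >= k for ch, k in required.items()):
                total += 1
    return total


print(brute_force_count("bcca", "abc"))      # 1
print(brute_force_count("abcabc", "abc"))    # 10
print(brute_force_count("abcabc", "aaabc"))  # 0
```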

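Approach

Method 1: Sliding Window

The problem is equivalent to counting the substrings of $\textit{word1}$ that contain every character of $\textit{word2}$, with multiplicity. We can solve it with a sliding window.

First, if $\textit{word1}$ is shorter than $\textit{word2}$, no substring of $\textit{word1}$ can contain all characters of $\textit{word2}$, so we return $0$ immediately.

Next, we use a hash table or an array of length $26$, $\textit{cnt}$, to count the occurrences of each character in $\textit{word2}$. We then use $\textit{need}$ to record how many distinct characters still fall short of their required count; it is initialized to the number of distinct characters in $\textit{word2}$ (the number of keys in $\textit{cnt}$ when it is a hash table).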
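As a small illustration of this initialization, the snippet below builds $\textit{cnt}$ and $\textit{need}$ for the word2 of Example 3. It assumes the hash-table form of $\textit{cnt}$, using Python's `collections.Counter`.

```python
from collections import Counter

word2 = "aaabc"
cnt = Counter(word2)  # required occurrences of each letter
need = len(cnt)       # distinct letters that are not yet satisfied

print(dict(cnt))  # {'a': 3, 'b': 1, 'c': 1}
print(need)       # 3
```

Note that $\textit{need}$ counts distinct letters, not total characters: the three required copies of `a` contribute a single unit.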

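We then use $\textit{win}$ to record the occurrences of each character in the current window, $\textit{ans}$ to record the number of valid substrings, and $\textit{l}$ to record the left boundary of the window.

We iterate over each character $c$ of $\textit{word1}$ and add it to $\textit{win}$. If $\textit{win}[c]$ now equals $\textit{cnt}[c]$, the window has just reached the required count for character $c$, so we decrement $\textit{need}$.

If $\textit{need}$ equals $0$, the window contains all characters of $\textit{word2}$, and we shrink it from the left until $\textit{need}$ is greater than $0$. Specifically, if $\textit{win}[\textit{word1}[l]]$ equals $\textit{cnt}[\textit{word1}[l]]$, removing $\textit{word1}[l]$ drops that character below its required count, so we increment $\textit{need}$. In either case we decrement $\textit{win}[\textit{word1}[l]]$ and then increment $\textit{l}$.

After shrinking, the window is $[l, r]$ and no longer satisfies the condition, but for every $0 \leq l' \lt l$ the substring $[l', r]$ does. There are $l$ such substrings, and we add $l$ to the answer.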
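To make the bookkeeping concrete, here is a sketch that records the left boundary and the running answer after each index $r$ for Example 2. The helper name `trace_window` is hypothetical and exists only for this illustration; the type hints assume Python 3.9 or later.

```python
from collections import Counter


def trace_window(word1: str, word2: str) -> list[tuple[int, int, int]]:
    """Return (r, l, running answer) after processing each index r of word1."""
    cnt = Counter(word2)
    need = len(cnt)
    win = Counter()
    steps = []
    ans = l = 0
    for r, c in enumerate(word1):
        win[c] += 1
        if win[c] == cnt[c]:
            need -= 1
        while need == 0:
            if win[word1[l]] == cnt[word1[l]]:
                need += 1
            win[word1[l]] -= 1
            l += 1
        ans += l  # starts 0..l-1 all give a valid substring ending at r
        steps.append((r, l, ans))
    return steps


for step in trace_window("abcabc", "abc"):
    print(step)
# (0, 0, 0)
# (1, 0, 0)
# (2, 1, 1)
# (3, 2, 3)
# (4, 3, 6)
# (5, 4, 10)
```

The last running answer, 10, matches the expected output of Example 2.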

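Once every character of $\textit{word1}$ has been processed, $\textit{ans}$ holds the answer.

The time complexity is $O(n + m)$, where $n$ and $m$ are the lengths of $\textit{word1}$ and $\textit{word2}$ respectively. The space complexity is $O(|\Sigma|)$, where $\Sigma$ is the character set; here it is the set of lowercase letters, so the space is constant.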

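from collections import Counter


class Solution:
    def validSubstringCount(self, word1: str, word2: str) -> int:
        # A string shorter than word2 can never cover it.
        if len(word1) < len(word2):
            return 0
        cnt = Counter(word2)  # required count of each character
        need = len(cnt)  # distinct characters still below their required count
        ans = l = 0
        win = Counter()  # character counts in the current window
        for c in word1:
            win[c] += 1
            if win[c] == cnt[c]:
                need -= 1
            # Shrink from the left until the window stops covering word2.
            while need == 0:
                if win[word1[l]] == cnt[word1[l]]:
                    need += 1
                win[word1[l]] -= 1
                l += 1
            # Every start index in [0, l) gives a valid substring ending here.
            ans += l
        return ans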

Complexity analysis

Metric	Value
Time	$O(n + m)$
Space	$O(|\Sigma|)$

Follow-up questions interviewers often ask

Look for understanding of the sliding window technique.

Evaluate the candidate's ability to optimize space and time complexity.

Test whether the candidate can handle edge cases where word1 is shorter than word2 or where the character frequencies do not match.

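Common pitfalls

Not updating the frequency counts on both sides of the window: a character must be added when it enters on the right and subtracted when it leaves on the left.

Forgetting to handle the case where word2 is longer than word1.

Comparing the frequency tables incorrectly, for example requiring exact equality instead of "at least as many of each letter", which misclassifies substrings.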
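The comparison pitfall can be illustrated with a short sketch. The helper `covers` is a hypothetical name; it checks the "at least as many" condition and is contrasted with strict equality of the two tables on Example 1.

```python
from collections import Counter


def covers(window: str, word2: str) -> bool:
    """True if window has at least as many of each letter as word2."""
    win, cnt = Counter(window), Counter(word2)
    return all(win[ch] >= k for ch, k in cnt.items())


print(covers("bcca", "abc"))              # True: the extra 'c' is allowed
print(Counter("bcca") == Counter("abc"))  # False: strict equality rejects it
print(covers("cca", "abc"))               # False: 'b' is missing
```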

Advanced variants

What if word2 is empty? How would you handle that?

How can you modify the approach if word1 and word2 can contain uppercase letters?

How would the solution change if there were constraints on the number of distinct characters in word1?
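One possible way to address the first two variants is sketched below; it is an illustration under stated assumptions, not a reference answer. It assumes that an empty word2 counts as a prefix of every string, so every non-empty substring is valid, and it keeps hash-map counters so the alphabet is not limited to lowercase letters. The function name `valid_substring_count_general` is hypothetical.

```python
from collections import Counter


def valid_substring_count_general(word1: str, word2: str) -> int:
    """Sliding-window count that also accepts an empty word2 and any characters."""
    n = len(word1)
    if not word2:
        # Assumption: the empty string is a prefix of everything,
        # so all n * (n + 1) // 2 non-empty substrings count.
        return n * (n + 1) // 2
    if n < len(word2):
        return 0

    cnt = Counter(word2)  # hash map, so uppercase letters work unchanged
    need = len(cnt)
    win = Counter()
    ans = left = 0
    for ch in word1:
        win[ch] += 1
        if win[ch] == cnt[ch]:
            need -= 1
        while need == 0:
            head = word1[left]
            if win[head] == cnt[head]:
                need += 1
            win[head] -= 1
            left += 1
        ans += left
    return ans
```

The explicit guard matters: without it, `need` would start at 0 and the shrinking loop would run on a window that was never required to cover anything.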

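Continue practicing

#3298 Count Substrings That Can Be Rearranged to Contain a String II
#3305 Count of Substrings Containing Every Vowel and K Consonants I
#3306 Count of Substrings Containing Every Vowel and K Consonants II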