What is the core pattern behind Minimum Number of Valid Strings to Form Target II?

The core pattern is state transition dynamic programming, where dp[i] tracks the minimum concatenations needed for each target prefix.
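To make the dp[i] definition concrete, here is a minimal baseline sketch. It assumes small inputs, because it stores every word prefix as a string; the function name `min_valid_strings_dp` and the `valid` set are illustrative names, not part of the original solution.

```python
from typing import List


def min_valid_strings_dp(words: List[str], target: str) -> int:
    """Baseline DP: dp[i] = fewest valid strings that concatenate to target[:i]."""
    # Assumption: inputs are small enough to store every prefix as a string.
    valid = {w[:k] for w in words for k in range(1, len(w) + 1)}
    longest = max(len(w) for w in words)

    n = len(target)
    INF = float("inf")
    dp = [INF] * (n + 1)
    dp[0] = 0  # the empty prefix needs no pieces
    for i in range(n):
        if dp[i] == INF:
            continue  # target[:i] cannot be formed, so nothing extends from here
        for j in range(i + 1, min(n, i + longest) + 1):
            if target[i:j] not in valid:
                break  # if target[i:j] is not valid, no longer piece from i is valid
            dp[j] = min(dp[j], dp[i] + 1)
    return -1 if dp[n] == INF else dp[n]
```

This version is only meant to show the state transition; its running time grows with the target length times the longest word length, which is too slow for the full constraints.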

Can words be reused multiple times in forming the target?

Yes. Any prefix of any word may be used any number of times (Example 2 uses the same prefix of words[0] twice). The DP only counts pieces that are valid prefixes.

What if no combination of words can form the target?

The DP will leave dp[target.length] as infinity, and the algorithm returns -1 to indicate impossibility.

How does prefix preprocessing help in this problem?

Preprocessing into a trie or hashmap allows fast checking of valid prefixes, reducing redundant DP checks and improving efficiency.
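As one possible illustration of the trie option, the sketch below builds a nested-dict trie and walks it from a given target position. The helper names `build_trie` and `longest_match` are hypothetical. Note that the solution write-up in this document reports that a trie with memoized search times out at full scale, so treat this as an explanation aid rather than a submission.

```python
from typing import Dict, List


def build_trie(words: List[str]) -> Dict:
    """Nested-dict trie; every path from the root spells a valid prefix."""
    root: Dict = {}
    for w in words:
        node = root
        for ch in w:
            node = node.setdefault(ch, {})
    return root


def longest_match(trie: Dict, target: str, i: int) -> int:
    """Length of the longest valid string that starts at target[i]."""
    node, length = trie, 0
    while i + length < len(target) and target[i + length] in node:
        node = node[target[i + length]]
        length += 1
    return length
```

Every length from 1 up to the returned value is also a valid piece starting at `i`, because each step of the walk lands on a trie node.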

Is there a space-efficient version for very long targets?

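Partly. dp[i] only depends on the previous m states, where m is the length of the longest word, so a sliding window of that size is enough, and the greedy formulation needs only a few counters. The prefix structure (trie or hash sets) still takes space proportional to the total word length, and the running time is dominated by the prefix checks.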

#3292

Hard

State transition dynamic programming

LeetCode Solution Workbench

Minimum Number of Valid Strings to Form Target II


Tags: Array, String, Binary Search, Dynamic Programming, Segment Tree

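Problem description

You are given an array of strings words and a string target.

A string x is called valid if x is a prefix of any string in words.

Return the minimum number of valid strings that can be concatenated to form target. If it is not possible to form target this way, return -1.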

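Example 1:

Input: words = ["abc","aaaaa","bcdef"], target = "aabcdabc"

Output: 3

Explanation:

The target string can be formed by concatenating the following valid strings:

The prefix of length 2 of words[1], i.e. "aa".
The prefix of length 3 of words[2], i.e. "bcd".
The prefix of length 3 of words[0], i.e. "abc".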
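The decomposition in Example 1 can be checked mechanically. The short sketch below only confirms that these three pieces are prefixes that join to the target; it does not prove that 3 is the minimum. The `pieces` list is an illustrative encoding of the explanation as (word index, prefix length) pairs.

```python
words = ["abc", "aaaaa", "bcdef"]
target = "aabcdabc"

# (word index, prefix length) pairs taken from the explanation of Example 1.
pieces = [(1, 2), (2, 3), (0, 3)]
parts = [words[w][:k] for w, k in pieces]

assert parts == ["aa", "bcd", "abc"]
assert "".join(parts) == target
print(len(parts))  # prints 3, the number of valid strings used
```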

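Example 2:

Input: words = ["abababab","ab"], target = "ababaababa"

Output: 2

Explanation:

The target string can be formed by concatenating the following valid strings:

The prefix of length 5 of words[0], i.e. "ababa".
The prefix of length 5 of words[0], i.e. "ababa".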

Example 3:

Input: words = ["abcdef"], target = "xyz"

Output: -1

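Constraints:

1 <= words.length <= 100
1 <= words[i].length <= 5 * 10⁴
The input is generated such that sum(words[i].length) <= 10⁵.
words[i] consists only of lowercase English letters.
1 <= target.length <= 5 * 10⁴
target consists only of lowercase English letters.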

Solution approach

Approach 1: String hashing + binary search + greedy

Because the input size in this problem is large, the "trie + memoized search" approach will time out, so we need a more efficient solution.

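Consider the longest length that can be matched starting from the $i$-th character of $\textit{target}$, and call it $\textit{dist}$. Then for every $j \in [i, i + \textit{dist}-1]$ there is a string in $\textit{words}$ such that $\textit{target}[i..j]$ is a prefix of it. This property is monotonic, so we can use binary search to determine $\textit{dist}$.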
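The monotonicity claimed above can be observed directly on Example 1 with a brute-force check. This is only an illustrative sketch: `is_valid` is a hypothetical helper that tests the definition of a valid string naively, and is far too slow for the real constraints.

```python
from typing import List


def is_valid(words: List[str], x: str) -> bool:
    """Naive definition check: x is a prefix of some word."""
    return any(w.startswith(x) for w in words)


words = ["abc", "aaaaa", "bcdef"]
target = "aabcdabc"
i = 2  # start at target[2] == "b"

# flags[k-1] tells whether the length-k piece starting at i is valid.
flags = [is_valid(words, target[i:i + k]) for k in range(1, len(target) - i + 1)]

# Monotone: a run of True values followed only by False values.
assert flags == sorted(flags, reverse=True)
dist = sum(flags)  # number of leading True values, i.e. the longest valid length
print(flags, dist)
```

Because the flags never switch back from False to True, the boundary between them can be located by binary search instead of a linear scan.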

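Concretely, we first precompute the hash of every prefix of every string in $\textit{words}$ and store these hashes, grouped by prefix length, in the array $\textit{s}$. We also precompute the prefix hashes of $\textit{target}$ and store them in $\textit{hashing}$, so that the hash of any $\textit{target}[l..r]$ can be queried quickly.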
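For reference, one standard way to write the polynomial prefix hash described above is sketched here. The symbols $B$ (base), $M$ (modulus), $H_k$ and $t_k$ are introduced for this note only; positions are 1-based, $t_k$ is the character code of the $k$-th character of $\textit{target}$, and the reference code in this document picks $B = 13331$ and $M = 998244353$.

$$
H_0 = 0, \qquad H_k = (H_{k-1} \cdot B + t_k) \bmod M \quad (1 \le k \le n)
$$

$$
\operatorname{hash}(\textit{target}[l..r]) = \bigl(H_r - H_{l-1} \cdot B^{\,r-l+1}\bigr) \bmod M
$$

A word prefix of length $k$ is hashed with the same recurrence, so equal strings always produce equal hashes. The converse is not guaranteed: two different strings can collide, so a hash match indicates equality only with high probability.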

Next, we design a function $\textit{f}(i)$ that returns the longest length that can be matched starting from the $i$-th character of $\textit{target}$. We can determine $\textit{f}(i)$ by binary search.

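Set the left bound of the binary search to $l = 0$ and the right bound to $r = \min(n - i, m)$, where $n$ is the length of $\textit{target}$ and $m$ is the maximum length of the strings in $\textit{words}$. During the search, we check whether the hash of $\textit{target}[i..i+\textit{mid}-1]$ is one of the hashes in $\textit{s}[\textit{mid}]$. If it is, we update the left bound $l$ to $\textit{mid}$; otherwise we update the right bound $r$ to $\textit{mid}-1$. When the search ends, we return $l$.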
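The binary search above can be sketched on its own. To keep the example easy to read, this version stores the prefixes as plain strings instead of hashes, which is an assumption made for illustration and only suitable for small inputs; `prefixes_by_length` and `longest_from` are hypothetical names.

```python
from typing import List, Set


def prefixes_by_length(words: List[str]) -> List[Set[str]]:
    """s[k] holds every length-k prefix of every word (strings, not hashes)."""
    m = max(len(w) for w in words)
    s: List[Set[str]] = [set() for _ in range(m + 1)]
    for w in words:
        for k in range(1, len(w) + 1):
            s[k].add(w[:k])
    return s


def longest_from(s: List[Set[str]], target: str, i: int) -> int:
    """f(i): longest valid string starting at target[i], found by binary search."""
    n, m = len(target), len(s) - 1
    l, r = 0, min(n - i, m)
    while l < r:
        mid = (l + r + 1) >> 1  # round up so that l = mid always makes progress
        if target[i:i + mid] in s[mid]:
            l = mid
        else:
            r = mid - 1
    return l
```

Rounding `mid` up matters here: with `mid = (l + r) // 2` the update `l = mid` could leave the interval unchanged and loop forever.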

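Once $\textit{f}(i)$ is known, the problem becomes a classic greedy problem: starting from $i = 0$, the farthest position reachable from each position $i$ is $i + \textit{f}(i)$, and we want the minimum number of moves needed to reach the end.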

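We define $\textit{last}$ as the position reached by the previous move and $\textit{mx}$ as the farthest position reachable so far, with $\textit{last} = \textit{mx} = 0$ initially. We iterate from $i = 0$, updating $\textit{mx}$ to $\max(\textit{mx}, i + \textit{f}(i))$ at each step. If $i$ equals $\textit{last}$, we need to make another move. If at that point $\textit{last} = \textit{mx}$, no further move is possible and we return $-1$; otherwise we update $\textit{last}$ to $\textit{mx}$ and add one to the answer.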

After the traversal ends, we return the answer.
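The greedy step can be isolated from the hashing by taking the values of $\textit{f}(i)$ as input. The sketch below assumes a precomputed list `reach` with `reach[i] = f(i)`; the function name `min_moves` is illustrative.

```python
from typing import List


def min_moves(reach: List[int]) -> int:
    """Fewest pieces needed to cover positions 0..n-1, or -1 if impossible.

    reach[i] is the longest valid piece that can start at position i.
    """
    ans = last = mx = 0
    for i in range(len(reach)):
        mx = max(mx, i + reach[i])  # farthest end reachable so far
        if i == last:  # the pieces chosen so far end exactly here
            if i == mx:
                return -1  # nothing extends past i, so the target is stuck
            last = mx
            ans += 1
    return ans
```

For Example 1 the reach values are `[2, 3, 3, 0, 0, 3, 2, 0]`, and the loop commits a new piece at positions 0, 2 and 5, giving 3.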

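The time complexity is $O(n \times \log n + L)$ and the space complexity is $O(n + L)$, where $n$ is the length of $\textit{target}$ and $L$ is the total length of all strings in $\textit{words}$, which equals the number of word prefixes that are hashed.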

from typing import List


class Hashing:
    """Polynomial prefix hashes of a string, for O(1) substring-hash queries."""

    __slots__ = ["mod", "h", "p"]

    def __init__(self, s: str, base: int, mod: int):
        self.mod = mod
        self.h = [0] * (len(s) + 1)  # h[i] = hash of s[:i]
        self.p = [1] * (len(s) + 1)  # p[i] = base**i % mod
        for i in range(1, len(s) + 1):
            self.h[i] = (self.h[i - 1] * base + ord(s[i - 1])) % mod
            self.p[i] = (self.p[i - 1] * base) % mod

    def query(self, l: int, r: int) -> int:
        # Hash of the 1-based inclusive range s[l..r].
        return (self.h[r] - self.h[l - 1] * self.p[r - l + 1]) % self.mod


class Solution:
    def minValidStrings(self, words: List[str], target: str) -> int:
        def f(i: int) -> int:
            # Longest valid string starting at target[i] (0-based), by binary search.
            l, r = 0, min(n - i, m)
            while l < r:
                mid = (l + r + 1) >> 1  # round up so that l = mid makes progress
                sub = hashing.query(i + 1, i + mid)
                if sub in s[mid]:
                    l = mid
                else:
                    r = mid - 1
            return l

        base, mod = 13331, 998244353
        hashing = Hashing(target, base, mod)
        m = max(len(w) for w in words)
        # s[j] holds the hashes of all length-j prefixes of the words.
        s = [set() for _ in range(m + 1)]
        for w in words:
            h = 0
            for j, c in enumerate(w, 1):
                h = (h * base + ord(c)) % mod
                s[j].add(h)
        # Greedy: last = end of the pieces chosen so far, mx = farthest reachable end.
        ans = last = mx = 0
        n = len(target)
        for i in range(n):
            dist = f(i)
            mx = max(mx, i + dist)
            if i == last:
                if i == mx:
                    return -1  # no valid string extends past position i
                last = mx
                ans += 1
        return ans

Complexity analysis

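Metric	Value
Time	O(n log n + L) for the hashing + binary search + greedy approach above, where n is the target length and L is the total length of all words. For comparison, a naive DP that compares every target position against every word prefix costs O(n * m * W), where m is the maximum word length and W is the number of words.
Space	O(n + L) for the prefix hashes of target and the stored word-prefix hashes.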

Follow-up questions interviewers often ask

Setting: interviews at international companies

Mentions state transition dynamic programming and prefix concatenation strategy.
Asks about optimizing substring checks and reducing DP iteration overhead.
Explores edge cases where target cannot be formed or words are very long.

Common pitfalls


Failing to consider all valid prefixes leads to incorrect DP updates.
Using naive substring comparisons results in TLE for large input sizes.
Using a wrong index offset, or forgetting to initialize dp[0] = 0.

Advanced variants


Limit reuse of words to at most once per concatenation, changing DP transitions.
Count all distinct ways to form target instead of minimum number.
Extend to allow words containing wildcards that match multiple characters.


Keep practicing

#3291 Minimum Number of Valid Strings to Form Target I
#3529 Count Cells in Overlapping Horizontal and Vertical Substrings
#3045 Count Prefix and Suffix Pairs II