C ++ vs .NET正则表达式性能

C++ vs .NET regex performance

Propted by a comment from Konrad Rudolph on a related question，I wrote the following program to benchmark regular expression performance in F 350；：

1
2
3
4
5
6
7
8
9

open System.Text.RegularExpressions
let str = System.IO.File.ReadAllText"C:\\Users\\Jon\\Documents\\pg10.txt"
let re = System.IO.File.ReadAllText"C:\\Users\\Jon\\Documents\
e.txt"
for _ in 1..3 do
let timer = System.Diagnostics.Stopwatch.StartNew()
let re = Regex(re, RegexOptions.Compiled)
let res = Array.Parallel.init 4 (fun _ -> re.Split str |> Seq.sumBy (fun m -> m.Length))
printfn"%A %fs" res timer.Elapsed.TotalSeconds

And the equivalent in C++：

ZZU1

两个方案的负荷为单一条纹(我用《圣经》的一份副本)，制作了一个非三维单一条纹规则\w?\w?\w?\w?\w?\w，并将四条条纹分割成平行的平行线，使用的是分割条纹长度的总和(以避免分割)。

运行在视觉工作室(with MP and openmp enabled for the C+)，在释放目标64-bit，the C+takes 43.5s and the F 35s；takes 3.28s(over 13x faster).这一点我并不感到惊讶，因为我相信，在C++STDLIB解释的地方，净JIT编纂了原住民法典，但我喜欢一些同类评论。

复制这个网站码到您的网站上以设置一个投票箱在您的网站上。

编辑：Billy Oneal指出，净可对\w有不同的解释。

1	[0-9A-Za-z_]?[0-9A-Za-z_]?[0-9A-Za-z_]?[0-9A-Za-z_]?[0-9A-Za-z_]?[0-9A-Za-z_]

This actually makes the net code substantially faster(C++is the same)，reducing the time from 3.28s to 2.38s for F ；(over 17x faster).