OK it's difficult to bound well the worst case running time of one my attempt, upper bound for nxn board is O(n^{12}). I tooks 2-10 minutes on levels near 650 ending 8. The attempt fails on thesse levels by about 10 squares. (One of about 10 attempts wents through).
... but the probability going through decreases. For other levels the attempts fail very rarely (3 times alltogether for *7 levels till 677, never for other than *7,*8 levels).
I could decrease the upper bound by chosing better data structure, let us see it will be needed ...
OK

removing logging reduced the attempt time to about 1s

so ...
... I had to increase the stack size for 898 level ...
On levels above 900, fails are by about 40 squares so several hundred attempts are required ...
Level 978 requires more than 10000 attempts ... I wish I have codded it better considering garbage collector requirements ... let us see the level 1000 would be beated overnight (or 2) before I would have time to improve it... (I have one other cheap improvement on mind).
After finishing the level 1000, I have removed all the debug logging and run it once again from level 0. I have mesured time starting after parsing the html, but before the graph is created, I have counted number of required tries.
first level requiring more than 1s (1329ms) was 124, next (1466ms) was 138.
Levels till 200 were getting slower but required less than 3s.
Starting from level 200 times were near 20ms.
First level requiring more than 1 try was 337 requiring 2 tries with solution time 31ms, that repeated on 348. Third level requiring more than 1 try was 398 with 9 tries and solution time 46ms.
First level above 200 requiring more than 1s (1170ms) was level 788 with 60 tries.
Next was 798 with (6084ms) and 227 tries.
Next was 818 with (35365ms) and 491 tries.
828 required (2309ms) and 75 tries.
838 required (2075ms) and 93 tries.
848 required (3276ms) and 133 tries.
858 required (2714ms) and 133 tries.
868 required (5039ms) and 172 tries.
878 required (22922ms) and 436 tries.
888 required (13932ms) and 527 tries.
898 required (430922ms) and 935 tries.
908 required (5026ms) and 185 tries.
918 required (139012ms) and 3807 tries.
928 required (171813ms) and 3030 tries.
938 required (297257ms) and 2553 tries.
948 required (378847ms) and 12224 tries.
968 required (785543ms) and 15395 tries.
978 required (12457897ms) and 88439 tries!!!
988 required (15865ms) and 335 tries.
998 required (1687545ms) and 38999tries.
So for me 978 looked most difficult with 998 close to it.
On first run ...
998 required 13452 tries,
988 required 5806 tries,
978 required 76706 tries,
968 required 3995 tries.
what looks like confirmation of it.
Of course other algorithms could have problems with other graph features ...