Hi authors,
First of all, thank you for your fantastic work and for releasing this valuable benchmark to the community! I really enjoyed reading your paper.
I am currently planning to evaluate my own model on RIVER to compare it with the baselines mentioned in your paper. However, I noticed that the evaluation scripts don't seem to be available in the repository yet.
To ensure a fair and standardized comparison, could you please let me know if there is a plan or timeline to open-source the evaluation code?
Thanks again for your time and your great contribution!