Darknet: Request: Documentation for LSTM

Created on 28 Nov 2019  路  5Comments  路  Source: AlexeyAB/darknet

Hello,
I would like to ask if you could write something on how to use yolo with lstm on sequential data?

It seems yolo has lstm support:
https://github.com/AlexeyAB/darknet/blob/master/cfg/lstm.train.cfg

Is the lstm config only for training(adjusting the weights) and the predictions are done with a standard yolo config?

Thank you very much + Greetings,
Holger

Most helpful comment

But the frames from video 2 are not sequential to frames from video1 but they are listed in the same train.txt. How is this supposed to work - do i need to insert a stop word or do i need to have two train.txt and resume training?

No. Rare context changes are possible.

If an object is present in one frame and occluded from another object in the next frame, i should still label it?

Yes. If you want to detect occluded objects.

Hmm so the "border" between last frame of video1 and first frame of video2 can be ignored and does not have to be treated special(in separate train files for example)?

Yes.

And the labeling of hidden/occlluded object makes only sense if they appear again on later frames i guess?

It makes sense if you want to detect occluded objects.

All 5 comments

Sorry i need to ask again - this is not clear to me.
My current understanding :

Until here its clear i think.
What confusing me most is this statement from you:

train.txt - first 80% of frames (80% from video1 + 80% from video 2, if you use frames from 2 videos)

So in train.txt i list 80% of the frames from video 1
This is followed by 80% of the frames from video 2
But the frames from video 2 are not sequential to frames from video1 but they are listed in the same train.txt. How is this supposed to work - do i need to insert a stop word or do i need to have two train.txt and resume training?

Or do i just think too complicated again and its just fine?

Thank you very much again,
Greetings, Holger

Fine.

Hmm so the "border" between last frame of video1 and first frame of video2 can be ignored and does not have to be treated special(in separate train files for example)?
I am just a bit paranoid as i failed a lot during my last training attempts and want to prevent stuff.

On the other hand if video1 end with a black frame(usually it does) and video2 starts with a black frame its even perfect, so nothing can go wrong there guess.

Thank you both.
And the labeling of hidden/occlluded object makes only sense if they appear again on later frames i guess?

But the frames from video 2 are not sequential to frames from video1 but they are listed in the same train.txt. How is this supposed to work - do i need to insert a stop word or do i need to have two train.txt and resume training?

No. Rare context changes are possible.

If an object is present in one frame and occluded from another object in the next frame, i should still label it?

Yes. If you want to detect occluded objects.

Hmm so the "border" between last frame of video1 and first frame of video2 can be ignored and does not have to be treated special(in separate train files for example)?

Yes.

And the labeling of hidden/occlluded object makes only sense if they appear again on later frames i guess?

It makes sense if you want to detect occluded objects.

Was this page helpful?
0 / 5 - 0 ratings

Related issues

louisondumont picture louisondumont  路  3Comments

off99555 picture off99555  路  3Comments

Greta-A picture Greta-A  路  3Comments

siddharth2395 picture siddharth2395  路  3Comments

qianyunw picture qianyunw  路  3Comments