I propose to create a new table (user_repo_units
) to summarize all permissions a user requires to access any repository; this table should be derived from all the possible permission sources (e.g. team membership, admin status, repo visibility, repo active units, etc.). Any changes in the system that modifies the user's accessibility to any repo should update said table. Then, many queries can use the table for user access checking instead of a complex set of golang-side code.
The table would have a structure like:
type UserRepoUnits struct {
UserID int64 `xorm:"pk"`
RepoID int64 `xorm:"pk INDEX"`
Type UnitType
Mode AccessMode
}
Units accessible for all users (e.g. UnitTypeCode
on public repositories) will not have a record for each user but one for the user 0
instead (may be -1
is better?). This should reduce the number of records in the table considerably.
The table can then be used in queries. For example, to get all open issues (UnitTypeIssues
= 2) the user 1234
can see (AccessModeRead
= 1), we could do:
select issue.*
from issue
where (select max(mode) from user_repo_units
where user_repo_units.repository_id = issue.repository_id
and user_repo_units.user_id in (1234, 0)
and user_repo_units.type = 2) >= 1;
and issue.is_closed = false;
No team membership check required, no ownership or admin check required. All in the same (pretty much standardized) query.
And we could also support collabrators unit permissions if this. Currently only organization's repositories could have unit permissions, but not individual's.
And this may result in the inconsistent between team_unit and user_repo_units. But even that, I also support to add the extra table to do that. We have to be careful of the consistent of database.
Couldn't we define a view wich will be optimize by the SQL engine instead of a table that need to be tightly kept uptodate ?
@sapk That would be a very complex view and will not likely be much optimizable. Views only translate as subqueries; they don't have any kind of pre-processing.
There's however the concept of a materialized view (a.k.a. indexed view), which is a read-only table automatically maintained by the rdbms based on a view definition. Those would be very convenient here, but I don't think they're available on all of our supported databases (not in sqlite3) and they have some important limitations and caveats as well, like not being immediately updated (e.g. in PostgreSQL a REFRESH MATERIALIZED VIEW mymatview;
statement must be ran in order to update its contents).
This SO answer explains how views can improve a query performance, but only if they are indexed (materialized) views.
@guillep2k The optimization (cache) could be done on gitea part like using xorm cache over the view and clearing it when rights are changed. I am not against the table, I just fear that it would become a hazardous code to maintain so if it could be a more simple solution it would be better. In fact, we can go first with the table and later move to a view and just remove the code that updated the table.
Most helpful comment
And we could also support collabrators unit permissions if this. Currently only organization's repositories could have unit permissions, but not individual's.
And this may result in the inconsistent between team_unit and user_repo_units. But even that, I also support to add the extra table to do that. We have to be careful of the consistent of database.