Gregory Soutade
adc04bf753
Update iwla :
* Rework arg variable management
* Manage dry run at top level
* 'robot' property is now None by default (allows the analysis to be done only once)
* Add --disable-display option
2023-03-11 20:51:44 +01:00
Gregory Soutade
a0a1f42df4
Update robot detection plugin :
* Run analysis only once per month
* Reactivate rule : no page view if count_hit_only_visitors is False
* Add exception for "Less than 1 hit per page" rule if a phone is used
* Check for all error codes in 400..499, not only 403 and 404
* Referer '-' now counted as null
2023-03-11 20:48:17 +01:00
Gregory Soutade
31bc67ceba
Replace feed referers by feed user agent
2023-03-11 20:42:56 +01:00
Gregory Soutade
3fdbc282c8
Remove feed parser detection by referer
2023-03-11 20:42:37 +01:00
Gregory Soutade
c8dfdd17f7
Add "compatible" as a criterion for robots
2023-02-18 08:49:14 +01:00
Gregory Soutade
a5bef4ece6
Search for "compatible" in all requests, not only the first one
2023-02-18 08:48:57 +01:00
Gregory Soutade
21a21cd68f
Add a new rule for robots : 1 page and 1 hit, but not from the same source
2023-02-04 08:40:04 +01:00
Gregory Soutade
185664850d
Add subdomains plugin
2023-01-28 09:44:43 +01:00
Gregory Soutade
6a4fd4e9c8
New rule for robot : more than 10 not modified pages in a row
2023-01-28 09:40:26 +01:00
Gregory Soutade
ac246eabe2
Find robot name in 'compatible' string and group them
2023-01-28 09:38:59 +01:00
Gregory Soutade
9c57ad3ece
Feeds : display last access date for merged feed parsers
2023-01-28 09:36:48 +01:00
Gregory Soutade
3a8c667fdc
Feeds display: Add "*" after a space in order to have flags
2023-01-28 09:35:48 +01:00
Gregory Soutade
3117aebac0
Add all_visits_enlight display plugin
2022-11-20 10:15:13 +01:00
Gregory Soutade
82993afbce
Show last access information for feeds parsers
2022-11-20 10:12:16 +01:00
Gregory Soutade
975cc66bd5
Don't launch robot analysis rules for feed parsers
2022-11-16 21:10:11 +01:00
Gregory Soutade
f8b37a625c
Rework feed merge algorithm. Allow merging feeds based on a name regular expression with the merge_feeds_parsers_list conf value
2022-11-16 21:09:50 +01:00
Gregory Soutade
242bb6cabe
Fix error in None handling of unknown city/country in filtered users
2022-11-16 21:09:36 +01:00
Gregory Soutade
cce8c75118
Display viewed and not viewed data for feeds (can be set as a robot too late)
2022-11-16 21:09:23 +01:00
Gregory Soutade
ad01b48898
Replace merge_one_hit_only_feeds_parsers with merge_feeds_parsers and set it to False by default
2022-11-12 19:00:14 +01:00
928e4eea5a
Move iptogeo.py util into misc
2022-11-10 20:14:18 +01:00
Gregory Soutade
41432046a1
Add Anonymize IP plugin
2022-11-10 20:06:00 +01:00
Gregory Soutade
b93c4bf470
Remove browser condition for filter users Geo IP
2022-11-10 20:04:56 +01:00
Gregory Soutade
2aad389e2b
Fix message for all visits and top countries
2022-11-10 20:04:45 +01:00
Gregory Soutade
6ce0374fe7
Display external referers in filter_users plugin
2022-11-07 21:13:49 +01:00
310fbcdb07
Remove deprecated track users plugin
2022-11-04 20:14:22 +01:00
Gregory Soutade
c0fc5a0131
Add user agent in filter users display and limit filtered users to 10 in index
2022-11-04 20:04:08 +01:00
Gregory Soutade
7ec7e0d644
Add Geo IP location in filter users
2022-11-04 20:03:52 +01:00
Gregory Soutade
c84373b2d7
Filter users filter can now be a function defined in conf
2022-11-04 20:03:33 +01:00
Gregory Soutade
4d3c2107f0
Don't save all visitors' requests into database (saves space and computation). Can be changed in default_conf.py with the keep_requests value
2022-06-23 21:16:30 +02:00
Gregory Soutade
9cee1fa71c
Move Filter Users plugin into pre_analysis (and keep display part)
2022-06-23 21:14:58 +02:00
5130b1f6d8
Bad 2to3 Python conversion : map() results need to be wrapped in list(). If not, they can only be iterated (analyzed) once
2021-08-06 08:45:04 +02:00
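The map()/list() problem named in the commit above can be reproduced with a minimal sketch. The variable names are illustrative only and are not taken from the iwla sources; the sketch only shows the general Python 3 behaviour, where map() returns a one-shot iterator instead of a list.

```python
# Python 3: map() returns a lazy iterator that can be consumed only once.
values = ["1", "2", "3"]  # hypothetical data, not from iwla

lazy = map(int, values)
first_pass = sum(lazy)    # consumes the iterator
second_pass = sum(lazy)   # iterator is already exhausted, nothing left to sum

# Wrapping the result in list() materializes it, so it can be reused.
eager = list(map(int, values))
first_total = sum(eager)
second_total = sum(eager)

print(first_pass, second_pass, first_total, second_total)
```

With the list() wrapper, every later pass over the data sees all elements again, which matches the Python 2 behaviour that the 2to3 conversion silently changed.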
Gregory Soutade
937706d88d
Fix typo
2021-06-04 07:51:15 +02:00
Gregory Soutade
0c2ac431d1
Be more strict with robots : requires at least 1 hit per viewed page
2021-06-03 08:52:04 +02:00
Gregory Soutade
4cd7712201
Fix feeds re and add 'feeds_referers' configuration
2021-06-03 08:50:46 +02:00
Gregory Soutade
8697be26ad
Update feed detector : check 'feed', 'rss' or 'atom' string in user agent
2021-04-01 08:22:52 +02:00
Gregory Soutade
0602d3ce4b
Fix little errors
2020-12-11 12:18:21 +01:00
a6f8c71bf2
Add parameter to do strToBytes in BlockTable and fix some errors with string and int
2020-12-11 09:18:50 +01:00
141f2691db
Replace track_users by filter_users plugin
2020-12-09 21:12:19 +01:00
c0e3e0a801
Update code for Python3
2020-12-09 13:24:29 +01:00
f457f4e390
Update code for Python3
2020-10-30 14:42:56 +01:00
6f9622bb91
Bugfix : iptogeo : _send_request, which must handle timeout errors (empty packets), was not used
2020-05-11 10:03:07 +02:00
Gregory Soutade
bb268114b2
Make backup before compressing (low memory servers)
Fix error : Call post hook plugins even in display only mode
Don't compute unordered hits (remove past ones if they are found after the current one)
Remove tags in stats diff
Don't do geolocation if visitor is not valid
Don't try to find search engine on robots
Update robot check rules
Add top_pages_diff plugin
2019-08-30 07:50:54 +02:00
Gregory Soutade
8255be7ec8
Get GEO code for all visits (not only valid ones)
2017-09-06 07:43:55 +02:00
Gregory Soutade
9f9bfebe7a
ip_to_geo : reset connection after 50 requests
2017-09-05 07:34:27 +02:00
Gregory Soutade
007be71ad6
New format for (not_)viewed pages/hits and bandwidth, which are now recorded by day (in a dictionary where only element 0 is initialized). Element 0 is the total. WARNING : not backward compatible with previous databases.
2017-08-24 07:55:53 +02:00
Gregory Soutade
3b3ae1ea3e
Fix KeyError : geo attribute may not exist
2017-08-23 20:10:15 +02:00
Gregory Soutade
68a67adecc
Add one more rule to robot detection : more than ten 404 pages viewed
2017-05-25 21:04:18 +02:00
Gregory Soutade
038e485081
Prevent errors when decoding key phrases
2017-01-29 09:12:47 +01:00
Gregory Soutade
dc4695f539
Add robot bandwidth display plugin
2016-09-25 20:38:49 +02:00
e805e59c10
Remove crawler from feed parsers
2016-08-20 13:08:02 +02:00