142 Commits

Author SHA1 Message Date
Gregory Soutade
adc04bf753 Update iwla :
  * Rework arg variable management
  * Manage dry run at top level
  * 'robot' property is now None by default (allows analysis to be done only once)
  * Add --disable-display option
2023-03-11 20:51:44 +01:00
Gregory Soutade
a0a1f42df4 Update robot detection plugin :
  * Analyze only once per month
  * Reactivate rule : no page view if count_hit_only_visitors is False
  * Add exception for "Less than 1 hit per page" rule if a phone is used
  * Check for all error codes in 400..499, not only 403 and 404
  * Referer '-' now counted as null
2023-03-11 20:48:17 +01:00
Gregory Soutade
31bc67ceba Replace feed referers by feed user agent 2023-03-11 20:42:56 +01:00
Gregory Soutade
3fdbc282c8 Remove feed parser detection by referer 2023-03-11 20:42:37 +01:00
Gregory Soutade
c8dfdd17f7 Add "compatible" as a criterion for robot 2023-02-18 08:49:14 +01:00
Gregory Soutade
a5bef4ece6 Search for "compatible" in all requests, not only the first one 2023-02-18 08:48:57 +01:00
Gregory Soutade
21a21cd68f Add a new rule for robots : 1 page and 1 hit, but not from the same source 2023-02-04 08:40:04 +01:00
Gregory Soutade
185664850d Add subdomains plugin 2023-01-28 09:44:43 +01:00
Gregory Soutade
6a4fd4e9c8 New rule for robot : more than 10 not modified pages in a row 2023-01-28 09:40:26 +01:00
Gregory Soutade
ac246eabe2 Find robot name in 'compatible' string and group them 2023-01-28 09:38:59 +01:00
Gregory Soutade
9c57ad3ece Feeds : display last access date for merged feed parsers 2023-01-28 09:36:48 +01:00
Gregory Soutade
3a8c667fdc Feeds display: Add "*" after a space in order to have flags 2023-01-28 09:35:48 +01:00
Gregory Soutade
3117aebac0 Add all_visits_enlight display plugin 2022-11-20 10:15:13 +01:00
Gregory Soutade
82993afbce Show last access information for feeds parsers 2022-11-20 10:12:16 +01:00
Gregory Soutade
975cc66bd5 Don't launch robot analysis rules for feed parsers 2022-11-16 21:10:11 +01:00
Gregory Soutade
f8b37a625c Rework feed merge algorithm. Allow to merge feeds based on name regular expression with merge_feeds_parsers_list conf value 2022-11-16 21:09:50 +01:00
Gregory Soutade
242bb6cabe Fix error in None handling of unknown city/country in filtered users 2022-11-16 21:09:36 +01:00
Gregory Soutade
cce8c75118 Display viewed and not viewed data for feeds (can be set as a robot too late) 2022-11-16 21:09:23 +01:00
Gregory Soutade
ad01b48898 Replace merge_one_hit_only_feeds_parsers with merge_feeds_parsers and set it to False by default 2022-11-12 19:00:14 +01:00
928e4eea5a Move iptogeo.py util into misc 2022-11-10 20:14:18 +01:00
Gregory Soutade
41432046a1 Add Anonymize IP plugin 2022-11-10 20:06:00 +01:00
Gregory Soutade
b93c4bf470 Remove browser condition for filter users Geo IP 2022-11-10 20:04:56 +01:00
Gregory Soutade
2aad389e2b Fix message for all visits and top countries 2022-11-10 20:04:45 +01:00
Gregory Soutade
6ce0374fe7 Display external referers in filter_users plugin 2022-11-07 21:13:49 +01:00
310fbcdb07 Remove deprecated track users plugin 2022-11-04 20:14:22 +01:00
Gregory Soutade
c0fc5a0131 Add user agent in filter users display and limit filtered users to 10 in index 2022-11-04 20:04:08 +01:00
Gregory Soutade
7ec7e0d644 Add Geo IP location in filter users 2022-11-04 20:03:52 +01:00
Gregory Soutade
c84373b2d7 Filter users filter can now be a function defined in conf 2022-11-04 20:03:33 +01:00
Gregory Soutade
4d3c2107f0 Don't save all visitors requests into database (saves space and computation). Can be changed in default_conf.py with keep_requests value 2022-06-23 21:16:30 +02:00
Gregory Soutade
9cee1fa71c Move Filter Users plugin into pre_analysis (and keep display part) 2022-06-23 21:14:58 +02:00
5130b1f6d8 Bad 2to3 python conversion : map() results need to be wrapped in list(). If not, they're only analyzed once 2021-08-06 08:45:04 +02:00
Gregory Soutade
937706d88d Fix typo 2021-06-04 07:51:15 +02:00
Gregory Soutade
0c2ac431d1 Be more strict with robots : requires at least 1 hit per viewed page 2021-06-03 08:52:04 +02:00
Gregory Soutade
4cd7712201 Fix feeds re and add 'feeds_referers' configuration 2021-06-03 08:50:46 +02:00
Gregory Soutade
8697be26ad Update feed detector : check 'feed', 'rss' or 'atom' string in user agent 2021-04-01 08:22:52 +02:00
Gregory Soutade
0602d3ce4b Fix little errors 2020-12-11 12:18:21 +01:00
a6f8c71bf2 Add parameter to do strToBytes in BlockTable and fix some errors with string and int 2020-12-11 09:18:50 +01:00
141f2691db Replace track_users by filter_users plugin 2020-12-09 21:12:19 +01:00
c0e3e0a801 Update code for Python3 2020-12-09 13:24:29 +01:00
f457f4e390 Update code for Python3 2020-10-30 14:42:56 +01:00
6f9622bb91 Bugfix : iptogeo : _send_request, which must handle timeout errors (empty packets), was not used 2020-05-11 10:03:07 +02:00
Gregory Soutade
bb268114b2 Make backup before compressing (low memory servers)
Fix error : Call post hook plugins even in display only mode
Don't compute unordered hits (remove pasts if they are found after current)
Remove tags in stats diff
Don't do geolocalisation if visitor is not valid
Don't try to find search engine on robots
Update robot check rules
Add top_pages_diff plugin
2019-08-30 07:50:54 +02:00
Gregory Soutade
8255be7ec8 Get GEO code for all visits (not only valid ones) 2017-09-06 07:43:55 +02:00
Gregory Soutade
9f9bfebe7a ip_to_geo : reset connection after 50 requests 2017-09-05 07:34:27 +02:00
Gregory Soutade
007be71ad6 New format for (not_)viewed pages/hits and bandwidth that are now recorded by day (in a dictionary where only element 0 is initialized). Element 0 is the total. WARNING : not backward compatible with previous databases. 2017-08-24 07:55:53 +02:00
Gregory Soutade
3b3ae1ea3e Fix KeyError : geo attribute may not exist 2017-08-23 20:10:15 +02:00
Gregory Soutade
68a67adecc Add one more rule to robot detection : more than ten 404 pages viewed 2017-05-25 21:04:18 +02:00
Gregory Soutade
038e485081 Prevent error when decoding key phrases 2017-01-29 09:12:47 +01:00
Gregory Soutade
dc4695f539 Add robot bandwidth display plugin 2016-09-25 20:38:49 +02:00
e805e59c10 Remove crawler from feed parsers 2016-08-20 13:08:02 +02:00